2104.14294
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation
Chang Liu
Giulia Rizzoli
Pietro Zanuttigh
Fu Li
Yi Niu
CLL
122
2
0
18 Jul 2024
Multi-sentence Video Grounding for Long Video Generation
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Wenwu Zhu
DiffM
71
0
0
18 Jul 2024
OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping
Meng Li
Qi Zhao
Shuchang Lyu
Chunlei Wang
Yujing Ma
Guangliang Cheng
Chenguang Yang
113
5
0
18 Jul 2024
Audio-visual Generalized Zero-shot Learning the Easy Way
Shentong Mo
Pedro Morgado
61
5
0
18 Jul 2024
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
Carlos Hinojosa
Shuming Liu
Guohao Li
67
2
0
17 Jul 2024
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
Yibin Yan
Weidi Xie
RALM
141
14
0
17 Jul 2024
InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction
Xulong Wang
Siyan Dong
Youyi Zheng
Yanchao Yang
91
1
0
17 Jul 2024
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks
Antoni Kowalczuk
Jan Dubiński
Atiyeh Ashari Ghomi
Yi Sui
George Stein
Jiapeng Wu
Jesse C. Cresswell
Franziska Boenisch
Adam Dziedzic
SSL
AAML
75
3
0
17 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
90
1
0
17 Jul 2024
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Mengcheng Lan
Chaofeng Chen
Yiping Ke
Xinjiang Wang
Xue Jiang
Wayne Zhang
VLM
117
29
0
17 Jul 2024
GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features
Luc P.J. Strater
Mohammadreza Salehi
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
83
9
0
17 Jul 2024
R+X: Retrieval and Execution from Everyday Human Videos
Georgios Papagiannis
Norman Di Palo
Pietro Vitiello
Edward Johns
144
18
0
17 Jul 2024
Generalized Coverage for More Robust Low-Budget Active Learning
Wonho Bae
Junhyug Noh
Danica J. Sutherland
129
4
0
16 Jul 2024
A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification
Markus Marks
Manuel Knott
Neehar Kondapaneni
Elijah Cole
T. Defraeye
Fernando Pérez-Cruz
Pietro Perona
SSL
125
5
0
16 Jul 2024
CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning
Erum Mushtaq
D. Yaldiz
Yavuz Faruk Bakman
Jie Ding
Chenyang Tao
Dimitrios Dimitriadis
A. Avestimehr
CLL
85
1
0
16 Jul 2024
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
Yanting Miao
William Loh
Suraj Kothawade
Pascal Poupart
Abdullah Rashwan
Yeqing Li
EGVM
57
5
0
16 Jul 2024
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi
H. Song
Jaechul Kim
Taehyeong Kim
Hoseok Do
3DGS
105
23
0
16 Jul 2024
Rate-Distortion-Cognition Controllable Versatile Neural Image Compression
Jinming Liu
Ruoyu Feng
Yunpeng Qi
Qiuyu Chen
Zhibo Chen
Wenjun Zeng
Xin Jin
89
2
0
16 Jul 2024
DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training
Guillermo Jiménez-Pérez
Pedro Osório
Josef Cersovsky
Javier Montalt-Tordera
Jens Hooge
Steffen Vogler
Sadegh Mohammadi
MedIm
94
2
0
16 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
91
6
0
16 Jul 2024
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Yanqin Jiang
Chaohui Yu
Chenjie Cao
Fan Wang
Weiming Hu
Jin Gao
VGen
80
19
0
16 Jul 2024
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
Nir Barel
Ron Shapira Weber
Nir Mualem
Shahaf E. Finder
Oren Freifeld
171
2
0
16 Jul 2024
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Ioannis Maniadis Metaxas
Georgios Tzimiropoulos
Ioannis Patras
SSL
104
0
0
15 Jul 2024
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences
Soroush Mehraban
Mohammad Javad Rajabi
Babak Taati
3DPC
102
1
0
15 Jul 2024
DINO Pre-training for Vision-based End-to-end Autonomous Driving
Shubham Juneja
P. Daniušis
Virginijus Marcinkevičius
93
2
0
15 Jul 2024
Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models
Rining Wu
Feixiang Zhou
Ziwei Yin
Jian K. Liu
70
0
0
15 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
96
0
0
15 Jul 2024
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Mang Ning
A. A. Salah
Itir Onal Ertugrul
CVBM
178
5
0
15 Jul 2024
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels
Yinsheng He
Xingyu Li
Roger J. Zemp
VLM
98
0
0
14 Jul 2024
A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification
Sreeraj Ramachandran
A. Rattani
74
1
0
14 Jul 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
110
1
0
14 Jul 2024
CLOVER: Context-aware Long-term Object Viewpoint- and Environment- Invariant Representation Learning
Dongmyeong Lee
Amanda Adkins
Joydeep Biswas
99
0
0
12 Jul 2024
3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
Anh Thai
Weiyao Wang
Hao Tang
Stefan Stojanov
Matt Feiszli
James M. Rehg
3DPC
104
6
0
12 Jul 2024
StyleSplat: 3D Object Style Transfer with Gaussian Splatting
Sahil Jain
Avik Kuthiala
P. Sethi
Prakanshul Saxena
3DGS
74
5
0
12 Jul 2024
iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning
Tom Fischer
Yaoyao Liu
Artur Jesslen
Noor Ahmed
Prakhar Kaushik
Angtian Wang
Alan Yuille
Adam Kortylewski
Eddy Ilg
CLL
AI4CE
81
1
0
12 Jul 2024
On the Role of Discrete Tokenization in Visual Representation Learning
Tianqi Du
Yifei Wang
Yisen Wang
101
7
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
134
5
0
12 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
81
6
0
12 Jul 2024
Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT
Jie Zheng
Ru Wen
Haiqin Hu
Lina Wei
Kui Su
Wei Chen
Chen Liu
Jun Wang
91
1
0
12 Jul 2024
Bora: Biomedical Generalist Video Generation Model
Weixiang Sun
Xiaocao You
Ruizhe Zheng
Zhengqing Yuan
Xiang Li
Lifang He
Quanzheng Li
Lichao Sun
VGen
MedIm
83
9
0
12 Jul 2024
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
64
1
0
11 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
VLM
85
2
0
11 Jul 2024
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports
Yanfu Yan
Nathan Cooper
Oscar Chaparro
Kevin Moran
Denys Poshyvanyk
87
8
0
11 Jul 2024
Paving the way toward foundation models for irregular and unaligned Satellite Image Time Series
Iris Dumeur
Silvia Valero
Jordi Inglada
111
3
0
11 Jul 2024
WildGaussians: 3D Gaussian Splatting in the Wild
Jonáš Kulhánek
Songyou Peng
Zuzana Kukelova
Marc Pollefeys
Torsten Sattler
3DGS
161
51
0
11 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
114
16
0
11 Jul 2024
Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks
Pu Feng
Junkang Liang
Size Wang
Xin Yu
Xin Ji
Yiting Chen
Kui Zhang
Rongye Shi
Wenjun Wu
122
7
0
11 Jul 2024
Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search
Kirill Paramonov
Jia-Xing Zhong
Umberto Michieli
J. Moon
Mete Ozay
121
2
0
10 Jul 2024
Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder
Kun-Hsuan Wu
Zhiguo Jiang
Kunming Tang
Jun Shi
Fengying Xie
Wei Wang
Haibo Wu
Yushan Zheng
43
1
0
10 Jul 2024
Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining
Tianfang Sun
Zhizhong Zhang
Xin Tan
Yanyun Qu
Yuan Xie
109
0
0
10 Jul 2024