Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale
Ziyun Zeng
Yixiao Ge
Zhan Tong
Xihui Liu
Shutao Xia
Ying Shan
82
9
0
23 May 2023
Federated Generalized Category Discovery
Nan Pu
Zhun Zhong
Xinyuan Ji
N. Sebe
FedML
79
14
0
23 May 2023
Weakly Supervised 3D Open-vocabulary Segmentation
Kunhao Liu
Fangneng Zhan
Jiahui Zhang
Muyu Xu
Yingchen Yu
Abdulmotaleb El Saddik
Christian Theobalt
Eric P. Xing
Shijian Lu
125
70
0
23 May 2023
A Study on Deep CNN Structures for Defect Detection From Laser Ultrasonic Visualization Testing Images
Miya Nakajima
Takahiro Saitoh
Tsuyoshi Kato
64
2
0
23 May 2023
NORM: Knowledge Distillation via N-to-One Representation Matching
Xiaolong Liu
Lujun Li
Chao Li
Anbang Yao
113
71
0
23 May 2023
Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation
Haochen Wang
Yujun Shen
Jingjing Fei
Wei Li
Liwei Wu
Yuxi Wang
Zhaoxiang Zhang
OOD
101
7
0
23 May 2023
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training
Utku Ozbulak
Hyun Jung Lee
Beril Boga
Esla Timothy Anzaku
Ho-min Park
Arnout Van Messem
W. D. Neve
J. Vankerschaver
DiffM
107
38
0
23 May 2023
Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters
Isaac Corley
Caleb Robinson
Rahul Dodhia
J. L. Ferres
Peyman Najafirad
122
19
0
22 May 2023
Materialistic: Selecting Similar Materials in Images
Prafull Sharma
Julien Philip
Michael Gharbi
Bill Freeman
F. Durand
Valentin Deschaintre
DiffM
99
18
0
22 May 2023
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Bernt Schiele
Dengxin Dai
ViT
85
32
0
22 May 2023
You Only Look at One: Category-Level Object Representations for Pose Estimation From a Single Example
Walter Goodwin
Ioannis Havoutis
Ingmar Posner
67
10
0
22 May 2023
Unsupervised Multi-view Pedestrian Detection
Mengyin Liu
Chao Zhu
Shiqi Ren
Xu-Cheng Yin
133
6
0
21 May 2023
What Makes for Good Visual Tokenizers for Large Language Models?
Guangzhi Wang
Yixiao Ge
Xiaohan Ding
Mohan S. Kankanhalli
Ying Shan
MLLM
VLM
96
39
0
20 May 2023
Annealing Self-Distillation Rectification Improves Adversarial Training
Yuehua Wu
Hung-Jui Wang
Shang-Tse Chen
AAML
104
5
0
20 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
179
103
0
19 May 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
Aran Nayebi
R. Rajalingham
M. Jazayeri
G. R. Yang
82
20
0
19 May 2023
MALM: Mask Augmentation based Local Matching for Food-Recipe Retrieval
Bhanu Prakash Voutharoja
Peng Wang
Lei Wang
Vivienne Guan
69
6
0
18 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
126
47
0
18 May 2023
Going Denser with Open-Vocabulary Part Segmentation
Pei Sun
Shoufa Chen
Chenchen Zhu
Fanyi Xiao
Ping Luo
Saining Xie
Zhicheng Yan
ObjD
VLM
115
49
0
18 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
151
122
0
18 May 2023
Annotation-free Audio-Visual Segmentation
Jinxian Liu
Yu Wang
Chen Ju
Chaofan Ma
Ya Zhang
Weidi Xie
VOS
VLM
109
30
0
18 May 2023
HMSN: Hyperbolic Self-Supervised Learning by Clustering with Ideal Prototypes
A. Durrant
Georgios Leontidis
SSL
71
4
0
18 May 2023
Tuned Contrastive Learning
Chaitanya Animesh
Manmohan Chandraker
SSL
45
0
0
18 May 2023
OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields
Youtan Yin
Zhoujie Fu
Fan Yang
Guosheng Lin
117
30
0
17 May 2023
CLIP-GCD: Simple Language Guided Generalized Category Discovery
Rabah Ouldnoughi
Chia-Wen Kuo
Z. Kira
VLM
82
14
0
17 May 2023
Cold PAWS: Unsupervised class discovery and addressing the cold-start problem for semi-supervised learning
Evelyn J. Mannix
H. Bondell
SSL
36
0
0
17 May 2023
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
Alexander H. Liu
Heng-Jui Chang
Michael Auli
Wei-Ning Hsu
James R. Glass
90
26
0
17 May 2023
Online Continual Learning Without the Storage Constraint
Ameya Prabhu
Zhipeng Cai
P. Dokania
Philip Torr
V. Koltun
Ozan Sener
CLL
174
32
0
16 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
102
34
0
15 May 2023
Learning Better Contrastive View from Radiologist's Gaze
Sheng Wang
Zixu Zhuang
Xi Ouyang
Lichi Zhang
Zheren Li
Chong Ma
Tianming Liu
Dinggang Shen
Qian Wang
MedIm
66
2
0
15 May 2023
AutoRecon: Automated 3D Object Discovery and Reconstruction
Yuang Wang
Xingyi He He
Sida Peng
Haotong Lin
Hujun Bao
Xiaowei Zhou
73
12
0
15 May 2023
Improved baselines for vision-language pre-training
Enrico Fini
Pietro Astolfi
Adriana Romero Soriano
Jakob Verbeek
M. Drozdzal
SSL
CLIP
VLM
128
23
0
15 May 2023
Fast Traversability Estimation for Wild Visual Navigation
Jonas Frey
Matías Mattamala
Nived Chebrolu
Cesar Cadena
Maurice F. Fallon
Marco Hutter
126
68
0
15 May 2023
Component-aware anomaly detection framework for adjustable and logical industrial visual inspection
Tongkun Liu
Bing Li
Xiao Du
Bingke Jiang
Xiao Jin
Liuyi Jin
Zhu Zhao
81
29
0
15 May 2023
Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation
Fangwen Wu
Jingxuan He
Yufei Yin
Y. Hao
Gang Huang
Lechao Cheng
ISeg
85
6
0
15 May 2023
PLIP: Language-Image Pre-training for Person Representation Learning
Jia-li Zuo
Jiahao Hong
Feng Zhang
Changqian Yu
Hanyu Zhou
Changxin Gao
Nong Sang
Jingdong Wang
VLM
MLLM
132
38
0
15 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
102
100
0
14 May 2023
Meta-DM: Applications of Diffusion Models on Few-Shot Learning
W. Hu
Xiurong Jiang
Jiarun Liu
Yuqi Yang
Hui Tian
DiffM
80
7
0
14 May 2023
Consistency Regularization for Domain Generalization with Logit Attribution Matching
Han Gao
Kaican Li
Weiyan Xie
Zhi Lin
Yongxiang Huang
Luning Wang
Caleb Chen Cao
N. Zhang
82
2
0
13 May 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Amr Abdelraouf
Kyungtae Han
Rohit Gupta
Ziran Wang
72
11
0
13 May 2023
CLIP-Count: Towards Text-Guided Zero-Shot Object Counting
Ruixia Jiang
Lin Liu
Changan Chen
VLM
121
72
0
12 May 2023
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers
Dahun Kim
A. Angelova
Weicheng Kuo
ObjD
ViT
VLM
84
80
0
11 May 2023
Hyperbolic Deep Learning in Computer Vision: A Survey
Pascal Mettes
Mina Ghadimi Atigh
Martin Keller-Ressel
Jeffrey Gu
Serena Yeung
119
44
0
11 May 2023
Text-To-Concept (and Back) via Cross-Model Alignment
Mazda Moayeri
Keivan Rezaei
Maziar Sanjabi
Soheil Feizi
CLIP
75
44
0
10 May 2023
Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery
Bingchen Zhao
Xin Wen
Kai Han
68
58
0
10 May 2023
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Chenghao Li
Chaoning Zhang
Atish Waghwase
Lik-Hang Lee
François Rameau
Yang Yang
Sung-Ho Bae
Choong Seon Hong
104
78
0
10 May 2023
ImageBind: One Embedding Space To Bind Them All
Rohit Girdhar
Alaaeldin El-Nouby
Zhuang Liu
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
VLM
207
944
0
09 May 2023
Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval
Shiyin Dong
Mingrui Zhu
N. Wang
Xinbo Gao
VLM
81
3
0
09 May 2023
Model-Contrastive Federated Domain Adaptation
Chang’an Yi
Haotian Chen
Yonghui Xu
Yifan Zhang
MedIm
FedML
59
0
0
07 May 2023
PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos
Zhiqiang Shen
Xiaoxiao Sheng
Longguang Wang
Y. Guo
Qiong Liu
Xiaoping Zhou
3DPC
SSL
77
15
0
06 May 2023
Previous
1
2
3
...
58
59
60
...
82
83
84
Next