Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
133
97
0
18 Nov 2022
Invariant Learning via Diffusion Dreamed Distribution Shifts
Priyatham Kattakinda
Alexander Levine
Soheil Feizi
DiffM
62
10
0
18 Nov 2022
Explanation on Pretraining Bias of Finetuned Vision Transformer
Bumjin Park
Jaesik Choi
ViT
70
1
0
18 Nov 2022
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
Zongshang Pang
Yuta Nakashima
Mayu Otani
Hajime Nagahara
45
6
0
18 Nov 2022
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViT
MedIm
88
36
0
18 Nov 2022
Weighted Ensemble Self-Supervised Learning
Yangjun Ruan
Saurabh Singh
Warren Morningstar
Alexander A. Alemi
Sergey Ioffe
Ian S. Fischer
Joshua V. Dillon
FedML
83
16
0
18 Nov 2022
Self-Supervised Visual Representation Learning via Residual Momentum
T. Pham
Axi Niu
Zhang Kang
Sultan Rizky Hikmawan Madjid
Jiajing Hong
Daehyeok Kim
Joshua Tian Jin Tee
Chang D. Yoo
SSL
89
6
0
17 Nov 2022
Data-Centric Debugging: mitigating model failures via targeted data collection
Sahil Singla
Atoosa Malemir Chegini
Mazda Moayeri
Soheil Feiz
94
4
0
17 Nov 2022
CAE v2: Context Autoencoder with CLIP Target
Xinyu Zhang
Jiahui Chen
Junkun Yuan
Qiang Chen
Jian Wang
...
Jimin Pi
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
VLM
CLIP
109
24
0
17 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
122
113
0
17 Nov 2022
How to Fine-Tune Vision Models with SGD
Ananya Kumar
Ruoqi Shen
Sébastien Bubeck
Suriya Gunasekar
VLM
127
31
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
61
21
0
16 Nov 2022
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Tianhong Li
Huiwen Chang
Shlok Kumar Mishra
Han Zhang
Dina Katabi
Dilip Krishnan
90
170
0
16 Nov 2022
Label-Efficient Object Detection via Region Proposal Network Pre-Training
Nanqing Dong
Linus Ericsson
Yongxin Yang
A. Leonardis
Jingyu Sun
ObjD
89
5
0
16 Nov 2022
Stare at What You See: Masked Image Modeling without Reconstruction
Hongwei Xue
Peng Gao
Hongyang Li
Yu Qiao
Hao Sun
Houqiang Li
Jiebo Luo
68
32
0
16 Nov 2022
Masked Reconstruction Contrastive Learning with Information Bottleneck Principle
Ziwen Liu
Bonan li
Congying Han
Tiande Guo
Xuecheng Nie
SSL
64
2
0
15 Nov 2022
Self-supervised remote sensing feature learning: Learning Paradigms, Challenges, and Future Works
Chao Tao
Ji Qi
Mingning Guo
Qing Zhu
Haifeng Li
SSL
104
59
0
15 Nov 2022
DeS3: Adaptive Attention-driven Self and Soft Shadow Removal using ViT Similarity
Yeying Jin
W. Ye
Wenhan Yang
Yuan. Yuan
R. Tan
DiffM
148
28
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
249
729
0
14 Nov 2022
Language models are good pathologists: using attention-based sequence reduction and text-pretrained transformers for efficient WSI classification
Juan Pisula
Katarzyna Bozek
VLM
MedIm
83
3
0
14 Nov 2022
SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation
Yi Wang
Nassim Ait Ali Braham
Zhitong Xiong
Chenying Liu
C. Albrecht
Xiao Xiang Zhu
103
73
0
13 Nov 2022
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection
Silvio Galesso
Max Argus
Thomas Brox
UQCV
96
11
0
12 Nov 2022
MARLIN: Masked Autoencoder for facial video Representation LearnINg
Zhixi Cai
Shreya Ghosh
Kalin Stefanov
Abhinav Dhall
Jianfei Cai
Hamid Rezatofighi
Reza Haffari
Munawar Hayat
ViT
CVBM
114
62
0
12 Nov 2022
Masked Contrastive Representation Learning
Yuan Yao
Nandakishor Desai
M. Palaniswami
SSL
146
8
0
11 Nov 2022
Contrastive Self-Supervised Learning for Skeleton Representations
N. Lingg
Miguel Sarabia
Luca Zappella
B. Theobald
SSL
47
0
0
10 Nov 2022
Pushing the limits of self-supervised speaker verification using regularized distillation framework
Yafeng Chen
Siqi Zheng
Haibo Wang
Luyao Cheng
Qian Chen
75
27
0
08 Nov 2022
Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Dominik Filipiak
Andrzej Zapala
Piotr Tempczyk
A. Fensel
Marek Cygan
ISeg
104
11
0
07 Nov 2022
Generalizable Re-Identification from Videos with Cycle Association
Zhongdao Wang
Zhaopeng Dou
Jingwei Zhang
Liang Zhen
Yifan Sun
Yali Li
Shengjin Wang
BDL
66
2
0
07 Nov 2022
Contrastive Classification and Representation Learning with Probabilistic Interpretation
Rahaf Aljundi
Yash J. Patel
Milan Šulc
Daniel Olmeda
N. Chumerin
SSL
56
7
0
07 Nov 2022
Okapi: Generalising Better by Making Statistical Matches Match
Myles Bartlett
Sara Romiti
V. Sharmanska
Novi Quadrianto
83
3
0
07 Nov 2022
MogaNet: Multi-order Gated Aggregation Network
Siyuan Li
Zedong Wang
Zicheng Liu
Cheng Tan
Haitao Lin
Di Wu
Zhiyuan Chen
Jiangbin Zheng
Stan Z. Li
107
65
0
07 Nov 2022
Unsupervised Visual Representation Learning via Mutual Information Regularized Assignment
Dong Lee
Sung-Ik Choi
Hyunwoo J. Kim
Sae-Young Chung
SSL
97
7
0
04 Nov 2022
Embed and Emulate: Learning to estimate parameters of dynamical systems with uncertainty quantification
Ruoxi Jiang
Rebecca Willett
61
7
0
03 Nov 2022
Neural Systematic Binder
Gautam Singh
Yeongbin Kim
Sungjin Ahn
OCL
114
37
0
02 Nov 2022
Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan
Wei Shen
Xuehang Yang
Qi Feng
Zekun Jiang
Xiaokang Yang
147
25
0
01 Nov 2022
Trade-off Between Efficiency and Consistency for Removal-based Explanations
Yifan Zhang
Haowei He
Zhiyuan Tan
Yang Yuan
FAtt
97
4
0
31 Oct 2022
Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation
Simone Rossetti
Damiano Zappia
Marta Sanzari
M. Schaerf
F. Pirri
ViT
104
59
0
31 Oct 2022
A simple, efficient and scalable contrastive masked autoencoder for learning visual representations
Shlok Kumar Mishra
Joshua Robinson
Huiwen Chang
David Jacobs
Aaron Sarna
Aaron Maschinot
Dilip Krishnan
DiffM
114
31
0
30 Oct 2022
Saliency Can Be All You Need In Contrastive Self-Supervised Learning
Veysel Kocaman
O. M. Shir
Thomas Bäck
A. Belbachir
63
1
0
30 Oct 2022
Elastic Weight Consolidation Improves the Robustness of Self-Supervised Learning Methods under Transfer
Andrius Ovsianas
Jason Ramapuram
Dan Busbridge
Eeshan Gunesh Dhekane
Russ Webb
29
4
0
28 Oct 2022
cRedAnno+: Annotation Exploitation in Self-Explanatory Lung Nodule Diagnosis
Jiahao Lu
Chong Yin
Kenny Erleben
M. B. Nielsen
S. Darkner
104
1
0
28 Oct 2022
A comprehensive study on self-supervised distillation for speaker representation learning
Zhengyang Chen
Yao Qian
Bing Han
Y. Qian
Michael Zeng
SSL
129
17
0
28 Oct 2022
FUSSL: Fuzzy Uncertain Self Supervised Learning
S. Mohamadi
Gianfranco Doretto
Donald Adjeroh
68
6
0
28 Oct 2022
State of the Art in Dense Monocular Non-Rigid 3D Reconstruction
Edith Tretschk
Navami Kairanda
R. MallikarjunB.
Rishabh Dabral
Adam Kortylewski
Bernhard Egger
Marc Habermann
Pascal Fua
Christian Theobalt
Vladislav Golyanik
3DH
120
36
0
27 Oct 2022
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Ruijie Tao
Kong Aik Lee
Rohan Kumar Das
Ville Hautamaki
Haizhou Li
SSL
90
12
0
27 Oct 2022
Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models
Chaofan Ma
Yu-Hao Yang
Yanfeng Wang
Ya Zhang
Weidi Xie
VLM
74
48
0
27 Oct 2022
CLIP-FLow: Contrastive Learning by semi-supervised Iterative Pseudo labeling for Optical Flow Estimation
Zhiqi Zhang
Nitin Bansal
Changjiang Cai
Pan Ji
Qingan Yan
Xiangyu Xu
Yi Tian Xu
102
5
0
25 Oct 2022
From colouring-in to pointillism: revisiting semantic segmentation supervision
Rodrigo Benenson
V. Ferrari
VLM
74
21
0
25 Oct 2022
Learning Explicit Object-Centric Representations with Vision Transformers
Oscar Vikström
Alexander Ilin
OCL
ViT
79
4
0
25 Oct 2022
On Fine-Tuned Deep Features for Unsupervised Domain Adaptation
Qian Wang
T. Breckon
55
3
0
25 Oct 2022
Previous
1
2
3
...
68
69
70
...
82
83
84
Next