Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification
Adrian El Baz
Ihsan Ullah
Edesio Alcobaça
André C. P. L. F. de Carvalho
Hong Chen
...
Ekrem Öztürk
J. V. Rijn
Haozhe Sun
Xin Wang
Wenwu Zhu
79
12
0
15 Jun 2022
Rethinking Generalization in Few-Shot Classification
Markus Hiller
Rongkai Ma
Mehrtash Harandi
Tom Drummond
OCL
VLM
112
57
0
15 Jun 2022
It's Time for Artistic Correspondence in Music and Video
Dídac Surís
Carl Vondrick
Bryan C. Russell
Justin Salamon
64
37
0
14 Jun 2022
ReCo: Retrieve and Co-segment for Zero-shot Transfer
Gyungin Shin
Weidi Xie
Samuel Albanie
VLM
129
92
0
14 Jun 2022
Peripheral Vision Transformer
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViT
MDE
79
33
0
14 Jun 2022
Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINO
Javier Rando
Nasib Naimi
Thomas Baumann
Max Mathys
AAML
53
6
0
14 Jun 2022
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
OffRL
81
50
0
14 Jun 2022
Learning Task-Independent Game State Representations from Unlabeled Images
C. Trivedi
Konstantinos Makantasis
Antonios Liapis
Georgios N. Yannakakis
SSL
79
6
0
13 Jun 2022
Multimodal Learning with Transformers: A Survey
Peng Xu
Xiatian Zhu
David Clifton
ViT
233
575
0
13 Jun 2022
Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation
Wouter Van Gansbeke
Simon Vandenhende
Luc Van Gool
109
55
0
13 Jun 2022
Narrowing the Gap: Improved Detector Training with Noisy Location Annotations
Shaoru Wang
Jin Gao
Bing Li
Weiming Hu
ObjD
NoLa
73
9
0
12 Jun 2022
Deep Learning Models for Automated Classification of Dog Emotional States from Facial Expressions
Tali Boneh-Shitrit
Shirzad Amir
A. Bremhorst
D. Mills
S. Riemer
Dror Fried
Anna Zamansky
51
9
0
11 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
104
33
0
10 Jun 2022
Is Self-Supervised Learning More Robust Than Supervised Learning?
Yuanyi Zhong
Haoran Tang
Jun-Kun Chen
Jian-wei Peng
Yu-Xiong Wang
SSL
OOD
77
25
0
10 Jun 2022
Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Trenton Chang
Daniel Y. Fu
30
0
0
10 Jun 2022
SERE: Exploring Feature Self-relation for Self-supervised Transformer
Zhong-Yu Li
Shanghua Gao
Ming-Ming Cheng
ViT
MDE
101
14
0
10 Jun 2022
Federated Momentum Contrastive Clustering
Runxuan Miao
Erdem Koyuncu
FedML
63
14
0
10 Jun 2022
Positional Label for Self-Supervised Vision Transformer
Zhemin Zhang
Xun Gong
ViT
MDE
59
6
0
10 Jun 2022
Masked Autoencoders are Robust Data Augmentors
Haohang Xu
Shuangrui Ding
Xiaopeng Zhang
H. Xiong
139
28
0
10 Jun 2022
Extreme Masking for Learning Instance and Distributed Visual Representations
Zhirong Wu
Zihang Lai
Xiao Sun
Stephen Lin
106
22
0
09 Jun 2022
Spatial Entropy as an Inductive Bias for Vision Transformers
E. Peruzzo
E. Sangineto
Yahui Liu
Marco De Nadai
Wei Bi
Bruno Lepri
N. Sebe
ViT
MDE
121
2
0
09 Jun 2022
CASS: Cross Architectural Self-Supervision for Medical Image Analysis
Pranav Singh
E. Sizikova
Jacopo Cirrone
OOD
173
8
0
08 Jun 2022
Can CNNs Be More Robust Than Transformers?
Zeyu Wang
Yutong Bai
Yuyin Zhou
Cihang Xie
UQCV
OOD
115
46
0
07 Jun 2022
DiMS: Distilling Multiple Steps of Iterative Non-Autoregressive Transformers for Machine Translation
Sajad Norouzi
Rasa Hosseinzadeh
Felipe Pérez
M. Volkovs
44
2
0
07 Jun 2022
Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
Richard J. Chen
Chengkuan Chen
Yicong Li
Tiffany Y. Chen
A. Trister
Rahul G. Krishnan
Faisal Mahmood
ViT
MedIm
125
432
0
06 Jun 2022
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Bin Zhang
Dapeng Li
Zeren Zhang
Guangchong Zhou
Hao Chen
Guoliang Fan
77
15
0
06 Jun 2022
Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data
Shohreh Deldari
Hao Xue
Aaqib Saeed
Jiayuan He
Daniel V. Smith
Flora D. Salim
AI4TS
75
37
0
06 Jun 2022
Integrating Prior Knowledge in Contrastive Learning with Kernel
Benoit Dufumier
C. Barbano
Robin Louiset
Edouard Duchesnay
Pietro Gori
SSL
71
8
0
03 Jun 2022
Pruning for Feature-Preserving Circuits in CNNs
Christopher Hamblin
Talia Konkle
G. Alvarez
76
2
0
03 Jun 2022
On the duality between contrastive and non-contrastive self-supervised learning
Q. Garrido
Yubei Chen
Adrien Bardes
Laurent Najman
Yann LeCun
SSL
92
94
0
03 Jun 2022
Learning an Adaptation Function to Assess Image Visual Similarities
Olivier Risser-Maroix
Amine Marzouki
Hala Djeghim
Camille Kurtz
Nicolas Loménie
41
3
0
03 Jun 2022
Using Representation Expressiveness and Learnability to Evaluate Self-Supervised Learning Methods
Yuchen Lu
Zhen Liu
A. Baratin
Romain Laroche
Rameswar Panda
Alessandro Sordoni
SSL
76
0
0
02 Jun 2022
Siamese Image Modeling for Self-Supervised Vision Representation Learning
Chenxin Tao
Xizhou Zhu
Weijie Su
Gao Huang
Bin Li
Jie Zhou
Yu Qiao
Xiaogang Wang
Jifeng Dai
SSL
109
96
0
02 Jun 2022
Hard Negative Sampling Strategies for Contrastive Representation Learning
Afrina Tabassum
Muntasir Wahed
Hoda Eldardiry
Ismini Lourentzou
SSL
109
25
0
02 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
145
372
0
02 Jun 2022
Optimizing Relevance Maps of Vision Transformers Improves Robustness
Hila Chefer
Idan Schwartz
Lior Wolf
ViT
107
38
0
02 Jun 2022
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li
Junyu Chen
Yucheng Tang
Ce Wang
Bennett A. Landman
S. K. Zhou
ViT
OOD
MedIm
175
46
0
02 Jun 2022
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Fei Wu
Qingzhong Wang
Jian Bian
Haoyi Xiong
Ning Ding
Feixiang Lu
Junqing Cheng
Dejing Dou
AI4TS
84
57
0
02 Jun 2022
Positive Unlabeled Contrastive Learning
Anish Acharya
Sujay Sanghavi
Li Jing
Bhargav Bhushanam
Dhruv Choudhary
Michael G. Rabbat
Inderjit Dhillon
SSL
65
11
0
01 Jun 2022
A comparative study between vision transformers and CNNs in digital pathology
Luca Deininger
Bernhard Stimpel
Anil Yüce
Samaneh Abbasi-Sureshjani
Simon Schönenberger
P. Ocampo
Konstanty Korski
F. Gaire
ViT
MedIm
47
30
0
01 Jun 2022
Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction
Jun Chen
Ming Hu
Boyang Albert Li
Mohamed Elhoseiny
142
37
0
01 Jun 2022
CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping
Junlin Han
L. Petersson
Hongdong Li
Ian Reid
86
9
0
31 May 2022
Surface Analysis with Vision Transformers
Simon Dahan
Logan Z. J. Williams
Abdulah Fawaz
Daniel Rueckert
E. C. Robinson
ViT
MedIm
74
2
0
31 May 2022
Decomposing NeRF for Editing via Feature Field Distillation
Sosuke Kobayashi
Eiichi Matsumoto
Vincent Sitzmann
266
343
0
31 May 2022
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
231
51
0
30 May 2022
Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small Datasets
Leandro M. de Lima
R. Krohling
ViT
MedIm
70
11
0
30 May 2022
Self-Supervised Visual Representation Learning with Semantic Grouping
Xin Wen
Bingchen Zhao
Anlin Zheng
Xinming Zhang
Xiaojuan Qi
SSL
219
74
0
30 May 2022
Conformal Credal Self-Supervised Learning
Julian Lienen
Caglar Demir
Eyke Hüllermeier
92
14
0
30 May 2022
Self-Supervised Pre-training of Vision Transformers for Dense Prediction Tasks
Jaonary Rabarisoa
Velentin Belissen
Florian Chabot
Q. C. Pham
VLM
ViT
SSL
MDE
45
3
0
30 May 2022
GMML is All you Need
Sara Atito
Muhammad Awais
J. Kittler
ViT
VLM
87
18
0
30 May 2022
Previous
1
2
3
...
74
75
76
...
82
83
84
Next