ResearchTrend.AI

arXiv: 2104.14294
Emerging Properties in Self-Supervised Vision Transformers
Versions: v1, v2 (latest)

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,175 papers shown
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech
Jaejin Cho
Jesús Villalba
Laureano Moro-Velazquez
Najim Dehak
SSL
90
18
0
10 Aug 2022
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Jaejin Cho
R. Pappagari
Piotr Żelasko
Laureano Moro-Velazquez
Jesús Villalba
Najim Dehak
SSL
66
13
0
10 Aug 2022
PatchDropout: Economizing Vision Transformers Using Patch Dropout
Yue Liu
Christos Matsoukas
Fredrik Strand
Hossein Azizpour
Kevin Smith
64
24
0
10 Aug 2022
Self-supervised Multi-modal Training from Uncurated Image and Reports Enables Zero-shot Oversight Artificial Intelligence in Radiology
Sangjoon Park
Eunha Lee
Kyung Sook Shin
Jeonghyeon Lee
Jong Chul Ye
53
2
0
10 Aug 2022
How Well Do Vision Transformers (VTs) Transfer To The Non-Natural Image Domain? An Empirical Study Involving Art Classification
Vincent Tonkes
M. Sabatelli
ViT
63
6
0
09 Aug 2022
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature
Xiangwen Kong
Xiangyu Zhang
SSL
78
54
0
08 Aug 2022
Global Hierarchical Attention for 3D Point Cloud Analysis
Dan Jia
Alexander Hermans
Bastian Leibe
3DPC
49
0
0
07 Aug 2022
Hierarchical Semantic Regularization of Latent Spaces in StyleGANs
Tejan Karmali
Rishubh Parihar
Susmit Agrawal
Harsh Rangwani
Varun Jampani
M. Singh
R. Venkatesh Babu
75
11
0
07 Aug 2022
P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting
Ziyi Wang
Xumin Yu
Yongming Rao
Jie Zhou
Jiwen Lu
VPVLMVLM
95
77
0
04 Aug 2022
Analyzing Data-Centric Properties for Graph Contrastive Learning
Puja Trivedi
Ekdeep Singh Lubana
Mark Heimann
Danai Koutra
Jayaraman J. Thiagarajan
103
11
0
04 Aug 2022
OpenCon: Open-world Contrastive Learning
Yiyou Sun
Yixuan Li
VLM, SSL, DRL
147
43
0
04 Aug 2022
MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth
Chenjie Cao
Xinlin Ren
Yanwei Fu
108
54
0
04 Aug 2022
RAZE: Region Guided Self-Supervised Gaze Representation Learning
Neeru Dubey
Shreya Ghosh
Abhinav Dhall
72
2
0
04 Aug 2022
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction
Bing Han
Zhengyang Chen
Y. Qian
61
32
0
03 Aug 2022
Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning
Lu Zhang
Lu Qi
Xu Yang
Hong Qiao
Ming-Hsuan Yang
Zhiyong Liu
SSL
60
3
0
01 Aug 2022
COCOA: Cross Modality Contrastive Learning for Sensor Data
Shohreh Deldari
Hao Xue
Aaqib Saeed
Daniel V. Smith
Flora D. Salim
SSL
94
41
0
31 Jul 2022
SdAE: Self-distillated Masked Autoencoder
Yabo Chen
Yuchen Liu
Dongsheng Jiang
Xiaopeng Zhang
Wenrui Dai
H. Xiong
Qi Tian
ViT
99
73
0
31 Jul 2022
Revisiting the Critical Factors of Augmentation-Invariant Representation Learning
Junqiang Huang
Xiangwen Kong
Xiangyu Zhang
45
6
0
30 Jul 2022
A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond
Chaoning Zhang
Chenshuang Zhang
Junha Song
John Seon Keun Yi
Kang Zhang
In So Kweon
SSL
96
77
0
30 Jul 2022
SimCURL: Simple Contrastive User Representation Learning from Command Sequences
Hang Chu
Amir Hosein Khasahmadi
Karl D. D. Willis
Fraser Anderson
Yaoli Mao
Linh-Tam Tran
Justin Matejka
Jo Vermeulen
SSL
63
2
0
29 Jul 2022
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval
Nicola Messina
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Giuseppe Amato
Rita Cucchiara
VLM
40
22
0
29 Jul 2022
Global-Local Self-Distillation for Visual Representation Learning
Tim Lebailly
Tinne Tuytelaars
SSL
53
6
0
29 Jul 2022
Self-supervised learning with rotation-invariant kernels
Léon Zheng
Gilles Puy
E. Riccietti
Patrick Pérez
Rémi Gribonval
SSL
58
2
0
28 Jul 2022
On the robustness of self-supervised representations for multi-view object classification
David Torpey
Richard Klein
SSL
26
1
0
27 Jul 2022
Contrastive Masked Autoencoders are Stronger Vision Learners
Zhicheng Huang
Xiaojie Jin
Cheng Lu
Qibin Hou
Ming-Ming Cheng
Dongmei Fu
Xiaohui Shen
Jiashi Feng
154
154
0
27 Jul 2022
Deep Clustering with Features from Self-Supervised Pretraining
Xingzhi Zhou
N. Zhang
ViT, 3DPC, SSL
45
11
0
27 Jul 2022
Exploring the Design of Adaptation Protocols for Improved Generalization and Machine Learning Safety
Puja Trivedi
Danai Koutra
Jayaraman J. Thiagarajan
AAML
57
0
0
26 Jul 2022
Active Learning Strategies for Weakly-supervised Object Detection
Huy V. Vo
Oriane Siméoni
Spyros Gidaris
Andrei Bursuc
Patrick Pérez
Jean Ponce
113
19
0
25 Jul 2022
Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer
Yingyi Chen
Xiaoke Shen
Yahui Liu
Qinghua Tao
Johan A. K. Suykens
AAML, ViT
85
24
0
25 Jul 2022
High-Resolution Swin Transformer for Automatic Medical Image Segmentation
Chen Wei
Shenghan Ren
Kaitai Guo
Haihong Hu
Jimin Liang
ViT, OOD, MedIm
57
43
0
23 Jul 2022
Discrete Key-Value Bottleneck
Frederik Träuble
Anirudh Goyal
Nasim Rahaman
Michael C. Mozer
Kenji Kawaguchi
Yoshua Bengio
Bernhard Schölkopf
CLL
92
23
0
22 Jul 2022
Few-Shot Class-Incremental Learning via Entropy-Regularized Data-Free Replay
Huan Liu
Li Gu
Zhixiang Chi
Yang Wang
Yuanhao Yu
Jun Chen
Jingshan Tang
106
88
0
22 Jul 2022
Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility
Rohit Gupta
Naveed Akhtar
Ajmal Mian
M. Shah
AAML, SSL
60
5
0
22 Jul 2022
Towards Efficient Adversarial Training on Vision Transformers
Boxi Wu
Jindong Gu
Zhifeng Li
Deng Cai
Xiaofei He
Wei Liu
ViT, AAML
94
40
0
21 Jul 2022
On Label Granularity and Object Localization
Elijah Cole
Kimberly Wilber
Grant Van Horn
Xuan S. Yang
Marco Fornoni
Pietro Perona
Serge Belongie
Andrew G. Howard
Oisin Mac Aodha
WSOL
84
13
0
20 Jul 2022
Learning from Synthetic Data: Facial Expression Classification based on Ensemble of Multi-task Networks
Jae-Yeop Jeong
Yeong-Gi Hong
Jiyeon Oh
Sumin Hong
Jin-Woo Jeong
Yuchul Jung
CVBM
70
8
0
20 Jul 2022
What Do We Maximize in Self-Supervised Learning?
Ravid Shwartz-Ziv
Randall Balestriero
Yann LeCun
SSL
76
17
0
20 Jul 2022
Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations
H. Malik
Shahina Kunhimon
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
AAML
57
8
0
18 Jul 2022
Class-incremental Novel Class Discovery
Subhankar Roy
Mingxuan Liu
Zhun Zhong
N. Sebe
Elisa Ricci
CLL
76
45
0
18 Jul 2022
Fast-MoCo: Boost Momentum-based Contrastive Learning with Combinatorial Patches
Yuanzheng Ci
Chen Lin
Lei Bai
Wanli Ouyang
SSL
74
26
0
17 Jul 2022
Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning
Shibani Santurkar
Yann Dubois
Rohan Taori
Percy Liang
Tatsunori Hashimoto
CLIP, VLM
81
41
0
15 Jul 2022
Position Prediction as an Effective Pretraining Strategy
Shuangfei Zhai
Navdeep Jaitly
Jason Ramapuram
Dan Busbridge
Tatiana Likhomanenko
Joseph Y. Cheng
Walter A. Talbott
Chen Huang
Hanlin Goh
J. Susskind
ViT
88
25
0
15 Jul 2022
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
89
78
0
14 Jul 2022
Benchmarking Omni-Vision Representation through the Lens of Visual Realms
Yuanhan Zhang
Zhen-fei Yin
Jing Shao
Ziwei Liu
VLM
112
29
0
14 Jul 2022
Unsupervised Visual Representation Learning by Synchronous Momentum Grouping
Bo Pang
Yifan Zhang
Yaoyi Li
Jia Cai
Cewu Lu
SSL
70
28
0
13 Jul 2022
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Nigamaa Nayakanti
Rami Al-Rfou
Aurick Zhou
Kratarth Goel
Khaled S. Refaat
Benjamin Sapp
AI4TS
135
259
0
12 Jul 2022
Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection
Jiashuo Yu
Jin-Yuan Liu
Ying Cheng
Rui Feng
Yuejie Zhang
99
37
0
12 Jul 2022
eX-ViT: A Novel eXplainable Vision Transformer for Weakly Supervised Semantic Segmentation
Lu Yu
Wei Xiang
Juan Fang
Yi-Ping Phoebe Chen
Lianhua Chi
ViT
77
26
0
12 Jul 2022
IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training
Xinyu Huang
Youcai Zhang
Ying Cheng
Weiwei Tian
Ruiwei Zhao
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Xiao-Yong Zhang
VLM
74
14
0
12 Jul 2022
Demystifying Unsupervised Semantic Correspondence Estimation
Mehmet Aygun
Oisin Mac Aodha
66
11
0
11 Jul 2022