Emerging Properties in Self-Supervised Vision Transformers

29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin

Papers citing "Emerging Properties in Self-Supervised Vision Transformers"

50 / 4,176 papers shown
PixelDINO: Semi-Supervised Semantic Segmentation for Detecting Permafrost Disturbances
Konrad Heidler
Ingmar Nitze
Guido Grosse
Xiao Xiang Zhu
58
4
0
17 Jan 2024
P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering
Chuyu Zhang
Hui Ren
Xuming He
115
7
0
17 Jan 2024
Continuous Piecewise-Affine Based Motion Model for Image Animation
Hexiang Wang
Fengqi Liu
Qianyu Zhou
Ran Yi
Xin Tan
Lizhuang Ma
VGen
68
10
0
17 Jan 2024
Visual Robotic Manipulation with Depth-Aware Pretraining
Wanying Wang
Jinming Li
Yichen Zhu
Zhiyuan Xu
Zhengping Che
Chaomin Shen
Yaxin Peng
Dong Liu
Feifei Feng
Jian Tang
MDE
91
4
0
17 Jan 2024
ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization
Weiyao Wang
Pierre Gleize
Hao Tang
Xingyu Chen
Kevin J. Liang
Matt Feiszli
61
1
0
17 Jan 2024
B-Cos Aligned Transformers Learn Human-Interpretable Features
Manuel Tran
Amal Lahiani
Yashin Dicente Cid
Melanie Boxberg
Peter Lienemann
C. Matek
S. J. Wagner
Fabian J. Theis
Eldad Klaiman
Tingying Peng
MedIm, ViT
52
2
0
16 Jan 2024
Scalable Pre-training of Large Autoregressive Image Models
Alaaeldin El-Nouby
Michal Klein
Shuangfei Zhai
Miguel Angel Bautista
Alexander Toshev
Vaishaal Shankar
J. Susskind
Armand Joulin
VLM
105
80
0
16 Jan 2024
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
...
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
175
20
0
16 Jan 2024
Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization
Qi Bi
Wei Ji
Jingjun Yi
Haolan Zhan
Gui-Song Xia
123
1
0
16 Jan 2024
The Faiss library
Matthijs Douze
Alexandr Guzhva
Chengqi Deng
Jeff Johnson
Gergely Szilvasy
Pierre-Emmanuel Mazaré
Maria Lomeli
Lucas Hosseini
Hervé Jégou
210
189
0
16 Jan 2024
Image Similarity using An Ensemble of Context-Sensitive Models
Zukang Liao
Min Chen
75
1
0
15 Jan 2024
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness
Rongyu Zhang
Zefan Cai
Huanrui Yang
Zidong Liu
Denis A. Gudovskiy
...
Kurt Keutzer
Baobao Chang
Yuan Du
Li Du
Shanghang Zhang
VLM
107
1
0
15 Jan 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
166
1
0
15 Jan 2024
Transformer for Object Re-Identification: A Survey
Mang Ye
Shuo Chen
Chenyue Li
Wei-Shi Zheng
David J. Crandall
Bo Du
ViT
162
16
0
13 Jan 2024
AffordanceLLM: Grounding Affordance from Vision Language Models
Shengyi Qian
Weifeng Chen
Min Bai
Xiong Zhou
Zhuowen Tu
Li Erran Li
112
24
0
12 Jan 2024
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
Edward Sanderson
B. Matuszewski
79
2
0
11 Jan 2024
End-to-end Learnable Clustering for Intent Learning in Recommendation
Yue Liu
Shihao Zhu
Jun Xia
Yingwei Ma
Jian Ma
Wenliang Zhong
Xinwang Liu
Guannan Zhang
Kejun Zhang
114
11
0
11 Jan 2024
Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection
Wei Ye
Chaoya Jiang
Haiyang Xu
Chenhao Ye
Chenliang Li
Mingshi Yan
Shikun Zhang
Songhang Huang
Fei Huang
VLM
84
0
0
11 Jan 2024
FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
GeonU Kim
Youwang Kim
Tae-Hyun Oh
3DH
74
4
0
10 Jan 2024
Transformer-CNN Fused Architecture for Enhanced Skin Lesion Segmentation
Siddharth Tiwari
MedIm, ViT
52
0
0
10 Jan 2024
Do Vision and Language Encoders Represent the World Similarly?
Mayug Maniparambil
Raiymbek Akshulakov
Y. A. D. Djilali
Sanath Narayan
M. Seddik
K. Mangalam
Noel E. O'Connor
VLM
102
14
0
10 Jan 2024
Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness
Sibo Wang
Jie Zhang
Zheng Yuan
Shiguang Shan
VLM
109
24
0
09 Jan 2024
RudolfV: A Foundation Model by Pathologists for Pathologists
Jonas Dippel
Barbara Feulner
Tobias Winterhoff
Timo Milbich
Stephan Tietz
...
David Horst
Lukas Ruff
Klaus-Robert Muller
Frederick Klauschen
Maximilian Alber
132
32
0
08 Jan 2024
Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification
A. B. Panambur
Hui Yu
Sheethal Bhat
Prathmesh Madhu
Siming Bayer
Andreas Maier
MedIm
71
1
0
08 Jan 2024
Fully Attentional Networks with Self-emerging Token Labeling
Bingyin Zhao
Zhiding Yu
Shiyi Lan
Yutao Cheng
A. Anandkumar
Yingjie Lao
Jose M. Alvarez
1.0K
6
0
08 Jan 2024
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
Wenxi Chen
Yuzhe Liang
Ziyang Ma
Zhisheng Zheng
Xie Chen
ViT
107
22
0
07 Jan 2024
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline
Haonan Wang
Qianli Shen
Yao Tong
Yang Zhang
Kenji Kawaguchi
125
31
0
07 Jan 2024
Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection
Yuanpeng Tu
Boshen Zhang
Liang Liu
Yuxi Li
Xuhai Chen
Jiangning Zhang
Yabiao Wang
Chengjie Wang
C. Zhao
138
13
0
06 Jan 2024
Denoising Vision Transformers
Jiawei Yang
Katie Z Luo
Jie Li
Kilian Q. Weinberger
Yonglong Tian
Yue Wang
DiffM
54
15
0
05 Jan 2024
Subjective and Objective Analysis of Indian Social Media Video Quality
Sandeep Mishra
Mukul Jha
A. Bovik
123
0
0
05 Jan 2024
GTA: Guided Transfer of Spatial Attention from Object-Centric Representations
SeokHyun Seo
Jinwoo Hong
Jungwoo Chae
Kyungyul Kim
Sangheum Hwang
77
0
0
05 Jan 2024
Fus-MAE: A cross-attention-based data fusion approach for Masked Autoencoders in remote sensing
Hugo Chan-To-Hing
B. Veeravalli
105
9
0
05 Jan 2024
Learning the 3D Fauna of the Web
Zizhang Li
Dor Litvak
Ruining Li
Yunzhi Zhang
Tomas Jakab
Christian Rupprecht
Shangzhe Wu
Andrea Vedaldi
Jiajun Wu
91
25
0
04 Jan 2024
Data-Centric Foundation Models in Computational Healthcare: A Survey
Yunkun Zhang
Jin Gao
Zheling Tan
Lingfeng Zhou
Kexin Ding
Mu Zhou
Shaoting Zhang
Dequan Wang
AI4CE
113
25
0
04 Jan 2024
PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning
Haiyang Guo
Fei Zhu
Wenzhuo Liu
Xu-Yao Zhang
Cheng-Lin Liu
CLL
104
9
0
04 Jan 2024
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket
Zhaokun Zhou
Kaiwei Che
Wei Fang
Keyu Tian
Yuesheng Zhu
Shuicheng Yan
Yonghong Tian
Liuliang Yuan
ViT
124
33
0
04 Jan 2024
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
Xingxing Zuo
Pouya Samangouei
Yunwen Zhou
Yan Di
Mingyang Li
3DGS
105
51
0
03 Jan 2024
LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry
Weirong Chen
Le Chen
Rui Wang
Marc Pollefeys
130
24
0
03 Jan 2024
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning
Danwei Cai
Zexin Cai
Ze Li
Ming Li
66
0
0
03 Jan 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
87
22
0
02 Jan 2024
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes
Dmitry Demidov
Roba Al Majzoub
Amandeep Kumar
Fahad Khan
90
1
0
02 Jan 2024
CityPulse: Fine-Grained Assessment of Urban Change with Street View Time Series
Tianyuan Huang
Zejia Wu
Jiajun Wu
Jackelyn Hwang
Ram Rajagopal
AI4TS
47
4
0
02 Jan 2024
Refining Pre-Trained Motion Models
Xinglong Sun
Adam W. Harley
Leonidas Guibas
71
11
0
01 Jan 2024
Diffusion Models, Image Super-Resolution And Everything: A Survey
Brian B. Moser
Arundhati S. Shanbhag
Federico Raue
Stanislav Frolov
Sebastián M. Palacio
Andreas Dengel
108
41
0
01 Jan 2024
GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields
X. Pan
Zongxin Yang
Shuai Bai
Yi Yang
DiffM, OffRL
94
1
0
01 Jan 2024
Analyzing Local Representations of Self-supervised Vision Transformers
Ani Vanyan
Alvard Barseghyan
Hakob Tamazyan
Vahan Huroyan
Hrant Khachatrian
Martin Danelljan
114
3
0
31 Dec 2023
Morphing Tokens Draw Strong Masked Image Models
Taekyung Kim
Byeongho Heo
Dongyoon Han
194
3
0
30 Dec 2023
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
Xin Zhang
Jinheng Xie
Yuan Yuan
Michael Bi Mi
Robby T. Tan
VOS, OCL, VLM
151
4
0
29 Dec 2023
A randomized algorithm to solve reduced rank operator regression
G. Turri
Vladimir Kostic
P. Novelli
Massimiliano Pontil
103
4
0
28 Dec 2023
Learning Vision from Models Rivals Learning Vision from Data
Yonglong Tian
Lijie Fan
Kaifeng Chen
Dina Katabi
Dilip Krishnan
Phillip Isola
113
52
0
28 Dec 2023