Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
110
28
0
29 Jul 2024
SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction
cCaughan Koksal
Ghazal Ghazaei
Felix Holm
Azade Farshad
Nassir Navab
MedIm
87
3
0
29 Jul 2024
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang
Karl Schmeckpeper
Brandon B. May
M. Minniti
Tarik Kelestemur
David Watkins
Laura Herlant
VLM
101
24
0
29 Jul 2024
Diffusion Feedback Helps CLIP See Better
Wenxuan Wang
Quan-Sen Sun
Fan Zhang
Yepeng Tang
Jing Liu
Xinlong Wang
VLM
138
17
0
29 Jul 2024
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention
Yu Lu
Yuanzhi Liang
Linchao Zhu
Yi Yang
DiffM
VGen
116
32
0
29 Jul 2024
Contextuality Helps Representation Learning for Generalized Category Discovery
Tingzhang Luo
Mingxuan Du
Jiatao Shi
Xinxiang Chen
Bingchen Zhao
Shaoguang Huang
72
4
0
29 Jul 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
177
2
0
29 Jul 2024
Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis
Honglin Li
Yusuan Sun
Chenglu Zhu
Yunlong Zhang
Shichuan Zhang
...
Pingyi Chen
Jingxiong Li
Sunyi Zheng
Can Cui
Lin Yang
85
3
0
28 Jul 2024
HRP: Human Affordances for Robotic Pre-Training
Mohan Kumar Srirama
Sudeep Dasari
Shikhar Bahl
Abhinav Gupta
101
19
0
26 Jul 2024
PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
Fernando Julio Cendra
Bingchen Zhao
Kai Han
VLM
CLL
102
6
0
26 Jul 2024
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Aleksandar Shtedritski
Christian Rupprecht
Andrea Vedaldi
3DPC
3DH
3DV
70
3
0
26 Jul 2024
Deep Companion Learning: Enhancing Generalization Through Historical Consistency
Ruizhao Zhu
Venkatesh Saligrama
FedML
87
0
0
26 Jul 2024
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers
Longkun Zou
Wanru Zhu
Ke Chen
Lihua Guo
K. Guo
Kui Jia
Yaowei Wang
3DPC
ViT
84
0
0
26 Jul 2024
Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
Pulkit Kumar
Namitha Padmanabhan
Luke Luo
Sai Saketh Rambhatla
Abhinav Shrivastava
93
4
0
25 Jul 2024
MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos
Zsófia Katona
Seyed Sahand Mohamadi Ziabari
Fatemeh Karimi Nejadasl
54
1
0
25 Jul 2024
DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability
Florent Brondolo
Samuel Beaussant
AI4CE
86
1
0
25 Jul 2024
Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT
Niels G. Faber
Seyed Sahand Mohamadi Ziabari
Fatemeh Karimi Nejadasl
95
3
0
25 Jul 2024
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision
Tim J. M. Jaspers
Ronald L.P.D. de Jong
Yasmina Alkhalil
Tijn Zeelenberg
C. H. Kusters
...
Franciscus Hendericus Aäron Bakker
J P Ruurda
Willem M. Brinkman
Peter H. N. de With
Fons van der Sommen
83
3
0
25 Jul 2024
Balancing Complementarity and Consistency via Delayed Activation in Incomplete Multi-view Clustering
Bo Li
98
1
0
25 Jul 2024
Revisiting Machine Unlearning with Dimensional Alignment
Seonguk Seo
Dongwan Kim
Bohyung Han
MU
58
1
0
25 Jul 2024
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su
Shihao Ji
90
0
0
24 Jul 2024
PEEKABOO: Hiding parts of an image for unsupervised object localization
Hasib Zunair u
24 A.BenHamza
SSL
130
0
0
24 Jul 2024
Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification?
Johannes Kiechle
Daniel M. Lang
Stefan M. Fischer
Lina Felsner
J. Peeken
Julia A. Schnabel
MedIm
70
0
0
24 Jul 2024
Contrastive Learning Is Not Optimal for Quasiperiodic Time Series
A. Atienza
J. Bardram
S. Puthusserypady
BDL
AI4TS
85
2
0
24 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu
Zhong Ji
YunLong Yu
Jiale Cao
Yanwei Pang
Jungong Han
Xuelong Li
CLL
142
5
0
24 Jul 2024
SINDER: Repairing the Singular Defects of DINOv2
Haoqian Wang
Tong Zhang
Mathieu Salzmann
57
4
0
23 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
75
3
0
23 Jul 2024
A Multi-view Mask Contrastive Learning Graph Convolutional Neural Network for Age Estimation
Yiping Zhang
Yuntao Shou
Tao Meng
Wei Ai
Keqin Li
CVBM
110
10
0
23 Jul 2024
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
Yiwei Ma
Zhibin Wang
Xiaoshuai Sun
Weihuang Lin
Qiang-feng Zhou
Jiayi Ji
Rongrong Ji
MLLM
VLM
105
2
0
23 Jul 2024
Reconstructing Training Data From Real World Models Trained with Transfer Learning
Yakir Oz
Gilad Yehudai
Gal Vardi
Itai Antebi
Michal Irani
Niv Haim
67
3
0
22 Jul 2024
CarFormer: Self-Driving with Learned Object-Centric Representations
Shadi S. Hamdan
Fatma Guney
3DPC
OCL
98
3
0
22 Jul 2024
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning
Yibing Wei
Abhinav Gupta
Pedro Morgado
SSL
75
8
0
22 Jul 2024
MILAN: Milli-Annotations for Lidar Semantic Segmentation
Nermin Samet
Gilles Puy
Oriane Siméoni
Renaud Marlet
3DPC
82
0
0
22 Jul 2024
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video
Guiqiu Liao
M. Jogan
Sai Koushik
Eric Eaton
Daniel A. Hashimoto
VOS
99
2
0
22 Jul 2024
Predicting the Best of N Visual Trackers
B. Alawode
S. Javed
Arif Mahmood
Jirí Matas
107
1
0
22 Jul 2024
Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video Retrieval
Yang Liu
Qianqian Xu
Peisong Wen
Siran Dai
Qingming Huang
100
1
0
22 Jul 2024
SIGMA:Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi
Michael Dorkenwald
Fida Mohammad Thoker
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
96
7
0
22 Jul 2024
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Fudong Lin
Jiadong Lou
Xu Yuan
Nianfeng Tzeng
ViT
AAML
88
2
0
22 Jul 2024
CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning
Yu Feng
Zhen Tian
Zhonghong Ou
Zongfu Han
Haoran Luo
Guangwei Zhang
Meina Song
CLL
VLM
63
8
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
168
9
0
22 Jul 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
112
6
0
21 Jul 2024
Assessing Sample Quality via the Latent Space of Generative Models
Jingyi Xu
Hieu M. Le
Dimitris Samaras
MedIm
101
3
0
21 Jul 2024
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
Md Zarif Hossain
Ahmed Imteaj
VLM
AAML
66
6
0
20 Jul 2024
Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning
Chen Shen
Chunfeng Lian
Wanqing Zhang
Fan Wang
Jianhua Zhang
...
Hongshu Mu
Hao Wu
Xinggong Liang
Jianhua Ma
Zhenyuan Wang
109
1
0
20 Jul 2024
∞
\infty
∞
-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
Minh-Quan Le
Alexandros Graikos
Srikar Yellapragada
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
96
10
0
20 Jul 2024
On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition
Zihu Wang
Lingqiao Liu
Scott Ricardo Figueroa Weston
Samuel Tian
Peng Li
76
2
0
19 Jul 2024
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Feyza Yavuz
Baris Can Cam
Adnan Harun Dogan
Kemal Oksuz
Emre Akbas
Sinan Kalkan
62
2
0
19 Jul 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
94
3
0
18 Jul 2024
Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management
Yoon-Jeong Hwang
Stefan Zohren
Yongjae Lee
AIFin
73
1
0
18 Jul 2024
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures
Hao Lu
Wenze Liu
Hongtao Fu
Zhiguo Cao
59
3
0
18 Jul 2024
Previous
1
2
3
...
24
25
26
...
82
83
84
Next