Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 1,320 papers shown
Title
Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations
Bhishma Dedhia
N. Jha
OCL
54
1
0
02 Feb 2024
What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis
Takanori Ashihara
Marc Delcroix
Takafumi Moriya
Kohei Matsuura
Taichi Asami
Yusuke Ijima
SSL
24
7
0
31 Jan 2024
MLEM: Generative and Contrastive Learning as Distinct Modalities for Event Sequences
Viktor Moskvoretskii
Dmitry Osin
Egor Shvetsov
Igor Udovichenko
Maxim Zhelnin
Andrey Dukhovny
Anna Zhimerikina
E. Burnaev
AI4TS
34
2
0
29 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
34
14
0
25 Jan 2024
Memory Consistency Guided Divide-and-Conquer Learning for Generalized Category Discovery
Yuanpeng Tu
Zhun Zhong
Yuxi Li
Hengshuang Zhao
38
0
0
24 Jan 2024
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration
Yifan Zhang
Siyu Ren
Junhui Hou
Jinjian Wu
Guangming Shi
Guangming Shi
SSL
3DPC
90
3
0
23 Jan 2024
Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM
Zhenzhen Weng
Jingyuan Liu
Hao Tan
Zhan Xu
Yang Zhou
Serena Yeung-Levy
Jimei Yang
3DH
43
8
0
22 Jan 2024
LDReg: Local Dimensionality Regularized Self-Supervised Learning
Hanxun Huang
R. Campello
S. Erfani
Xingjun Ma
Michael E. Houle
James Bailey
38
5
0
19 Jan 2024
Image Translation as Diffusion Visual Programmers
Cheng Han
James Liang
Qifan Wang
Majid Rabbani
S. Dianat
Raghuveer M. Rao
Ying Nian Wu
Dongfang Liu
29
8
0
18 Jan 2024
Continuous Piecewise-Affine Based Motion Model for Image Animation
Hexiang Wang
Fengqi Liu
Qianyu Zhou
Ran Yi
Xin Tan
Lizhuang Ma
VGen
29
9
0
17 Jan 2024
ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization
Weiyao Wang
Pierre Gleize
Hao Tang
Xingyu Chen
Kevin J Liang
Matt Feiszli
28
1
0
17 Jan 2024
B-Cos Aligned Transformers Learn Human-Interpretable Features
Manuel Tran
Amal Lahiani
Yashin Dicente Cid
Melanie Boxberg
Peter Lienemann
C. Matek
S. J. Wagner
Fabian J. Theis
Eldad Klaiman
Tingying Peng
MedIm
ViT
21
2
0
16 Jan 2024
The Faiss library
Matthijs Douze
Alexandr Guzhva
Chengqi Deng
Jeff Johnson
Gergely Szilvasy
Pierre-Emmanuel Mazaré
Maria Lomeli
Lucas Hosseini
Hervé Jégou
41
147
0
16 Jan 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
67
1
0
15 Jan 2024
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
Edward Sanderson
B. Matuszewski
23
2
0
11 Jan 2024
Transformer-CNN Fused Architecture for Enhanced Skin Lesion Segmentation
Siddharth Tiwari
MedIm
ViT
48
0
0
10 Jan 2024
Do Vision and Language Encoders Represent the World Similarly?
Mayug Maniparambil
Raiymbek Akshulakov
Y. A. D. Djilali
Sanath Narayan
M. Seddik
K. Mangalam
Noel E. O'Connor
VLM
29
11
0
10 Jan 2024
RudolfV: A Foundation Model by Pathologists for Pathologists
Jonas Dippel
Barbara Feulner
Tobias Winterhoff
Timo Milbich
Stephan Tietz
...
David Horst
Lukas Ruff
Klaus-Robert Muller
Frederick Klauschen
Maximilian Alber
36
29
0
08 Jan 2024
Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification
A. B. Panambur
Hui Yu
Sheethal Bhat
Prathmesh Madhu
Siming Bayer
Andreas Maier
MedIm
34
1
0
08 Jan 2024
Fus-MAE: A cross-attention-based data fusion approach for Masked Autoencoders in remote sensing
Hugo Chan-To-Hing
B. Veeravalli
30
8
0
05 Jan 2024
Boosting Transformer's Robustness and Efficacy in PPG Signal Artifact Detection with Self-Supervised Learning
Thanh-Dung Le
34
1
0
02 Jan 2024
GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields
X. Pan
Zongxin Yang
Shuai Bai
Yi Yang
DiffM
OffRL
30
1
0
01 Jan 2024
Morphing Tokens Draw Strong Masked Image Models
Taekyung Kim
Byeongho Heo
Dongyoon Han
54
3
0
30 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
49
58
0
26 Dec 2023
Modality-Collaborative Transformer with Hybrid Feature Reconstruction for Robust Emotion Recognition
Chengxin Chen
Pengyuan Zhang
38
5
0
26 Dec 2023
A Survey on Open-Set Image Recognition
Jiaying Sun
Qiulei Dong
BDL
ObjD
34
3
0
25 Dec 2023
TADAP: Trajectory-Aided Drivable area Auto-labeling with Pre-trained self-supervised features in winter driving conditions
Eerik Alamikkotervo
Risto Ojala
Alvari Seppänen
Kari Tammi
24
0
0
20 Dec 2023
Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders
Bumsoo Kim
Jinhyung Kim
Yeonsik Jo
S. Kim
VLM
31
3
0
19 Dec 2023
Unsupervised Segmentation of Colonoscopy Images
Heming Yao
Jérôme Lüscher
Benjamín Gutiérrez-Becker
Josep Arús-Pous
Tommaso Biancalani
A. Bigorgne
David Richmond
MedIm
35
0
0
19 Dec 2023
Learning to Act without Actions
Dominik Schmidt
Minqi Jiang
OffRL
34
31
0
17 Dec 2023
Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation
Jingxuan He
Lechao Cheng
Chaowei Fang
Zunlei Feng
Tingting Mu
Min-Gyoo Song
21
7
0
14 Dec 2023
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
Kuan-Chih Huang
Yi-Hsuan Tsai
Ming-Hsuan Yang
3DPC
40
4
0
12 Dec 2023
Benchmarking Pretrained Vision Embeddings for Near- and Duplicate Detection in Medical Images
Tuan Truong
Farnaz Khun Jush
Matthias Lenga
34
2
0
12 Dec 2023
Photorealistic Video Generation with Diffusion Models
Agrim Gupta
Lijun Yu
Kihyuk Sohn
Xiuye Gu
Meera Hahn
Fei-Fei Li
Irfan Essa
Lu Jiang
José Lezama
VGen
59
177
0
11 Dec 2023
TULIP: Transformer for Upsampling of LiDAR Point Clouds
Bin Yang
Patrick Pfreundschuh
Roland Siegwart
Marco Hutter
Peyman Moghadam
Vaishakh Patil
3DPC
26
6
0
11 Dec 2023
Mining Gaze for Contrastive Learning toward Computer-Assisted Diagnosis
Zihao Zhao
Sheng Wang
Qian Wang
Dinggang Shen
MedIm
39
5
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
60
4
0
11 Dec 2023
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One
Michael Ranzinger
Greg Heinrich
Jan Kautz
Pavlo Molchanov
VLM
44
42
0
10 Dec 2023
Diffusion for Natural Image Matting
Yihan Hu
Yiheng Lin
Wei Wang
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
30
7
0
10 Dec 2023
Human-in-the-Loop Visual Re-ID for Population Size Estimation
Gustavo Pérez
Daniel Sheldon
Grant Van Horn
Subhransu Maji
25
0
0
08 Dec 2023
Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping
Alex Costanzino
Pierluigi Zama Ramirez
Giuseppe Lisanti
Luigi Di Stefano
19
10
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
32
37
0
07 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
60
189
0
07 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
19
89
0
07 Dec 2023
Instance Tracking in 3D Scenes from Egocentric Videos
Yunhan Zhao
Haoyu Ma
Shu Kong
Charless C. Fowlkes
3DPC
30
4
0
07 Dec 2023
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures
Vimal Thilak
Chen Huang
Omid Saremi
Laurent Dinh
Hanlin Goh
Preetum Nakkiran
Josh Susskind
Etai Littwin
23
9
0
07 Dec 2023
Auto-Vocabulary Semantic Segmentation
Osman Ülger
Maksymilian Kulicki
Yuki M. Asano
Martin R. Oswald
VLM
45
2
0
07 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
VLM
CLIP
51
83
0
06 Dec 2023
Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
Jimmy Li
Igor Kozlov
Di Wu
Xue Liu
Gregory Dudek
27
0
0
06 Dec 2023
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
Shijie Zhou
Haoran Chang
Sicheng Jiang
Zhiwen Fan
Zehao Zhu
Dejia Xu
Pradyumna Chari
Suya You
Zhangyang Wang
A. Kadambi
3DGS
42
162
0
06 Dec 2023
Previous
1
2
3
...
10
11
12
...
25
26
27
Next