Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,176 papers shown
Title
VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition
Yun-Jin Li
M. Gladkova
Yan Xia
Rui Wang
Daniel Cremers
114
5
0
21 Mar 2024
On Pretraining Data Diversity for Self-Supervised Learning
Hasan Hammoud
Tuhin Das
Fabio Pizzati
Philip Torr
Adel Bibi
Guohao Li
155
3
0
20 Mar 2024
Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation
Hugues Thomas
Jian Zhang
78
1
0
20 Mar 2024
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining
Di Wang
Jing Zhang
Minqiang Xu
Lin Liu
Dongsheng Wang
...
Chengxi Han
Haonan Guo
Bo Du
Dacheng Tao
Lefei Zhang
83
53
0
20 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
109
28
0
20 Mar 2024
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
Jing Zhang
Irving Fang
Juexiao Zhang
Hao Wu
Akshat Kaushik
Alice Rodriguez
Hanwen Zhao
Zhuo Zheng
Radu Iovita
Chen Feng
73
5
0
19 Mar 2024
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li
Hao Zhang
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Lei Zhang
91
20
0
19 Mar 2024
Opti-Acoustic Semantic SLAM with Unknown Objects in Underwater Environments
Kurran Singh
Jungseok Hong
Nick Rypkema
John J. Leonard
82
2
0
19 Mar 2024
ViTGaze: Gaze Following with Interaction Features in Vision Transformers
Yuehao Song
Xinggang Wang
Jingfeng Yao
Wenyu Liu
Jinglin Zhang
Xiangmin Xu
ViT
85
3
0
19 Mar 2024
Selective, Interpretable, and Motion Consistent Privacy Attribute Obfuscation for Action Recognition
Filip Ilic
Henghui Zhao
Thomas Pock
Richard P. Wildes
PICV
AAML
70
3
0
19 Mar 2024
Learning Cross-view Visual Geo-localization without Ground Truth
Haoyuan Li
Chang Xu
Wen Yang
Huai Yu
Gui-Song Xia
96
11
0
19 Mar 2024
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li
Qiang Nie
Ying Chen
Xi Jiang
Kai Wu
Yuhuan Lin
Yong-Jin Liu
Jinlong Peng
Chengjie Wang
Feng Zheng
DiffM
68
21
0
19 Mar 2024
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs
Md Ashiqur Rahman
Robert Joseph George
Mogab Elleithy
Daniel Leibovici
Zong-Yi Li
...
Julius Berner
Raymond A. Yeh
Jean Kossaifi
Kamyar Azizzadenesheli
A. Anandkumar
AI4CE
131
23
0
19 Mar 2024
NTK-Guided Few-Shot Class Incremental Learning
Jingren Liu
Zhong Ji
Yanwei Pang
YunLong Yu
CLL
97
4
0
19 Mar 2024
Do Generated Data Always Help Contrastive Learning?
Yifei Wang
Jizhe Zhang
Yisen Wang
DiffM
109
26
0
19 Mar 2024
OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation
Junhao Cai
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qifeng Chen
DiffM
87
8
0
19 Mar 2024
ADAPT to Robustify Prompt Tuning Vision Transformers
Masih Eskandar
Tooba Imtiaz
Zifeng Wang
Jennifer Dy
VPVLM
VLM
AAML
98
0
0
19 Mar 2024
Zero-Shot Image Feature Consensus with Deep Functional Maps
Xinle Cheng
Congyue Deng
Adam W. Harley
Yixin Zhu
Leonidas Guibas
89
5
0
18 Mar 2024
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han
Filippos Kokkinos
Philip Torr
VGen
143
42
0
18 Mar 2024
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Ce Zhang
Simon Stepputtis
Joseph Campbell
Katia Sycara
Yaqi Xie
112
13
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
124
135
0
18 Mar 2024
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Xiaojie Li
Yibo Yang
Hefei Ling
Jianlong Wu
Yue Yu
Guohao Li
Min Zhang
SSL
103
6
0
18 Mar 2024
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Hyeonho Jeong
Jinho Chang
Geon Yeong Park
Jong Chul Ye
DiffM
VGen
111
18
0
18 Mar 2024
Unimodal Multi-Task Fusion for Emotional Mimicry Intensity Prediction
Tobias Hallmen
Fabian Deuser
Norbert Oswald
Elisabeth André
84
2
0
18 Mar 2024
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation
Sha Zhang
Jiajun Deng
Lei Bai
Houqiang Li
Wanli Ouyang
Yanyong Zhang
3DPC
101
8
0
18 Mar 2024
End-to-end multi-modal product matching in fashion e-commerce
Sándor Tóth
Stephen Wilson
Alexia Tsoukara
Enric Moreu
Anton Masalovich
Lars Roemheld
127
0
0
18 Mar 2024
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
Chaolei Tan
Jian-Huang Lai
Wei-Shi Zheng
Jianfang Hu
AI4TS
128
5
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
119
6
0
18 Mar 2024
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
VLM
58
4
0
17 Mar 2024
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
DiffM
94
8
0
17 Mar 2024
Correcting misinformation on social media with a large language model
Xinyi Zhou
Ashish Sharma
Amy X. Zhang
Tim Althoff
KELM
91
5
0
17 Mar 2024
CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
Xiaoyu Wu
Yang Hua
Chumeng Liang
Jiaru Zhang
Hao Wang
Tao Song
Haibing Guan
82
6
0
17 Mar 2024
Recent Advances in 3D Gaussian Splatting
Tong Wu
Yu-Jie Yuan
Ling-Xiao Zhang
Jie Yang
Yan-Pei Cao
Ling-Qi Yan
Lin Gao
3DGS
163
106
0
17 Mar 2024
A Versatile Framework for Multi-scene Person Re-identification
Wei-Shi Zheng
Junkai Yan
Yi-Xing Peng
VLM
74
6
0
17 Mar 2024
Self-supervised co-salient object detection via feature correspondence at multiple scales
Souradeep Chakraborty
Dimitris Samaras
89
4
0
17 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
89
10
0
17 Mar 2024
Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping
Haoxi Zhang
Xinxu Zhang
Yuanxin Lin
Maiqi Wang
Yi Lai
Yu Wang
Linfeng Yu
Yufeng Xu
Ran Cheng
E. Szczerbicki
86
0
0
17 Mar 2024
Endora: Video Generation Models as Endoscopy Simulators
Chenxin Li
Hengyu Liu
Yifan Liu
Brandon Yushan Feng
Wuyang Li
Xinyu Liu
Zhen Chen
Jing Shao
Yixuan Yuan
VGen
MedIm
127
41
0
17 Mar 2024
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields
Yash Bhalgat
Iro Laina
João F. Henriques
Andrew Zisserman
Andrea Vedaldi
97
17
0
16 Mar 2024
SelfIE: Self-Interpretation of Large Language Model Embeddings
Haozhe Chen
Carl Vondrick
Chengzhi Mao
77
27
0
16 Mar 2024
RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image Classification
Hongbo Chu
Qiehe Sun
Jiawen Li
Yuxuan Chen
Lizhong Zhang
Tian Guan
Anjia Han
Yonghong He
58
5
0
16 Mar 2024
Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples
Ziqi Zhou
Minghui Li
Wei Liu
Shengshan Hu
Yechao Zhang
Wei Wan
Lulu Xue
Leo Yu Zhang
Dezhong Yao
Hai Jin
SILM
AAML
114
11
0
16 Mar 2024
StableGarment: Garment-Centric Generation via Stable Diffusion
Rui Wang
Hailong Guo
Jiaming Liu
Huaxia Li
Haibo Zhao
Xu Tang
Yao Hu
Hao Tang
Peipei Li
DiffM
66
16
0
16 Mar 2024
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Stephanie Fu
Mark Hamilton
Laura E. Brandt
Axel Feldmann
Zhoutong Zhang
William T. Freeman
MDE
99
51
0
15 Mar 2024
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models
Tian Meng
Yang Tao
Ruilin Lyu
Wuliang Yin
VLM
86
1
0
15 Mar 2024
Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer
Mengying Lin
Yaran Chen
Dong Zhao
Zhaoran Wang
118
2
0
15 Mar 2024
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
Enguang Wang
Zhimao Peng
Zhengyuan Xie
Fei Yang
Xialei Liu
Ming-Ming Cheng
137
3
0
15 Mar 2024
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Chengyao Wang
Li Jiang
Xiaoyang Wu
Zhuotao Tian
Bohao Peng
Hengshuang Zhao
Jiaya Jia
3DPC
SSL
140
17
0
14 Mar 2024
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Yuhang Zheng
Xiangyu Chen
Yupeng Zheng
Songen Gu
Runyi Yang
...
Chao Yang
Dawei Wang
Zhen Chen
Xiaoxiao Long
Meiqing Wang
113
47
0
14 Mar 2024
EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning
Jongsuk Kim
Hyeongkeun Lee
Kyeongha Rho
Junmo Kim
Joon Son Chung
64
6
0
14 Mar 2024
Previous
1
2
3
...
36
37
38
...
82
83
84
Next