2104.14294
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
Diffusion Features for Zero-Shot 6DoF Object Pose Estimation
Bernd Von Gimborn
P. Ausserlechner
Markus Vincze
S. Thalhammer
DiffM
103
1
0
25 Nov 2024
Edge Weight Prediction For Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
145
0
0
25 Nov 2024
Leveraging Foundation Models To learn the shape of semi-fluid deformable objects
Omar El Assal
Carlos M. Mateo
Sebastien Ciron
David Fofi
100
0
0
25 Nov 2024
Cluster-based human-in-the-loop strategy for improving machine learning-based circulating tumor cell detection in liquid biopsy
Hümeyra Husseini-Wüsthoff
Sabine Riethdorf
Andreas Schneeweiss
Andreas Trumpp
Klaus Pantel
Harriet Wikman
M. Nielsen
R. Werner
OOD
78
2
0
25 Nov 2024
Brain-like emergent properties in deep networks: impact of network architecture, datasets and training
Niranjan Rajesh
Georgin Jacob
SP Arun
OOD
111
0
0
25 Nov 2024
NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
Jinpeng Liu
Jiale Xu
Weihao Cheng
Yiming Gao
Xinyu Wang
Ying Shan
Yansong Tang
3DGS
DiffM
118
1
0
25 Nov 2024
FUN-AD: Fully Unsupervised Learning for Anomaly Detection with Noisy Training Data
Jiin Im
Yongho Son
Je Hyeong Hong
AAML
119
1
0
25 Nov 2024
Soft-TransFormers for Continual Learning
Haeyong Kang
Chang D. Yoo
CLL
147
0
0
25 Nov 2024
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
Hyojun Go
Byeongjun Park
Jiho Jang
Jin-Young Kim
Soonwoo Kwon
Changick Kim
3DGS
235
3
0
25 Nov 2024
A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation
M. Valiuddin
R. V. Sloun
C.G.A. Viviers
Peter H. N. de With
Fons van der Sommen
UQCV
290
1
0
25 Nov 2024
RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks
Nazia Tasnim
Bryan A. Plummer
CLL
OffRL
159
0
0
25 Nov 2024
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Yuhang Yang
Jinhong Deng
Wen Li
Lixin Duan
VLM
108
1
0
24 Nov 2024
Multi-Token Enhancing for Vision Representation Learning
Zhong-Yu Li
Yu-Song Hu
Bo Yin
Ming-Ming Cheng
174
1
0
24 Nov 2024
PR-MIM: Delving Deeper into Partial Reconstruction in Masked Image Modeling
Zhong-Yu Li
Yunheng Li
Deng-Ping Fan
Ming-Ming Cheng
177
0
0
24 Nov 2024
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Sule Bai
Yong-Jin Liu
Yifei Han
Haoji Zhang
Yansong Tang
VLM
325
8
0
24 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
287
2
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
236
29
0
24 Nov 2024
Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation
J. Lee
Y. Oh
Dahyoun Lee
Hyon Keun Joh
Chul-Ho Sohn
...
Cheol Kyu Jung
Jung Hyun Park
Kyu Sung Choi
Byung-Hoon Kim
Jong Chul Ye
DiffM
MedIm
129
1
0
23 Nov 2024
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Chaehun Shin
Jooyoung Choi
Heeseung Kim
Sungroh Yoon
DiffM
171
13
0
23 Nov 2024
FodFoM: Fake Outlier Data by Foundation Models Creates Stronger Visual Out-of-Distribution Detector
Jiankang Chen
Ling Deng
Zhiyong Gan
Wei-Shi Zheng
Ruixuan Wang
OODD
175
2
0
22 Nov 2024
Design-o-meter: Towards Evaluating and Refining Graphic Designs
Sahil Goyal
Abhinav Mahajan
Swasti Mishra
Prateksha Udhayanan
Tripti Shukla
K. J. Joseph
Balaji Vasan Srinivasan
128
1
0
22 Nov 2024
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
230
3
0
22 Nov 2024
RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency
Wentao Huang
Meilong Xu
Xiaoling Hu
Shahira Abousamra
Aniruddha Ganguly
...
Prateek Prasanna
Tahsin M. Kurc
Joel H. Saltz
Michael L. Miller
Chong Chen
136
0
0
22 Nov 2024
GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter
Aniruddha Bala
Rohan Jaiswal
Loay Rashid
Siddharth Roheda
120
0
0
21 Nov 2024
NexusSplats: Efficient 3D Gaussian Splatting in the Wild
Yuzhou Tang
Dejun Xu
Yongjie Hou
Zhenzhong Wang
Min Jiang
3DGS
205
2
0
21 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Ziyi Wang
Yijiao Wang
Xumin Yu
Jie Zhou
Jiwen Lu
100
0
0
20 Nov 2024
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
Ameera Bawazir
Kebin Wu
Wenbin Li
CLIP
106
1
0
20 Nov 2024
Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing
Ruyi Ding
Tong Zhou
Lili Su
A. A. Ding
Xiaolin Xu
Yunsi Fei
AAML
152
2
0
19 Nov 2024
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder
Maheswar Bora
Saurabh Atreya
Aritra Mukherjee
Abhijit Das
138
0
0
19 Nov 2024
CDI: Copyrighted Data Identification in Diffusion Models
Jan Dubiński
Antoni Kowalczuk
Franziska Boenisch
Adam Dziedzic
124
2
0
19 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
184
3
0
18 Nov 2024
Learning a Neural Association Network for Self-supervised Multi-Object Tracking
Shuai Li
Michael G. Burke
S. Ramamoorthy
Juergen Gall
VOT
156
0
0
18 Nov 2024
The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
Andrew Zisserman
177
0
0
18 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
Chong Chen
105
0
0
18 Nov 2024
D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification
Minhee Jang
Juheon Son
Thanaporn Viriyasaranon
Junho Kim
Jang-Hwan Choi
MedIm
120
0
0
17 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Wentao Bao
Keqin Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM
ObjD
96
2
0
17 Nov 2024
ARM: Appearance Reconstruction Model for Relightable 3D Generation
Xiang-Wei Feng
Chang Yu
Zoubin Bi
Yintong Shang
Feng Gao
Hongzhi Wu
Kun Zhou
Chenfanfu Jiang
Yifan Yang
3DH
89
1
0
16 Nov 2024
Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image
Jiawen Li
Qiehe Sun
Renao Yan
Yizhi Wang
Yuqiu Fu
Yani Wei
Tian Guan
Huijuan Shi
Yonghonghe He
Anjia Han
99
3
0
16 Nov 2024
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Jinhong Lin
Cheng-En Wu
Huanran Li
Jifan Zhang
Yu Hen Hu
Pedro Morgado
117
0
0
16 Nov 2024
RETR: Multi-View Radar Detection Transformer for Indoor Perception
Ryoma Yataka
Adriano Cardace
Peng Wang
P. Boufounos
R. Takahashi
154
2
0
15 Nov 2024
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation
Dengke Zhang
Fagui Liu
Quan Tang
VLM
157
2
0
15 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
157
0
0
15 Nov 2024
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
Alexander C. Li
Yuandong Tian
Bin Chen
Deepak Pathak
Xinlei Chen
75
3
0
14 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Yuheng Shi
Minjing Dong
Chang Xu
VLM
118
3
0
14 Nov 2024
Physics Informed Distillation for Diffusion Models
Joshua Tian Jin Tee
Kang Zhang
Hee Suk Yoon
Dhananjaya N. Gowda
Chanwoo Kim
Chang D. Yoo
DiffM
98
6
0
13 Nov 2024
ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening
Hojun Jang
Y. Kim
3DH
80
0
0
13 Nov 2024
HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification
Cheng Jin
Luyang Luo
Huangjing Lin
Jun Hou
Hao Chen
135
4
0
12 Nov 2024
Watermark Anything with Localized Messages
Tom Sander
Pierre Fernandez
Alain Durmus
Teddy Furon
Matthijs Douze
VLM
111
9
0
11 Nov 2024
SAMPart3D: Segment Any Part in 3D Objects
Yanting Yang
Yukun Huang
Yu Guo
Liangjun Lu
Xiaoyang Wu
Edmund Y. Lam
Yan-Pei Cao
Xihui Liu
VLM
115
12
0
11 Nov 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
Xue Xia
Daiwei Zhang
Wenxuan Song
Wei Huang
L. Hurni
AI4TS
VLM
63
2
0
11 Nov 2024