Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
Title
k-NN as a Simple and Effective Estimator of Transferability
Moein Sorkhei
Christos Matsoukas
Johan Fredin Haslum
Emir Konuk
Kevin Smith
94
0
0
24 Mar 2025
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
Hongjia Zhai
Haoyang Li
Zhenzhe Li
Xiaokun Pan
Yijia He
Guofeng Zhang
93
0
0
23 Mar 2025
LongDiff: Training-Free Long Video Generation in One Go
Zhuoling Li
Hossein Rahmani
Qiuhong Ke
Jing Liu
DiffM
VGen
VLM
101
0
0
23 Mar 2025
CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation
Jungsoo Lee
Debasmit Das
Munawar Hayat
Sungha Choi
Kyuwoong Hwang
Fatih Porikli
VLM
110
1
0
23 Mar 2025
PanopticSplatting: End-to-End Panoptic Gaussian Splatting
Yuxuan Xie
Xuan Yu
Changjian Jiang
Sitong Mao
Shunbo Zhou
Rui Fan
R. Xiong
Yansen Wang
3DGS
78
1
0
23 Mar 2025
DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation
Massimo Bini
Leander Girrbach
Zeynep Akata
220
1
0
23 Mar 2025
BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors
Yu Wang
Junxian Mu
Hongzhi Huang
Qilong Wang
Pengfei Zhu
Q. Hu
241
1
0
22 Mar 2025
Enhancing Martian Terrain Recognition with Deep Constrained Clustering
Tejas Panambur
M. Parente
73
0
0
22 Mar 2025
Should we pre-train a decoder in contrastive learning for dense prediction tasks?
S. Quetin
Tapotosh Ghosh
Farhad Maleki
SSL
110
0
0
21 Mar 2025
Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim
Hyoungseob Park
Vadim Ezhov
Jeffrey Moon
Alex Wong
MDE
115
0
0
21 Mar 2025
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Simona Kocour
Assia Benbihi
Aikaterini Adam
Torsten Sattler
3DPC
91
0
0
21 Mar 2025
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
J. Hu
Shuyong Gao
Qianyu Guo
Yan Wang
Qishan Wang
Yuang Feng
Wenqiang Zhang
DiffM
VGen
85
0
0
21 Mar 2025
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
Xingchao Yang
Takafumi Taketomi
Yuki Endo
Yoshihiro Kanamori
DiffM
136
0
0
21 Mar 2025
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
Keyon Vafa
Sarah Bentley
Jon M. Kleinberg
S. Mullainathan
70
2
0
21 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
112
0
0
21 Mar 2025
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
Haolin Yang
Feilong Tang
Ming Hu
Yulong Li
Junjie Guo
...
Zelin Peng
Junjun He
Junjun He
Zongyuan Ge
Imran Razzak
DiffM
VGen
298
2
0
20 Mar 2025
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Yingmao Miao
Zhanpeng Huang
Rui Han
Zibin Wang
Chenhao Lin
Chao Shen
DiffM
82
1
0
20 Mar 2025
Learning 3D Scene Analogies with Neural Contextual Scene Maps
Junho Kim
Gwangtak Bae
E. Lee
Young Min Kim
3DPC
3DV
99
0
0
20 Mar 2025
M3: 3D-Spatial MultiModal Memory
Xueyan Zou
Yuchen Song
Ri-Zhao Qiu
Xuanbin Peng
Jianglong Ye
Sifei Liu
Xiaolong Wang
3DGS
96
0
0
20 Mar 2025
M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation
Markus Karmann
Peng-Tao Jiang
Bo Li
O. Urfalioglu
81
0
0
20 Mar 2025
AIMI: Leveraging Future Knowledge and Personalization in Sparse Event Forecasting for Treatment Adherence
Abdullah Mamun
Diane J. Cook
Hassan Ghasemzadeh
AI4TS
77
0
0
20 Mar 2025
GAIR: Improving Multimodal Geo-Foundation Model with Geo-Aligned Implicit Representations
Ziqiang Liu
Fan Zhang
Junfeng Jiao
Ni Lao
Gengchen Mai
91
2
0
20 Mar 2025
TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features
Dana Cohen-Bar
Daniel Cohen-Or
Gal Chechik
Yoni Kasten
69
0
0
20 Mar 2025
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach
Vaibhav Rathore
S. Bagchi
Saikat Dutta
Sarthak Mehrotra
Zsolt Kira
Biplab Banerjee
OOD
108
1
0
19 Mar 2025
Cube: A Roblox View of 3D Intelligence
Foundation AI Team Roblox
Kiran Bhat
Nishchaie Khanna
Karun Channa
Tinghui Zhou
...
Kyle Price
Steve Han
Yiqing Wang
A. Singh
David Baszucki
133
1
0
19 Mar 2025
Object-Centric Pretraining via Target Encoder Bootstrapping
Nikola Đukić
Tim Lebailly
Tinne Tuytelaars
OCL
129
0
0
19 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciñski
Benjamin Eysenbach
SSL
OffRL
122
2
0
19 Mar 2025
Representational Similarity via Interpretable Visual Concepts
Neehar Kondapaneni
Oisin Mac Aodha
Pietro Perona
DRL
506
2
0
19 Mar 2025
Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis
Imanol G. Estepa
Jesús M. Rodríguez-de-Vera
Ignacio Sarasúa
Bhalaji Nagarajan
Petia Radeva
198
0
0
19 Mar 2025
Shap-MeD
Nicolás Laverde
Melissa Robles
Johan Rodríguez
MedIm
68
0
0
19 Mar 2025
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motion
Saad Lahlali
Sandra Kara
Hejer Ammar
Florian Chabot
Nicolas Granger
Hervé Le Borgne
Q. C. Pham
3DPC
103
0
0
19 Mar 2025
CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation
Masud Ahmed
Zahid Hasan
Syed Arefinul Haque
A. Faridee
S. Purushotham
Suya You
Nirmalya Roy
182
0
0
19 Mar 2025
Utilization of Neighbor Information for Image Classification with Different Levels of Supervision
Gihan Jayatilaka
Abhinav Shrivastava
M. Gwilliam
103
0
0
18 Mar 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
Zining Wang
Tongkun Guan
Pei Fu
Chen Duan
Qianyi Jiang
Zhentao Guo
Shan Guo
Junfeng Luo
Wei Shen
Xiaokang Yang
MLLM
VLM
85
3
0
18 Mar 2025
Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer
Yi Liao
Yongsheng Gao
Weichuan Zhang
84
3
0
18 Mar 2025
Squeeze Out Tokens from Sample for Finer-Grained Data Governance
Weixiong Lin
Chen Ju
Haicheng Wang
Shengchao Hu
Shuai Xiao
...
Yuheng Jiao
Mingshuai Yao
Jinsong Lan
Qingwen Liu
Ying Chen
84
0
0
18 Mar 2025
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
Yulin Pan
Xiangteng He
Chaojie Mao
Zhen Han
Zeyinzi Jiang
Junxuan Zhang
Yu Liu
EGVM
VLM
114
2
0
18 Mar 2025
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
Subhadeep Koley
Tapas Kumar Dutta
Aneeshan Sain
Pinaki Nath Chowdhury
A. Bhunia
Yi-Zhe Song
VLM
121
0
0
18 Mar 2025
Text-Guided Image Invariant Feature Learning for Robust Image Watermarking
Muhammad Ahtesham
Xin Zhong
105
1
0
18 Mar 2025
Deeply Supervised Flow-Based Generative Models
Inkyu Shin
Chenglin Yang
Liang-Chieh Chen
93
2
0
18 Mar 2025
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Yaowei Li
Lingen Li
Zhaoyang Zhang
Xiaoyu Li
Guangzhi Wang
Hongxiang Li
Xiaodong Cun
Ying Shan
Yuexian Zou
DiffM
107
2
0
17 Mar 2025
AI-Driven Rapid Identification of Bacterial and Fungal Pathogens in Blood Smears of Septic Patients
Agnieszka Sroka-Oleksiak
Adam Pardyl
Dawid Rymarczyk
Aldona Olechowska-Jarząb
Katarzyna Biegun-Drożdż
...
Tomasz Gosiewski
Miłosz Adamczyk
Henryk Telega
Bartosz Zieliñski
Monika Brzychczy-Włoch
88
0
0
17 Mar 2025
Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data
Haozhe Si
Yuxuan Wan
Minh Do
Deepak Vasisht
Han Zhao
Hendrik Hamann
171
0
0
17 Mar 2025
SAM2-ELNet: Label Enhancement and Automatic Annotation for Remote Sensing Segmentation
Jianhao Yang
Wenshuo Yu
Yuanchao Lv
Jiance Sun
Bokang Sun
Mingyang Liu
81
0
0
16 Mar 2025
SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs
Guibiao Liao
Qing Li
Zhenyu Bao
Guoping Qiu
Kanglin Liu
3DGS
90
2
0
16 Mar 2025
Multi Activity Sequence Alignment via Implicit Clustering
Taein Kwon
Zador Pataki
Mahdi Rad
Marc Pollefeys
HAI
AI4TS
103
0
0
16 Mar 2025
MOS: Modeling Object-Scene Associations in Generalized Category Discovery
Zhengyuan Peng
Jinpeng Ma
Zhimin Sun
Ran Yi
Haichuan Song
Xin Tan
Lizhuang Ma
157
0
0
15 Mar 2025
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning
Zihan Zhoua
Changrui Daia
Aibo Songa
Xiaolin Fang
VOS
150
0
0
15 Mar 2025
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park
Hyojun Go
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
VGen
LLMSV
113
1
0
15 Mar 2025
Self-Supervised Pretraining for Fine-Grained Plankton Recognition
Joona Kareinen
T. Eerola
K. Kraft
L. Lensu
S. Suikkanen
Heikki Kälviäinen
SSL
494
0
0
14 Mar 2025
Previous
1
2
3
...
8
9
10
...
82
83
84
Next