arXiv:2104.14294 (v2, latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Papers citing "Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,175 papers shown
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao
Kai Han
Zhengyao Lv
Shihao Zhao
Kwan-Yee K. Wong
DiffM
CoGe
127
7
0
09 Jul 2024
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach
Taolin Zhang
Jiawang Bai
Zhihe Lu
Dongze Lian
Genping Wang
Xinchao Wang
Shu-Tao Xia
95
5
0
09 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
96
1
0
09 Jul 2024
CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM
Aditya Murali
Pietro Mascagni
Didier Mutter
N. Padoy
VLM
MedIm
96
3
0
09 Jul 2024
Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
Fanyue Wei
Wei Zeng
Zhenyang Li
Dawei Yin
Lixin Duan
Wen Li
EGVM
60
3
0
09 Jul 2024
Decomposition Betters Tracking Everything Everywhere
Rui Li
Dong Liu
79
3
0
09 Jul 2024
A Clinical Benchmark of Public Self-Supervised Pathology Foundation Models
Gabriele Campanella
Shengjia Chen
Ruchika Verma
Jennifer Zeng
A. Stock
...
Kuan-lin Huang
Ricky Kwan
Jane Houldsworth
Adam J. Schoenfeld
Chad M. Vanderbilt
AI4MH
OOD
LM&MA
89
23
0
09 Jul 2024
Noise-Free Explanation for Driving Action Prediction
Hongbo Zhu
Theodor Wulff
R. S. Maharjan
Jinpei Han
Angelo Cangelosi
AAML
FAtt
64
0
0
08 Jul 2024
MagMax: Leveraging Model Merging for Seamless Continual Learning
Daniel Marczak
Bartłomiej Twardowski
Tomasz Trzciński
Sebastian Cygert
MoMe
CLL
96
24
0
08 Jul 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
73
23
0
08 Jul 2024
Leveraging Transformers for Weakly Supervised Object Localization in Unconstrained Videos
Shakeeb Murtaza
M. Pedersoli
Aydin Sarraf
Eric Granger
WSOL
112
0
0
08 Jul 2024
KidSat: satellite imagery to map childhood poverty dataset and benchmark
Makkunda Sharma
Fan Yang
Duy-Nhat Vo
Esra Suel
Swapnil Mishra
Samir Bhatt
Oliver Fiala
William Rudgard
Seth Flaxman
116
1
0
08 Jul 2024
WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering
Pingyi Chen
Chenglu Zhu
Sunyi Zheng
Honglin Li
Lin Yang
104
11
0
08 Jul 2024
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
Jiedong Zhuang
Jiaqi Hu
Lianrui Mu
Rui Hu
Xiaoyu Liang
Jiangnan Ye
Haoji Hu
CLIP
VLM
95
4
0
08 Jul 2024
Training-free CryoET Tomogram Segmentation
Yizhou Zhao
Hengwei Bian
Michael Mu
M. R. Uddin
Zhenyang Li
Xiang Li
Tianyang Wang
Min Xu
92
0
0
08 Jul 2024
Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness
Idris Hamoud
Alexandros Karargyris
Aidean Sharghi
Omid Mohareri
N. Padoy
SSL
83
1
0
07 Jul 2024
FM-OSD: Foundation Model-Enabled One-Shot Detection of Anatomical Landmarks
Juzheng Miao
Cheng Chen
Keli Zhang
Jie Chuai
Quanzheng Li
Pheng-Ann Heng
73
2
0
07 Jul 2024
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Danni Yang
Ruohan Dong
Jiayi Ji
Yiwei Ma
Haowei Wang
Xiaoshuai Sun
Rongrong Ji
85
3
0
07 Jul 2024
An Improved Method for Personalizing Diffusion Models
Yan Zeng
Masanori Suganuma
Takayuki Okatani
DiffM
65
1
0
07 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
184
10
0
07 Jul 2024
LaRa: Efficient Large-Baseline Radiance Fields
Anpei Chen
Haofei Xu
Stefano Esposito
Siyu Tang
Andreas Geiger
AI4CE
109
28
0
05 Jul 2024
RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation
Yuxuan Kuang
Junjie Ye
Haoran Geng
Jiageng Mao
Congyue Deng
Leonidas Guibas
He Wang
Yue Wang
LM&Ro
112
26
0
05 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
104
8
0
05 Jul 2024
Understanding the Gains from Repeated Self-Distillation
Divyansh Pareek
Simon S. Du
Sewoong Oh
105
6
0
05 Jul 2024
Feature Attenuation of Defective Representation Can Resolve Incomplete Masking on Anomaly Detection
Yeonghyeon Park
Sungho Kang
Myung Jin Kim
Hyeong Seok Kim
Juneho Yi
AAML
60
0
0
05 Jul 2024
Segment Any 4D Gaussians
Shengxiang Ji
Guanjun Wu
Jiemin Fang
Jiazhong Cen
Taoran Yi
Wenyu Liu
Qi Tian
Xinggang Wang
3DGS
153
7
0
05 Jul 2024
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning
Saeed Shurrab
Alejandro Guerra-Manzanares
Farah E. Shamout
89
1
0
05 Jul 2024
Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos
Leonhard Sommer
Artur Jesslen
Eddy Ilg
Adam Kortylewski
83
2
0
05 Jul 2024
Attention Normalization Impacts Cardinality Generalization in Slot Attention
Markus Krimmel
Jan Achterhold
Joerg Stueckler
OCL
84
0
0
04 Jul 2024
Learning to Be a Transformer to Pinpoint Anomalies
Alex Costanzino
Pierluigi Zama Ramirez
Giuseppe Lisanti
Luigi Di Stefano
95
0
0
04 Jul 2024
Robust Adaptation of Foundation Models with Black-Box Visual Prompting
Changdae Oh
Gyeongdeok Seo
Geunyoung Jung
Zhi-Qi Cheng
Hosik Choi
Jiyoung Jung
Kyungwoo Song
VLM
125
1
0
04 Jul 2024
How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks
Etai Littwin
Omid Saremi
Madhu Advani
Vimal Thilak
Preetum Nakkiran
Chen Huang
Joshua Susskind
78
5
0
03 Jul 2024
A Survey on Trustworthiness in Foundation Models for Medical Image Analysis
Congzhen Shi
Ryan Rezai
Jiaxi Yang
Qi Dou
Xiaoxiao Li
MedIm
71
6
0
03 Jul 2024
HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization
Yucheng Tang
Yufan He
Vishwesh Nath
Pengfei Guo
Ruining Deng
...
Ziyue Xu
Holger Roth
Daguang Xu
Haichun Yang
Yuankai Huo
62
4
0
03 Jul 2024
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
Yilun Xu
Gabriele Corso
Tommi Jaakkola
Arash Vahdat
Karsten Kreis
104
14
0
03 Jul 2024
Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual Features
T. Silva
Hélio Pedrini
Adín Ramírez Rivera
SSL
62
4
0
03 Jul 2024
OpenSlot: Mixed Open-Set Recognition with Object-Centric Learning
Xu Yin
Fei Pan
G. An
Yuchi Huo
Zixuan Xie
Sung-eui Yoon
BDL
VLM
156
1
0
02 Jul 2024
Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models
A. Mukherjee
Ziwei Zhu
Antonios Anastasopoulos
90
1
0
02 Jul 2024
Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning
Chengchao Shen
Jianzhong Chen
Jianxin Wang
SSL
104
1
0
02 Jul 2024
Towards the Next Frontier in Speech Representation Learning Using Disentanglement
Varun Krishna
Sriram Ganapathy
SSL
60
1
0
02 Jul 2024
Self-Cooperation Knowledge Distillation for Novel Class Discovery
Yuzheng Wang
Zhaoyu Chen
Dingkang Yang
Yunquan Sun
Lizhe Qi
93
2
0
02 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
139
0
0
01 Jul 2024
DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction
Yujin Ham
Mateusz Michalkiewicz
Guha Balakrishnan
91
3
0
01 Jul 2024
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Zihan Gao
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Yuwei Guo
Shuyuan Yang
38
2
0
01 Jul 2024
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
103
2
0
01 Jul 2024
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective
Mingxiang Liao
Hannan Lu
Xinyu Zhang
Fang Wan
Tianyu Wang
Yuzhong Zhao
W. Zuo
Qixiang Ye
Jingdong Wang
VGen
EGVM
128
25
0
01 Jul 2024
EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
Chenxin Li
Brandon Yushan Feng
Yifan Liu
Hengyu Liu
Cheng Wang
Weihao Yu
Yixuan Yuan
3DGS
72
13
0
01 Jul 2024
Diffusion Models and Representation Learning: A Survey
Michael Fuest
Pingchuan Ma
Ming Gui
Johannes S. Fischer
Vincent Tao Hu
Bjorn Ommer
DiffM
104
24
0
30 Jun 2024
Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics
Fan Yang
Wenrui Chen
Kailun Yang
Haoran Lin
DongSheng Luo
Conghui Tang
Zhiyong Li
Yaonan Wang
130
7
0
30 Jun 2024
Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck
Yangzhou Jiang
Yinxin Lin
Yaoming Wang
Teng Li
Bilian Ke
Bingbing Ni
CVBM
85
1
0
29 Jun 2024