2104.14294
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,176 papers shown
Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift
Saurabh Garg
Amrith Rajagopal Setlur
Zachary Chase Lipton
Sivaraman Balakrishnan
Virginia Smith
Aditi Raghunathan
SSL
81
9
0
06 Dec 2023
Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
Jimmy Li
Igor Kozlov
Di Wu
Xue Liu
Gregory Dudek
66
0
0
06 Dec 2023
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
Shijie Zhou
Haoran Chang
Sicheng Jiang
Zhiwen Fan
Zehao Zhu
Dejia Xu
Pradyumna Chari
Suya You
Zhangyang Wang
A. Kadambi
3DGS
142
183
0
06 Dec 2023
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie
Yang Zhao
Zhisheng Xiao
Kelvin C. K. Chan
Yandong Li
Yanwu Xu
Kun Zhang
Tingbo Hou
DiffM
101
28
0
05 Dec 2023
SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction
Kushin Mukherjee
Holly Huey
Xuanchen Lu
Yael Vinker
Rio Aguina-Kang
Ariel Shamir
Judith E. Fan
159
13
0
05 Dec 2023
Stable Diffusion Exposed: Gender Bias from Prompt to Image
Yankun Wu
Yuta Nakashima
Noa Garcia
97
18
0
05 Dec 2023
Retrieving Conditions from Reference Images for Diffusion Models
Haoran Tang
Xin Zhou
Jieren Deng
Zhihong Pan
Hao Tian
Pratik Chaudhari
77
3
0
05 Dec 2023
Towards General Purpose Vision Foundation Models for Medical Image Analysis: An Experimental Study of DINOv2 on Radiology Benchmarks
Mohammed Baharoon
Waseem Qureshi
J. Ouyang
Yanwu Xu
Abdulrhman Aljouie
Wei Peng
MedIm
AI4CE
97
7
0
04 Dec 2023
Class-Discriminative Attention Maps for Vision Transformers
L. Brocki
Jakub Binda
N. C. Chung
MedIm
120
4
0
04 Dec 2023
InstructBooth: Instruction-following Personalized Text-to-Image Generation
Daewon Chae
Nokyung Park
Jinkyu Kim
Kimin Lee
DiffM
38
11
0
04 Dec 2023
Aligning and Prompting Everything All at Once for Universal Visual Perception
Yunhang Shen
Chaoyou Fu
Peixian Chen
Mengdan Zhang
Ke Li
Xing Sun
Yunsheng Wu
Shaohui Lin
Rongrong Ji
VLM
ObjD
122
39
0
04 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Alan Yuille
Cihang Xie
VLM
123
8
0
04 Dec 2023
Style Aligned Image Generation via Shared Attention
Amir Hertz
Andrey Voynov
Shlomi Fruchter
Daniel Cohen-Or
DiffM
70
135
0
04 Dec 2023
Bootstrapping SparseFormers from Vision Foundation Models
Ziteng Gao
Zhan Tong
Kevin Qinghong Lin
Joya Chen
Mike Zheng Shou
57
0
0
04 Dec 2023
Multi-task Image Restoration Guided By Robust DINO Features
Xin Lin
Chao Ren
Kelvin C. K. Chan
Lu Qi
Jinshan Pan
Ming-Hsuan Yang
109
5
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan Yuille
VLM
146
66
0
04 Dec 2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Lukas Schäfer
Logan Jones
Anssi Kanervisto
Yuhan Cao
Tabish Rashid
Raluca Georgescu
David Bignell
Siddhartha Sen
Andrea Trevino Gavito
Sam Devlin
174
3
0
04 Dec 2023
SANeRF-HQ: Segment Anything for NeRF in High Quality
Yichen Liu
Benran Hu
Chi-Keung Tang
Yu-Wing Tai
97
13
0
03 Dec 2023
Brain Decodes Deep Nets
Huzheng Yang
James C. Gee
Jianbo Shi
67
8
0
03 Dec 2023
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
Junjie Yang
Jinze Zhao
Peihao Wang
Zhangyang Wang
Yingbin Liang
130
3
0
03 Dec 2023
SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer
Renan A. Rojas-Gomez
Karan Singhal
Ali Etemad
Alex Bijamov
Warren Morningstar
Philip Mansfield
82
1
0
02 Dec 2023
Beyond Accuracy: Statistical Measures and Benchmark for Evaluation of Representation from Self-Supervised Learning
Jiantao Wu
Shentong Mo
Sara Atito
Josef Kittler
Zhenhua Feng
Muhammad Awais
SSL
77
3
0
02 Dec 2023
Improve Supervised Representation Learning with Masked Image Modeling
Kaifeng Chen
Daniel M. Salz
Huiwen Chang
Kihyuk Sohn
Dilip Krishnan
Mojtaba Seyedhosseini
SSL
ViT
67
3
0
01 Dec 2023
Label Delay in Online Continual Learning
Botos Csaba
Wenxuan Zhang
Matthias Muller
Ser-Nam Lim
Mohamed Elhoseiny
Philip Torr
Adel Bibi
CLL
76
1
0
01 Dec 2023
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers
Walid Bousselham
Felix Petersen
Vittorio Ferrari
Hilde Kuehne
ObjD
VLM
123
49
0
01 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
128
74
0
01 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
107
160
0
01 Dec 2023
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Mingqiao Ye
Martin Danelljan
Fisher Yu
Lei Ke
3DGS
DiffM
140
188
0
01 Dec 2023
SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
Ioannis Kakogeorgiou
Spyros Gidaris
Konstantinos Karantzalos
N. Komodakis
ViT
OCL
131
16
0
01 Dec 2023
Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction
Shuchi Wu
Chuan Ma
Kang Wei
Xiaogang Xu
Ming Ding
Yuwen Qian
Tao Xiang
62
0
0
01 Dec 2023
Learning from One Continuous Video Stream
João Carreira
Michael King
Viorica Patraucean
Dilara Gokay
Catalin Ionescu
...
Joseph Heyward
Carl Doersch
Y. Aytar
Dima Damen
Andrew Zisserman
CLL
91
6
0
01 Dec 2023
A Generalizable Deep Learning System for Cardiac MRI
R. Shad
C. Zakka
Dhamanpreet Kaur
R. Fong
R. Filice
...
Victor Ferrari
Euan A. Ashley
Michael A. Acker
Curt P. Langlotz
W. Hiesinger
MedIm
88
2
0
01 Dec 2023
Learning Anatomically Consistent Embedding for Chest Radiography
Ziyu Zhou
Haozhe Luo
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
SSL
89
5
0
01 Dec 2023
Segment Any 3D Gaussians
Jiazhong Cen
Jiemin Fang
Chen Yang
Lingxi Xie
Xiaopeng Zhang
Wei Shen
Qi Tian
3DGS
178
76
0
01 Dec 2023
S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion
V. Kolmogorov
Rustem Takhanov
Dani Lischinski
DiffM
88
3
0
30 Nov 2023
Initializing Models with Larger Ones
Zhiqiu Xu
Yanjie Chen
Kirill Vishniakov
Yida Yin
Zhiqiang Shen
Trevor Darrell
Lingjie Liu
Zhuang Liu
95
21
0
30 Nov 2023
IMMA: Immunizing text-to-image Models against Malicious Adaptation
Yijia Zheng
Raymond A. Yeh
125
9
0
30 Nov 2023
FoundPose: Unseen Object Pose Estimation with Foundation Features
Evin Pınar Örnek
Yann Labbé
Bugra Tekin
Lingni Ma
Cem Keskin
Christian Forster
Tomás Hodan
110
59
0
30 Nov 2023
BioCLIP: A Vision Foundation Model for the Tree of Life
Samuel Stevens
Jiaman Wu
Matthew J Thompson
Elizabeth G Campolongo
Chan Hee Song
...
Wasila M Dahdul
Charles V. Stewart
Tanya Berger-Wolf
Wei-Lun Chao
Yu-Chuan Su
117
78
0
30 Nov 2023
Meta-Prior: Meta learning for Adaptive Inverse Problem Solvers
Matthieu Terris
Thomas Moreau
80
1
0
30 Nov 2023
Stochastic Vision Transformers with Wasserstein Distance-Aware Attention
Franciskus Xaverius Erick
Mina Rezaei
Johanna P. Müller
Bernhard Kainz
51
0
0
30 Nov 2023
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation
Yau Shing Jonathan Cheung
Xi Chen
Lihe Yang
Hengshuang Zhao
80
1
0
30 Nov 2023
Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing
Hyelin Nam
Gihyun Kwon
Geon Yeong Park
Jong Chul Ye
DiffM
92
29
0
30 Nov 2023
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Jin-Chuan Shi
Miao Wang
Hao-Bin Duan
Shao-Hua Guan
3DGS
104
96
0
30 Nov 2023
Perceptual Group Tokenizer: Building Perception with Iterative Grouping
Zhiwei Deng
Ting Chen
Yang Li
ViT
VLM
75
2
0
30 Nov 2023
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
Raviteja Vemulapalli
Hadi Pouransari
Fartash Faghri
Sachin Mehta
Mehrdad Farajtabar
Mohammad Rastegari
Oncel Tuzel
145
11
0
30 Nov 2023
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Zhonghao Wang
Wei Wei
Yang Zhao
Zhisheng Xiao
M. Hasegawa-Johnson
Humphrey Shi
Tingbo Hou
DiffM
123
12
0
30 Nov 2023
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features
Thomas Wimmer
Peter Wonka
M. Ovsjanikov
118
13
0
29 Nov 2023
Do text-free diffusion models learn discriminative visual representations?
Soumik Mukhopadhyay
M. Gwilliam
Yosuke Yamaguchi
Vatsal Agarwal
Namitha Padmanabhan
Archana Swaminathan
Dinesh Manocha
Abhinav Shrivastava
DiffM
136
14
1
29 Nov 2023
SODA: Bottleneck Diffusion Models for Representation Learning
Drew A. Hudson
Daniel Zoran
Mateusz Malinowski
Andrew Kyle Lampinen
Andrew Jaegle
James L. McClelland
Loic Matthey
Felix Hill
Alexander Lerchner
DiffM
108
56
0
29 Nov 2023