Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,176 papers shown
Title
ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts
Samar Khanna
Medhanie Irgau
David B. Lobell
Stefano Ermon
VLM
165
6
0
16 Jun 2024
Occam's Razor for Self Supervised Learning: What is Sufficient to Learn Good Representations?
Mark Ibrahim
David Klindt
Randall Balestriero
SSL
132
5
1
15 Jun 2024
SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation
Yike Yuan
Huanzhang Dou
Fengjun Guo
Xi Li
109
2
0
15 Jun 2024
Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On
Lingxiao Lu
Shengyi Wu
Haoxuan Sun
Junhong Gou
Jianlou Si
Chen Qian
Jianfu Zhang
Liqing Zhang
ViT
DiffM
75
0
0
15 Jun 2024
The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences
Bria Long
Violet Xiang
Stefan Stojanov
Robert Z. Sparks
Zi Yin
...
Steven Y. Feng
Chengxu Zhuang
V. Marchman
Daniel L. K. Yamins
Michael C. Frank
VGen
EgoV
116
3
0
14 Jun 2024
Consistency-diversity-realism Pareto fronts of conditional image generative models
Pietro Astolfi
Marlene Careil
Melissa Hall
Oscar Manas
Matthew Muckley
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
104
13
0
14 Jun 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang
Ruomeng Ding
Yong Lin
Huan Zhang
Tong Zhang
124
62
0
14 Jun 2024
Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation
B. B. Englert
Fabrizio J. Piva
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
93
11
0
14 Jun 2024
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming
Ahmed Qazi
Taha Razzaq
Asim Iqbal
68
2
0
14 Jun 2024
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Wufei Ma
Guanning Zeng
Guofeng Zhang
Qihao Liu
Letian Zhang
Adam Kortylewski
Yaoyao Liu
Alan Yuille
VLM
3DV
94
10
0
13 Jun 2024
Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations
Rylan Schaeffer
Victor Lecomte
Dhruv Pai
Andres Carranza
Berivan Isik
...
Yann LeCun
SueYeon Chung
Andrey Gromov
Ravid Shwartz-Ziv
Sanmi Koyejo
103
8
0
13 Jun 2024
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation
Yufan Zhou
Ruiyi Zhang
Kaizhi Zheng
Nanxuan Zhao
Jiuxiang Gu
Zichao Wang
Xin Eric Wang
Tong Sun
DiffM
63
2
0
13 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
117
9
0
13 Jun 2024
Adaptive Slot Attention: Object Discovery with Dynamic Slot Number
Ke Fan
Zechen Bai
Tianjun Xiao
Tong He
Max Horn
Yanwei Fu
Francesco Locatello
Zheng Zhang
OCL
102
10
0
13 Jun 2024
PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation
Injoon Hwang
Haewon Park
Youngwan Lee
Jooyoung Yang
SunJae Maeng
AI4CE
63
3
0
13 Jun 2024
T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity Computation
Lihuan Li
Hao Xue
Yang Song
Flora Salim
131
1
0
13 Jun 2024
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding
Yuan-Ming Li
Wei-Jin Huang
An-Lan Wang
Ling-an Zeng
Jing-Ke Meng
Wei-Shi Zheng
93
18
0
13 Jun 2024
Cognitively Inspired Energy-Based World Models
Alexi Gladstone
Ganesh Nanduru
Md. Mofijul Islam
Aman Chadha
Jundong Li
Tariq Iqbal
76
0
0
13 Jun 2024
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing
Jiangshan Wang
Yue Ma
Jiayi Guo
Yicheng Xiao
Gao Huang
Xiu Li
DiffM
118
24
0
13 Jun 2024
ICE-G: Image Conditional Editing of 3D Gaussian Splats
Vishnu Jaganathan
Hannah Hanyun Huang
Muhammad Zubair Irshad
Varun Jampani
Amit Raj
Z. Kira
3DGS
100
8
0
12 Jun 2024
Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement
Maxime Pietrantoni
G. Csurka
Martin Humenberger
Torsten Sattler
SSL
81
1
0
12 Jun 2024
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance
Yasod Ginige
Ransika Gunasekara
Darsha Hewavitharana
Manjula Ariyarathne
Ranga Rodrigo
P. Jayasekara
100
0
0
12 Jun 2024
A
2
^{2}
2
-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder
Lixian Zhang
Yi Zhao
Runmin Dong
Jinxiao Zhang
Shuai Yuan
...
Weijia Li
Wei Liu
Wayne Zhang
Xue Jiang
Haohuan Fu
127
4
0
12 Jun 2024
SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation
Chanda Grover Kamra
Indra Deep Mastan
Nitin Kumar
Debayan Gupta
85
2
0
12 Jun 2024
A deep cut into Split Federated Self-supervised Learning
Marcin Przewiȩźlikowski
Marcin Osial
Bartosz Zieliñski
Marek 'Smieja
FedML
96
0
0
12 Jun 2024
Gene-Level Representation Learning via Interventional Style Transfer in Optical Pooled Screening
Mahtab Bigverdi
Burkhard Hockendorf
Heming Yao
Phil Hanslovsky
Romain Lopez
David Richmond
97
0
0
11 Jun 2024
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
Zihui Xue
Mi Luo
Changan Chen
Kristen Grauman
DiffM
98
11
0
11 Jun 2024
Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos
Duc Pham
Matthew Hansen
Félicie Dhellemmens
Jens Krause
Pia Bideau
82
0
0
11 Jun 2024
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
Kuan Heng Lin
Sicheng Mo
Ben Klingher
Fangzhou Mu
Bolei Zhou
DiffM
75
18
0
11 Jun 2024
Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection
Wenxiao Wang
Weiming Zhuang
Lingjuan Lyu
107
0
0
11 Jun 2024
GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection
Hang Yao
Ming-Yu Liu
Haolin Wang
Zhicun Yin
Zifei Yan
Xiaopeng Hong
W. Zuo
120
20
0
11 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
109
3
0
11 Jun 2024
UVIS: Unsupervised Video Instance Segmentation
Shuaiyi Huang
Saksham Suri
Kamal Gupta
Sai Saketh Rambhatla
Ser-Nam Lim
Abhinav Shrivastava
VLM
83
3
0
11 Jun 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Yuanhao Zhai
Kevin Lin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Chung-Ching Lin
David Doermann
Junsong Yuan
Lijuan Wang
VGen
DiffM
98
13
0
11 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
158
53
0
11 Jun 2024
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent
Wenjia Xu
Zijian Yu
Yixu Wang
Jiuniu Wang
Yuanben Zhang
Guangzuo Li
Mugen Peng
LLMAG
151
0
0
11 Jun 2024
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
S. Linok
T. Zemskova
Svetlana Ladanova
Roman Titkov
Dmitry A. Yudin
Maxim Monastyrny
Aleksei Valenkov
LM&Ro
132
5
0
11 Jun 2024
Adapters Strike Back
Jan-Martin O. Steitz
Stefan Roth
82
7
0
10 Jun 2024
NeuroMoCo: A Neuromorphic Momentum Contrast Learning Method for Spiking Neural Networks
Yuqi Ma
Huamin Wang
Hangchi Shen
Xuemei Chen
Shukai Duan
Shiping Wen
132
0
0
10 Jun 2024
UnSupDLA: Towards Unsupervised Document Layout Analysis
Talha Uddin Sheikh
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
81
2
0
10 Jun 2024
A Comparative Survey of Vision Transformers for Feature Extraction in Texture Analysis
Leonardo F. S. Scabini
Andre Sacilotti
Kallil M. C. Zielinski
L. C. Ribas
B. De Baets
Odemir M. Bruno
ViT
80
3
0
10 Jun 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
124
15
0
10 Jun 2024
OD-DETR: Online Distillation for Stabilizing Training of Detection Transformer
Shengjian Wu
Li Sun
Qingli Li
124
0
0
09 Jun 2024
Visual Prompt Tuning in Null Space for Continual Learning
Yue Lu
Shizhou Zhang
De Cheng
Yinghui Xing
N. Wang
Peng Wang
Yanning Zhang
VLM
VPVLM
CLL
98
15
0
09 Jun 2024
GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement
Peiye Zhuang
Songfang Han
Chaoyang Wang
Aliaksandr Siarohin
Jiaxu Zou
Michael Vasilkovsky
V. Shakhrai
Sergey Korolev
Sergey Tulyakov
Hsin-Ying Lee
3DV
122
7
0
09 Jun 2024
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
Mark Hamilton
Andrew Zisserman
John R. Hershey
William T. Freeman
VLM
131
8
0
09 Jun 2024
Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models
Minho Park
S. Park
Jooyeol Yun
Jaegul Choo
VLM
85
0
0
08 Jun 2024
Weakly Supervised Set-Consistency Learning Improves Morphological Profiling of Single-Cell Images
Heming Yao
Phil Hanslovsky
Jan-Christian Huetter
Burkhard Hoeckendorf
David Richmond
81
5
0
08 Jun 2024
A model of early word acquisition based on realistic-scale audiovisual naming events
Khazar Khorrami
Okko Räsänen
NAI
78
0
0
07 Jun 2024
Leveraging Activations for Superpixel Explanations
Ahcène Boubekki
Samuel G. Fadel
Sebastian Mair
AAML
FAtt
XAI
66
0
0
07 Jun 2024
Previous
1
2
3
...
28
29
30
...
82
83
84
Next