2104.14294
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 1,415 papers shown
Title
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale
Ziyun Zeng
Yixiao Ge
Zhan Tong
Xihui Liu
Shutao Xia
Ying Shan
24
9
0
23 May 2023
Federated Generalized Category Discovery
Nan Pu
Zhun Zhong
Xinyuan Ji
N. Sebe
FedML
30
13
0
23 May 2023
Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters
Isaac Corley
Caleb Robinson
Rahul Dodhia
J. L. Ferres
Peyman Najafirad
53
14
0
22 May 2023
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Bernt Schiele
Dengxin Dai
ViT
24
30
0
22 May 2023
Unsupervised Multi-view Pedestrian Detection
Mengyin Liu
Chao Zhu
Shiqi Ren
Xu-Cheng Yin
37
6
0
21 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
38
93
0
19 May 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
Aran Nayebi
R. Rajalingham
M. Jazayeri
G. R. Yang
36
19
0
19 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
41
45
0
18 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
53
116
0
18 May 2023
CLIP-GCD: Simple Language Guided Generalized Category Discovery
Rabah Ouldnoughi
Chia-Wen Kuo
Z. Kira
VLM
37
14
0
17 May 2023
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
Alexander H. Liu
Heng-Jui Chang
Michael Auli
Wei-Ning Hsu
James R. Glass
29
25
0
17 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
38
33
0
15 May 2023
Fast Traversability Estimation for Wild Visual Navigation
Jonas Frey
Matías Mattamala
Nived Chebrolu
Cesar Cadena
Maurice F. Fallon
Marco Hutter
56
63
0
15 May 2023
Component-aware anomaly detection framework for adjustable and logical industrial visual inspection
Tongkun Liu
Bing Li
Xiao Du
Bingke Jiang
Xiao Jin
Liuyi Jin
Zhu Zhao
32
27
0
15 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
48
90
0
14 May 2023
Consistency Regularization for Domain Generalization with Logit Attribution Matching
Han Gao
Kaican Li
Weiyan Xie
Zhi Lin
Yongxiang Huang
Luning Wang
Caleb Chen Cao
N. Zhang
13
2
0
13 May 2023
Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery
Bingchen Zhao
Xin Wen
Kai Han
38
47
0
10 May 2023
Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval
Shiyin Dong
Mingrui Zhu
N. Wang
Xinbo Gao
VLM
31
3
0
09 May 2023
What Do Self-Supervised Vision Transformers Learn?
Namuk Park
Wonjae Kim
Byeongho Heo
Taekyung Kim
Sangdoo Yun
SSL
88
76
1
01 May 2023
IMP: Iterative Matching and Pose Estimation with Adaptive Pooling
Fei Xue
Ignas Budvytis
R. Cipolla
41
13
0
28 Apr 2023
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
39
7
0
26 Apr 2023
VGOS: Voxel Grid Optimization for View Synthesis from Sparse Inputs
Jiakai Sun
Zhanjie Zhang
Jiafu Chen
Guangyuan Li
Boyan Ji
Lei Zhao
Wei Xing
Huaizhong Lin
31
19
0
26 Apr 2023
Segment Anything in 3D with Radiance Fields
Jiazhong Cen
Jiemin Fang
Zanwei Zhou
Chen Yang
Lingxi Xie
Xiaopeng Zhang
Wei-Ming Shen
Qi Tian
46
43
0
24 Apr 2023
Self-supervised Learning by View Synthesis
Shaoteng Liu
Xiangyu Zhang
T. Hu
Jiaya Jia
3DV
ViT
40
1
0
22 Apr 2023
Point-supervised Single-cell Segmentation via Collaborative Knowledge Sharing
Ji Yu
27
5
0
20 Apr 2023
Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget
Johannes Lehner
Benedikt Alkin
Andreas Fürst
Elisabeth Rumetshofer
Lukas Miklautz
Sepp Hochreiter
36
18
0
20 Apr 2023
Visual DNA: Representing and Comparing Images using Distributions of Neuron Activations
Benjamin Ramtoula
Matthew Gadd
Paul Newman
D. Martini
31
10
0
20 Apr 2023
Masked Pre-Training of Transformers for Histology Image Analysis
Shuai Jiang
Liesbeth Hondelink
A. Suriawinata
Saeed Hassanpour
MedIm
31
15
0
14 Apr 2023
Uncovering the Inner Workings of STEGO for Safe Unsupervised Semantic Segmentation
Alexander Koenig
Maximilian Schambach
Johannes Otterbach
27
6
0
14 Apr 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
160
3,070
0
14 Apr 2023
CAMM: Building Category-Agnostic and Animatable 3D Models from Monocular Videos
Tianshu Kuai
Akash Karthikeyan
Yash Kant
Ashkan Mirzaei
Igor Gilitschenski
34
9
0
14 Apr 2023
CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Shaozhe Hao
Kai Han
Kwan-Yee K. Wong
53
16
0
14 Apr 2023
Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results
Dong Wang
Jiaxin Guo
Qiqi Shao
Haochi He
Zhian Chen
...
Sergio Escalera
Hugo Jair Escalante
Lei Zhen
Jun Wan
Jiankang Deng
CVBM
AAML
17
11
0
12 Apr 2023
Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Li-Na Bao
Longhui Wei
Xiaoyu Qiu
Wen-gang Zhou
Houqiang Li
Qi Tian
SSL
42
5
0
12 Apr 2023
Distilling Token-Pruned Pose Transformer for 2D Human Pose Estimation
Feixiang Ren
ViT
24
2
0
12 Apr 2023
VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs
Moayed Haji-Ali
Andrew Bond
Tolga Birdal
Duygu Ceylan
Levent Karacan
Erkut Erdem
Aykut Erdem
VGen
DiffM
136
2
0
12 Apr 2023
Token Boosting for Robust Self-Supervised Visual Transformer Pre-training
Tianjiao Li
Lin Geng Foo
Ping Hu
Xindi Shang
Hossein Rahmani
Zehuan Yuan
Jing Liu
51
7
0
09 Apr 2023
Self-Supervised Video Similarity Learning
Giorgos Kordopatis-Zilos
Giorgos Tolias
Christos Tzelepis
I. Kompatsiaris
Ioannis Patras
Symeon Papadopoulos
SSL
37
8
0
06 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
36
48
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
48
11
0
06 Apr 2023
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
48
6
0
06 Apr 2023
DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation
Fengyi Shen
A. Gurram
Ziyuan Liu
He Wang
Alois Knoll
24
26
0
05 Apr 2023
Strong Baselines for Parameter Efficient Few-Shot Fine-tuning
S. Basu
Daniela Massiceti
S. Hu
S. Feizi
VLM
37
29
0
04 Apr 2023
Divided Attention: Unsupervised Multi-Object Discovery with Contextually Separated Slots
Dong Lao
Zhengyang Hu
Francesco Locatello
Yanchao Yang
Stefano Soatto
OCL
33
5
0
04 Apr 2023
Video Instance Segmentation in an Open-World
Omkar Thawakar
Sanath Narayan
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Jorma T. Laaksonen
M. Shah
Fahad Shahbaz Khan
VLM
22
2
0
03 Apr 2023
Constructive Assimilation: Boosting Contrastive Learning Performance through View Generation Strategies
Ligong Han
Seung-Jun Han
Shivchander Sudalairaj
Charlotte Loh
Rumen Dangovski
...
Pulkit Agrawal
Dimitris N. Metaxas
Leonid Karlinsky
Tsui-Wei Weng
Akash Srivastava
47
1
0
02 Apr 2023
INoD: Injected Noise Discriminator for Self-Supervised Representation Learning in Agricultural Fields
Julia Hindel
Nikhil Gosala
Kevin Bregler
Abhinav Valada
34
6
0
31 Mar 2023
Removing supervision in semantic segmentation with local-global matching and area balancing
Simone Rossetti
Nico Sama
F. Pirri
VLM
43
0
0
30 Mar 2023
PMatch: Paired Masked Image Modeling for Dense Geometric Matching
Shengjie Zhu
Xiaoming Liu
40
24
0
30 Mar 2023
Masked Autoencoders as Image Processors
Huiyu Duan
Wei Shen
Xiongkuo Min
Danyang Tu
Long Teng
Jia Wang
Guangtao Zhai
ViT
38
11
0
30 Mar 2023