Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14294
Cited By
v1
v2 (latest)
Emerging Properties in Self-Supervised Vision Transformers
29 April 2021
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Emerging Properties in Self-Supervised Vision Transformers"
50 / 4,176 papers shown
Title
Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Anindya Mondal
Sauradip Nag
J. Prada
Xiatian Zhu
Anjan Dutta
67
11
0
20 Jul 2023
SLPD: Slide-level Prototypical Distillation for WSIs
Zhimiao Yu
Tiancheng Lin
Yi Xu
80
7
0
20 Jul 2023
Learning Discriminative Visual-Text Representation for Polyp Re-Identification
Suncheng Xiang
Can Liu
Sijia Du
Xiaobo Li
64
1
0
20 Jul 2023
A Holistic Assessment of the Reliability of Machine Learning Systems
Anthony Corso
David Karamadian
Romeo Valentin
Mary Cooper
Mykel J. Kochenderfer
77
7
0
20 Jul 2023
Identifying Interpretable Subspaces in Image Representations
Neha Kalibhat
S. Bhardwaj
Bayan Bruss
Hamed Firooz
Maziar Sanjabi
Soheil Feizi
FAtt
106
28
0
20 Jul 2023
Towards A Unified Agent with Foundation Models
Norman Di Palo
Arunkumar Byravan
Leonard Hasenclever
Markus Wulfmeier
N. Heess
Martin Riedmiller
LM&Ro
LLMAG
OffRL
83
60
0
18 Jul 2023
Automating Wood Species Detection and Classification in Microscopic Images of Fibrous Materials with Deep Learning
Lars Nieradzik
Jördis Sieburg-Rockel
Stephanie Helmling
J. Keuper
Thomas Weibel
Andrea Olbrich
Henrike Stephani
71
6
0
18 Jul 2023
Grounded Object Centric Learning
Avinash Kori
Francesco Locatello
Fabio De Sousa Ribeiro
Francesca Toni
Ben Glocker
OCL
86
12
0
18 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSL
ViT
63
3
0
18 Jul 2023
FlexiAST: Flexibility is What AST Needs
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
57
3
0
18 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
148
40
0
18 Jul 2023
Class-relation Knowledge Distillation for Novel Class Discovery
Peiyan Gu
Chuyu Zhang
Rui Xu
Xuming He
80
17
0
18 Jul 2023
Mining of Single-Class by Active Learning for Semantic Segmentation
Hugues Lambert
E. Slade
CLL
VLM
60
0
0
18 Jul 2023
R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut
Yingjie Niu
Ming Ding
Maoning Ge
Robin Karlsson
Yuxiao Zhang
K. Takeda
ViT
50
3
0
18 Jul 2023
Systematic comparison of semi-supervised and self-supervised learning for medical image classification
Zhe Huang
Ruijie Jiang
Shuchin Aeron
M. C. Hughes
SSL
OOD
101
7
0
18 Jul 2023
Diffusion Models Beat GANs on Image Classification
Soumik Mukhopadhyay
M. Gwilliam
Vatsal Agarwal
Namitha Padmanabhan
A. Swaminathan
Srinidhi Hegde
Dinesh Manocha
Abhinav Shrivastava
DiffM
163
48
1
17 Jul 2023
Learning to Count without Annotations
Lukas Knobel
Tengda Han
Yuki M. Asano
SSL
88
2
0
17 Jul 2023
Does Visual Pretraining Help End-to-End Reasoning?
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCL
LRM
ViT
78
3
0
17 Jul 2023
BUS:Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization
Chaoya Jiang
Haiyang Xu
Wei Ye
Qinghao Ye
Chenliang Li
Mingshi Yan
Bin Bi
Shikun Zhang
Fei Huang
Songfang Huang
VLM
66
9
0
17 Jul 2023
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration
Hiroki Naganuma
Ryuichiro Hataya
Kotaro Yoshida
Ioannis Mitliagkas
OODD
175
3
0
17 Jul 2023
Image Captions are Natural Prompts for Text-to-Image Models
Shiye Lei
Hao Chen
Senyang Zhang
Bo Zhao
Dacheng Tao
VLM
115
23
0
17 Jul 2023
Multi-Object Discovery by Low-Dimensional Object Motion
Sadra Safadoust
Fatma Guney
OCL
106
10
0
16 Jul 2023
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
Mennatullah Siam
R. Karim
Henghui Zhao
Richard P. Wildes
VOS
68
2
0
15 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLM
DiffM
82
29
0
14 Jul 2023
Dual-Query Multiple Instance Learning for Dynamic Meta-Embedding based Tumor Classification
Simon Holdenried-Krafft
Peter Somers
Ivonne A. Montes-Majarro
Diana Silimon
Cristina Tarín
F. Fend
Hendrik P. A. Lensch
MedIm
104
3
0
14 Jul 2023
The Whole Pathological Slide Classification via Weakly Supervised Learning
Qiehe Sun
Jiawen Li
Jin Xu
Junru Cheng
Tian Guan
Yonghong He
61
0
0
12 Jul 2023
Self-Supervised Learning with Lie Symmetries for Partial Differential Equations
Grégoire Mialon
Q. Garrido
Hannah Lawrence
Danyal Rehman
Yann LeCun
B. Kiani
SSL
109
26
0
11 Jul 2023
Self-supervised adversarial masking for 3D point cloud representation learning
Michal Szachniewicz
Wojciech Kozlowski
Michal Stypulkowski
Maciej Ziȩba
3DPC
51
2
0
11 Jul 2023
OpenAL: An Efficient Deep Active Learning Framework for Open-Set Pathology Image Classification
Linhao Qu
Yingfan Ma
Zhiwei Yang
Manning Wang
Zhijian Song
VLM
LM&MA
83
9
0
11 Jul 2023
CREPE: Learnable Prompting With CLIP Improves Visual Relationship Prediction
Rakshith Subramanyam
T. S. Jayram
Rushil Anirudh
Jayaraman J. Thiagarajan
VLM
68
3
0
10 Jul 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Feng Li
Hao Zhang
Pei Sun
Xueyan Zou
Siyi Liu
Jianwei Yang
Chun-yue Li
Lei Zhang
Jianfeng Gao
VLM
112
178
0
10 Jul 2023
Distill-SODA: Distilling Self-Supervised Vision Transformer for Source-Free Open-Set Domain Adaptation in Computational Pathology
Guillaume Vray
Devavrat Tomar
Jean-Philippe Thiran
Behzad Bozorgtabar
MedIm
74
1
0
10 Jul 2023
FODVid: Flow-guided Object Discovery in Videos
Silky Singh
Shripad Deshmukh
Mausoom Sarkar
R. Jain
Mayur Hemani
Balaji Krishnamurthy
VOS
52
2
0
10 Jul 2023
Novel Categories Discovery Via Constraints on Empirical Prediction Statistics
Zahid Hasan
A. Faridee
Masud Ahmed
S. Purushotham
H. Kwon
Hyungtae Lee
Nirmalya Roy
75
0
0
07 Jul 2023
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks
Xingyu Lin
John So
Sashwat Mahalingam
Fangchen Liu
Pieter Abbeel
SSL
92
26
0
07 Jul 2023
Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification & Segmentation
Dahyun Kang
Piotr Koniusz
Minsu Cho
Naila Murray
VLM
ViT
85
26
0
07 Jul 2023
Weakly-supervised Contrastive Learning for Unsupervised Object Discovery
Yun-Qiu Lv
Jing Zhang
Nick Barnes
Yuchao Dai
87
11
0
07 Jul 2023
VideoGLUE: Video General Understanding Evaluation of Foundation Models
Liangzhe Yuan
N. B. Gundavarapu
Long Zhao
Hao Zhou
Huayu Chen
...
Florian Schroff
Hartwig Adam
Ming-Hsuan Yang
Ting Liu
Boqing Gong
ELM
85
10
0
06 Jul 2023
AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images
Ao Cheng
Guoqiang Zhao
Lirong Wang
Ruobing Zhang
54
3
0
05 Jul 2023
Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need
Linhao Qu
Yingfan Ma
Xiao-Zhuo Luo
Manning Wang
Zhijian Song
VLM
109
25
0
05 Jul 2023
In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification
I. Dimitrovski
Ivan Kitanovski
Nikola Simidjievski
D. Kocev
SSL
59
4
0
04 Jul 2023
Segment Anything Meets Point Tracking
Frano Rajič
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
Martin Danelljan
Feng Yu
VLM
VOS
111
86
0
03 Jul 2023
Stitched ViTs are Flexible Vision Backbones
Zizheng Pan
Jing Liu
Haoyu He
Jianfei Cai
Bohan Zhuang
53
3
0
30 Jun 2023
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
Guocheng Qian
Jinjie Mai
Abdullah Hamdi
Jian Ren
Aliaksandr Siarohin
...
Hsin-Ying Lee
Ivan Skorokhodov
Peter Wonka
Sergey Tulyakov
Guohao Li
DiffM
175
366
0
30 Jun 2023
An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training
Z. Chen
Mingyu Ding
Songlin Yang
Wei Zhan
Masayoshi Tomizuka
Erik Learned-Miller
Chuang Gan
MoE
67
8
0
29 Jun 2023
Learning Nuclei Representations with Masked Image Modelling
P. Wójcik
Hussein Naji
A. Simon
Reinhard Büttner
Katarzyna Bozek
MedIm
15
1
0
29 Jun 2023
MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset
Guotai Wang
Jianghao Wu
Xiangde Luo
Xinglong Liu
Kang Li
Shaoting Zhang
77
28
0
29 Jun 2023
Improving Online Continual Learning Performance and Stability with Temporal Ensembles
Albin Soutif--Cormerais
Antonio Carta
Joost van de Weijer
CLL
95
12
0
29 Jun 2023
Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
Zhao Wang
Chang Liu
Shaoting Zhang
Qi Dou
MedIm
128
68
0
29 Jun 2023
CLANet: A Comprehensive Framework for Cross-Batch Cell Line Identification Using Brightfield Images
Lei Tong
A. Corrigan
Navin Rathna Kumar
Kerry Hallbrook
Jonathan Orme
Yinhai Wang
Huiyu Zhou
34
0
0
28 Jun 2023
Previous
1
2
3
...
54
55
56
...
82
83
84
Next