Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.07193
Cited By
DINOv2: Learning Robust Visual Features without Supervision
14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINOv2: Learning Robust Visual Features without Supervision"
50 / 2,194 papers shown
Title
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks
Xingyu Lin
John So
Sashwat Mahalingam
Fangchen Liu
Pieter Abbeel
SSL
30
22
0
07 Jul 2023
VideoGLUE: Video General Understanding Evaluation of Foundation Models
Liangzhe Yuan
N. B. Gundavarapu
Long Zhao
Hao Zhou
Huayu Chen
...
Florian Schroff
Hartwig Adam
Ming Yang
Ting Liu
Boqing Gong
ELM
40
9
0
06 Jul 2023
Optimal and Efficient Binary Questioning for Human-in-the-Loop Annotation
Franco Marchesoni-Acland
Jean-Michel Morel
J. Kherroubi
Gabriele Facciolo
29
0
0
04 Jul 2023
Stitched ViTs are Flexible Vision Backbones
Zizheng Pan
Jing Liu
Haoyu He
Jianfei Cai
Bohan Zhuang
20
2
0
30 Jun 2023
Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization
Ying-Hsiu Lai
Zhiming Luo
Zitong Yu
CVBM
30
20
0
29 Jun 2023
End-to-end Autonomous Driving: Challenges and Frontiers
Li Chen
Peng Wu
Kashyap Chitta
Bernhard Jaeger
Andreas Geiger
Hongyang Li
3DV
61
264
0
29 Jun 2023
Cross-Validation Is All You Need: A Statistical Approach To Label Noise Estimation
Jianan Chen
Anne L. Martel
13
0
0
24 Jun 2023
How to Efficiently Adapt Large Segmentation Model(SAM) to Medical Images
Xinrong Hu
Xiaowei Xu
Yi Shi
VLM
MedIm
19
61
0
23 Jun 2023
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayca Takmaz
Elisabetta Fedele
R. Sumner
Marc Pollefeys
F. Tombari
Francis Engelmann
ISeg
VLM
31
164
0
23 Jun 2023
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
54
556
0
23 Jun 2023
A Sparse Graph Formulation for Efficient Spectral Image Segmentation
Rahul Palnitkar
J. F. R. Neto
23
0
0
22 Jun 2023
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter
Binjie Zhang
Yixiao Ge
Xuyuan Xu
Ying Shan
Mike Zheng Shou
47
7
0
22 Jun 2023
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
Yunlong Tang
Jinrui Zhang
Xiangchen Wang
Teng Wang
Feng Zheng
VLM
73
9
0
17 Jun 2023
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
You-Chen Liu
Lingdong Kong
Jun Cen
Runnan Chen
Wenwei Zhang
Liang Pan
Kai-xiang Chen
Ziwei Liu
37
83
0
15 Jun 2023
Single-Stage Visual Query Localization in Egocentric Videos
Hanwen Jiang
Santhosh Kumar Ramakrishnan
Kristen Grauman
31
13
0
15 Jun 2023
Robustness Analysis on Foundational Segmentation Models
Madeline Chantry Schiappa
Shehreen Azad
V. Sachidanand
Yunhao Ge
O. Mikšík
Yogesh S Rawat
Vibhav Vineet
OOD
VLM
AAML
30
6
0
15 Jun 2023
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Peng Xu
Wenqi Shao
Kaipeng Zhang
Peng Gao
Shuo Liu
Meng Lei
Fanqing Meng
Siyuan Huang
Yu Qiao
Ping Luo
ELM
MLLM
36
159
0
15 Jun 2023
Behavioral Cloning via Search in Embedded Demonstration Dataset
Federico Malato
Florian Leopold
Ville Hautamaki
Andrew Melnik
OffRL
27
3
0
15 Jun 2023
MOFI: Learning Image Representations from Noisy Entity Annotated Images
Wentao Wu
Aleksei Timofeev
Chen Chen
Bowen Zhang
Kun Duan
...
Yantao Zheng
Jonathon Shlens
Xianzhi Du
Zhe Gan
Yinfei Yang
VLM
26
7
0
13 Jun 2023
Frequency-Based Vulnerability Analysis of Deep Learning Models against Image Corruptions
Harshitha Machiraju
Michael H. Herzog
P. Frossard
23
0
0
12 Jun 2023
On the Challenges and Perspectives of Foundation Models for Medical Image Analysis
Shaoting Zhang
Dimitris N. Metaxas
LM&MA
VLM
MedIm
AI4CE
42
128
0
09 Jun 2023
Artificial General Intelligence for Medical Imaging
Xiang Li
Lu Zhang
Zihao Wu
Zheng Liu
Lin Zhao
...
Pingkuan Yan
Quanzheng Li
Wei Liu
Tianming Liu
Dinggang Shen
LM&MA
AI4CE
19
40
0
08 Jun 2023
SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding
Paul-Edouard Sarlin
Eduard Trulls
Marc Pollefeys
J. Hosang
Simon Lynen
3DPC
SSL
28
25
0
08 Jun 2023
Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models
Tianzhe Chu
Shengbang Tong
Tianjiao Ding
Xili Dai
B. Haeffele
René Vidal
Y. Ma
SSL
VLM
28
13
0
08 Jun 2023
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models
G. Stein
Jesse C. Cresswell
Rasa Hosseinzadeh
Yi Sui
Brendan Leigh Ross
Valentin Villecroze
Zhaoyan Liu
Anthony L. Caterini
J. E. T. Taylor
G. Loaiza-Ganem
EGVM
33
96
0
07 Jun 2023
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Jiaming Liu
Senqiao Yang
Peidong Jia
Renrui Zhang
Ming Lu
Yandong Guo
Wei Xue
Shanghang Zhang
TTA
OOD
VLM
30
36
0
07 Jun 2023
Multi-View Representation is What You Need for Point-Cloud Pre-Training
Siming Yan
Chen Song
Youkang Kong
Qi-Xing Huang
3DPC
33
2
0
05 Jun 2023
Open-world Text-specified Object Counting
Niki Amini-Naieni
Kiana Amini-Naieni
Tengda Han
Andrew Zisserman
VLM
26
16
0
02 Jun 2023
Towards In-context Scene Understanding
Ivana Balazevic
David Steiner
Nikhil Parthasarathy
Relja Arandjelović
Olivier J. Hénaff
35
28
0
02 Jun 2023
Evaluating The Robustness of Self-Supervised Representations to Background/Foreground Removal
Xavier F. Cadet
Ranya Aloufi
A. Miranville
S. Ahmadi-Abhari
Hamed Haddadi
24
0
0
02 Jun 2023
DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery and Data Poisoning Detection
Hossein Aboutalebi
Daniel Mao
Rongqi Fan
Carol Xu
Chris He
Alexander Wong
AAML
22
8
0
02 Jun 2023
A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm
Onur Beker
3DV
8
0
0
01 Jun 2023
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
Zhongwei Wan
Che Liu
Mi Zhang
Jie Fu
Benyou Wang
Sibo Cheng
Lei Ma
César Quilodrán-Casas
Rossella Arcucci
50
71
0
31 May 2023
Augmentation-aware Self-supervised Learning with Conditioned Projector
Marcin Przewike'zlikowski
Mateusz Pyla
Bartosz Zieliñski
Bartlomiej Twardowski
Jacek Tabor
Marek Śmieja
SSL
37
2
0
31 May 2023
Unbalanced Low-rank Optimal Transport Solvers
M. Scetbon
Michal Klein
Giovanni Palla
Marco Cuturi
OT
48
4
0
31 May 2023
PaintSeg: Training-free Segmentation via Painting
Xiang Li
Chung-Ching Lin
Yinpeng Chen
Zicheng Liu
Jinglu Wang
Bhiksha Raj
40
5
0
30 May 2023
Contextual Vision Transformers for Robust Representation Learning
Yu Bao
Theofanis Karaletsos
ViT
26
13
0
30 May 2023
Ambient Diffusion: Learning Clean Distributions from Corrupted Data
Giannis Daras
Kulin Shah
Y. Dagan
Aravind Gollakota
A. Dimakis
Adam R. Klivans
DiffM
45
68
0
30 May 2023
LowDINO -- A Low Parameter Self Supervised Learning Model
Sai Krishna Prathapaneni
Shvejan Shashank
K. SrikarReddy
38
0
0
28 May 2023
VoxDet: Voxel Learning for Novel Instance Detection
Bowen Li
Jiashun Wang
Yaoyu Hu
Chen Wang
Sebastian Scherer
38
6
0
26 May 2023
Modulate Your Spectrum in Self-Supervised Learning
Xi Weng
Yu-Li Ni
Tengwei Song
Jie Luo
Rao Muhammad Anwer
Salman Khan
Fahad Shahbaz Khan
Lei Huang
45
5
0
26 May 2023
POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference
Zhiwen Fan
Pan Pan
Peihao Wang
Yi Ding
Dejia Xu
Hanwen Jiang
Zhangyang Wang
37
24
0
25 May 2023
RoMa: Robust Dense Feature Matching
Johan Edstedt
Qiyu Sun
Georg Bökman
Maarten Wadenback
M. Felsberg
3DV
42
92
0
24 May 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Luisa Polania Cabrera
Varun Jampani
Deqing Sun
Ming Yang
DiffM
39
171
0
24 May 2023
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale
Ziyun Zeng
Yixiao Ge
Zhan Tong
Xihui Liu
Shutao Xia
Ying Shan
24
9
0
23 May 2023
Weakly Supervised 3D Open-vocabulary Segmentation
Kunhao Liu
Fangneng Zhan
Jiahui Zhang
Muyu Xu
Yingchen Yu
Abdulmotaleb El Saddik
Christian Theobalt
Eric P. Xing
Shijian Lu
34
66
0
23 May 2023
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Yang Liu
Muzhi Zhu
Hengtao Li
Hao Chen
Xinlong Wang
Chunhua Shen
VLM
MLLM
88
84
0
22 May 2023
Unsupervised Multi-view Pedestrian Detection
Mengyin Liu
Chao Zhu
Shiqi Ren
Xu-Cheng Yin
37
6
0
21 May 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
Aran Nayebi
R. Rajalingham
M. Jazayeri
G. R. Yang
36
19
0
19 May 2023
TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series
Alexander Nikitin
Letizia Iannucci
Samuel Kaski
TTA
SyDa
AI4TS
39
11
0
19 May 2023
Previous
1
2
3
...
42
43
44
Next