Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.07193
Cited By
DINOv2: Learning Robust Visual Features without Supervision
14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINOv2: Learning Robust Visual Features without Supervision"
50 / 2,220 papers shown
Title
Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?
Kerem Cekmeceli
Meva Himmetoglu
G. I. Tombak
A. Susmelj
Ertunc Erdil
E. Konukoglu
MedIm
43
2
0
12 Sep 2024
UNIT: Unsupervised Online Instance Segmentation through Time
Corentin Sautier
Gilles Puy
Alexandre Boulch
Renaud Marlet
Vincent Lepetit
37
1
0
12 Sep 2024
Structured Pruning for Efficient Visual Place Recognition
Oliver Grainge
Michael Milford
Indu Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
51
1
0
12 Sep 2024
Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models
Qingqiao Hu
Daoan Zhang
Jiebo Luo
Zhenyu Gong
Benedikt Wiestler
Jianguo Zhang
Hongwei Bran Li
39
0
0
12 Sep 2024
Foundation Models Boost Low-Level Perceptual Similarity Metrics
Abhijay Ghildyal
Nabajeet Barman
Saman Zadtootaghaj
42
3
0
11 Sep 2024
A Scalable Algorithm for Active Learning
Youguang Chen
Zheyu Wen
George Biros
27
0
0
11 Sep 2024
What to align in multimodal contrastive learning?
Benoit Dufumier
J. Castillo-Navarro
D. Tuia
Jean-Philippe Thiran
41
3
0
11 Sep 2024
A Likelihood Ratio-Based Approach to Segmenting Unknown Objects
Nazir Nayal
Youssef Shoeb
Fatma Güney
OODD
43
4
0
10 Sep 2024
High-Performance Few-Shot Segmentation with Foundation Models: An Empirical Study
Shijie Chang
Lihe Zhang
Huchuan Lu
VLM
47
1
0
10 Sep 2024
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding
Ji Ha Jang
H. Seo
Se Young Chun
53
2
0
10 Sep 2024
RealisDance: Equip controllable character animation with realistic hands
Jingkai Zhou
Benzhi Wang
Weihua Chen
Jingqi Bai
Dongyang Li
Aixi Zhang
Hao Xu
Mingyang Yang
F. Wang
26
12
0
10 Sep 2024
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
58
0
0
10 Sep 2024
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
Amin Karimi Monsefi
Kishore Prakash Sailaja
Ali Alilooee
Ser-Nam Lim
R. Ramnath
VLM
42
6
0
10 Sep 2024
SGC-VQGAN: Towards Complex Scene Representation via Semantic Guided Clustering Codebook
C. Ding
Chiyu Wang
Boshi Liu
Xi Guo
Weixuan Tang
Wei Wu
45
0
0
09 Sep 2024
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
51
2
0
09 Sep 2024
EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels
Qingyao Tian
Zhen Chen
Huai Liao
Xinyan Huang
Lujie Li
Sebastien Ourselin
Hongbin Liu
116
1
0
09 Sep 2024
KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction
Davide Di Nucci
Alessandro Simoni
Matteo Tomei
L. Ciuffreda
R. Vezzani
Rita Cucchiara
3DPC
36
0
0
09 Sep 2024
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
Nan Chen
Mengqi Huang
Zhuowei Chen
Yang Zheng
Lei Zhang
Zhendong Mao
DiffM
60
5
0
09 Sep 2024
Thinking Outside the BBox: Unconstrained Generative Object Compositing
Gemma Canet Tarrés
Zhe Lin
Zhifei Zhang
Jianming Zhang
Yizhi Song
Dan Ruta
Andrew Gilbert
John Collomosse
Soo Ye Kim
DiffM
35
9
0
06 Sep 2024
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader
Leon Eisemann
Adrian Pogorzelski
Namrata Jangid
Attila B. Kis
51
0
0
06 Sep 2024
Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning
Isaac Ray
Alexei Skurikhin
124
0
0
05 Sep 2024
RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images
Benzhi Wang
Jingkai Zhou
Jingqi Bai
Yang Yang
Weihua Chen
F. Wang
Zhen Lei
DiffM
39
3
0
05 Sep 2024
Tissue Concepts: supervised foundation models in computational pathology
Till Nicke
Jan Raphael Schaefer
Henning Hoefener
Friedrich Feuerhake
Dorit Merhof
Fabian Kiessling
Johannes Lotz
MedIm
55
0
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-xiong Wang
78
15
0
05 Sep 2024
Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment
Konstantin Schall
Kai Uwe Barthel
Nico Hezel
Klaus Jung
VLM
43
3
0
03 Sep 2024
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Wenlong Huang
Chen Wang
Yongqian Li
Ruohan Zhang
Li Fei-Fei
62
95
0
03 Sep 2024
DiVE: DiT-based Video Generation with Enhanced Control
Junpeng Jiang
Gangyi Hong
Lijun Zhou
Enhui Ma
Hengtong Hu
...
Kaicheng Yu
Haiyang Sun
Kun Zhan
Peng Jia
Miao Zhang
VGen
DiffM
38
12
0
03 Sep 2024
DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction
Jenny Seidenschwarz
Qunjie Zhou
Bardienus Duisterhof
Deva Ramanan
Laura Leal-Taixe
50
4
0
03 Sep 2024
Understanding Multimodal Hallucination with Parameter-Free Representation Alignment
Yueqian Wang
Jianxin Liang
Yuxuan Wang
Huishuai Zhang
Dongyan Zhao
56
1
0
02 Sep 2024
ViRED: Prediction of Visual Relations in Engineering Drawings
Chao Gu
Ke Lin
Yiyang Luo
Jiahui Hou
Xiang-Yang Li
40
0
0
02 Sep 2024
Image-to-Lidar Relational Distillation for Autonomous Driving Data
Anas Mahmoud
Ali Harakeh
Steven Waslander
31
0
0
01 Sep 2024
Self-Supervised Vision Transformers for Writer Retrieval
Tim Raven
Arthur Matei
Gernot A. Fink
ViT
28
0
0
01 Sep 2024
RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning
Sha Lu
Xuecheng Xu
Yuxuan Wu
Haojian Lu
Xieyuanli Chen
R. Xiong
Yue Wang
59
2
0
30 Aug 2024
DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model
Mona Sheikh Zeinoddin
Chiara Lena
Jiongqi Qu
Luca Carlini
Mattia Magro
...
E. Mazomenos
Daniel C. Alexander
Danail Stoyanov
Matthew J. Clarkson
Mobarakol Islam
36
1
0
30 Aug 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang
Zhicheng Wang
Howard Zhou
Soham Ghosh
Danushen Gnanapragasam
Varun Jampani
Hao Su
Leonidas J. Guibas
DD
68
5
0
30 Aug 2024
FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning
Li-Heng Lin
Yuchen Cui
Amber Xie
Tianyu Hua
Dorsa Sadigh
34
8
0
29 Aug 2024
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
Moreno DÍncà
E. Peruzzo
Massimiliano Mancini
Xingqian Xu
Humphrey Shi
N. Sebe
67
0
0
29 Aug 2024
Identifying Terrain Physical Parameters from Vision -- Towards Physical-Parameter-Aware Locomotion and Navigation
Jiaqi Chen
Jonas Frey
Ruyi Zhou
Takahiro Miki
Georg Martius
Marco Hutter
47
10
0
29 Aug 2024
Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment
Liyao Tang
Zhe Chen
Shanshan Zhao
Chaoyue Wang
Dacheng Tao
37
0
0
29 Aug 2024
Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks
Sierra Bonilla
Chiara Di Vece
Rema Daher
Xinwei Ju
Danail Stoyanov
Francisco Vasconcelos
Sophia Bano
3DV
44
1
0
29 Aug 2024
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
55
0
0
29 Aug 2024
Law of Vision Representation in MLLMs
Shijia Yang
Bohan Zhai
Quanzeng You
Jianbo Yuan
Hongxia Yang
Chenfeng Xu
49
9
0
29 Aug 2024
BELT-2: Bootstrapping EEG-to-Language representation alignment for multi-task brain decoding
Jinzhao Zhou
Yiqun Duan
Fred Chang
T. Do
Yu-Kai Wang
Chin-Teng Lin
30
2
0
28 Aug 2024
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Min Shi
Fuxiao Liu
Shihao Wang
Shijia Liao
Subhashree Radhakrishnan
...
Andrew Tao
Andrew Tao
Zhiding Yu
Guilin Liu
Guilin Liu
MLLM
43
55
0
28 Aug 2024
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration
Xu Zhang
Jiaqi Ma
Guoli Wang
Qian Zhang
Huan Zhang
Lefei Zhang
VLM
99
7
0
28 Aug 2024
S-MolSearch: 3D Semi-supervised Contrastive Learning for Bioactive Molecule Search
G. Zhou
Zhen Wang
Feng Yu
Guolin Ke
Zhewei Wei
Zhifeng Gao
31
2
0
27 Aug 2024
NeuralOOD: Improving Out-of-Distribution Generalization Performance with Brain-machine Fusion Learning Framework
Shuangchen Zhao
Changde Du
Hui Li
Huiguang He
47
0
0
27 Aug 2024
The Benefits of Balance: From Information Projections to Variance Reduction
Lang Liu
Ronak R. Mehta
Soumik Pal
Zaïd Harchaoui
35
0
0
27 Aug 2024
SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery
Sarah Rastegar
Mohammadreza Salehi
Yuki M. Asano
Hazel Doughty
Cees G. M. Snoek
46
4
0
26 Aug 2024
TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation
Anh-Dzung Doan
Vu Minh Hieu Phan
Surabhi Gupta
Markus Wagner
Tat-Jun Chin
Ian Reid
VGen
DiffM
46
0
0
26 Aug 2024
Previous
1
2
3
...
21
22
23
...
43
44
45
Next