ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07193
  4. Cited By
DINOv2: Learning Robust Visual Features without Supervision
v1v2 (latest)

DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLMCLIPSSL
ArXiv (abs)PDFHTML

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 826 papers shown
Title
The Faiss library
The Faiss library
Matthijs Douze
Alexandr Guzhva
Chengqi Deng
Jeff Johnson
Gergely Szilvasy
Pierre-Emmanuel Mazaré
Maria Lomeli
Lucas Hosseini
Hervé Jégou
207
185
0
16 Jan 2024
Surgical-DINO: Adapter Learning of Foundation Models for Depth
  Estimation in Endoscopic Surgery
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Beilei Cui
Mobarakol Islam
Long Bai
Hongliang Ren
MedIm
105
42
0
11 Jan 2024
Do Vision and Language Encoders Represent the World Similarly?
Do Vision and Language Encoders Represent the World Similarly?
Mayug Maniparambil
Raiymbek Akshulakov
Y. A. D. Djilali
Sanath Narayan
M. Seddik
K. Mangalam
Noel E. O'Connor
VLM
96
14
0
10 Jan 2024
Low-Resource Vision Challenges for Foundation Models
Low-Resource Vision Challenges for Foundation Models
Yunhua Zhang
Hazel Doughty
Cees G. M. Snoek
VLM
94
7
0
09 Jan 2024
Effective pruning of web-scale datasets based on complexity of concept
  clusters
Effective pruning of web-scale datasets based on complexity of concept clusters
Amro Abbas
E. Rusak
Kushal Tirumala
Wieland Brendel
Kamalika Chaudhuri
Ari S. Morcos
VLMCLIP
83
23
0
09 Jan 2024
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion
  Recognition
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition
Zheng Lian
Guoying Zhao
Yong Ren
Hao Gu
Haiyang Sun
Lan Chen
Bin Liu
Jianhua Tao
124
13
0
07 Jan 2024
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
E. Peruzzo
Vidit Goel
Dejia Xu
Xingqian Xu
Yi Ding
Zhangyang Wang
Humphrey Shi
N. Sebe
LM&RoVGenDiffM
122
12
0
04 Jan 2024
Amodal Ground Truth and Completion in the Wild
Amodal Ground Truth and Completion in the Wild
Guanqi Zhan
Chuanxia Zheng
Weidi Xie
Andrew Zisserman
80
22
0
28 Dec 2023
Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation
Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation
Lokesh Veeramacheneni
Moritz Wolter
Hildegard Kuehne
Juergen Gall
EGVM
74
6
0
23 Dec 2023
Design Space Exploration of Low-Bit Quantized Neural Networks for Visual
  Place Recognition
Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition
Oliver Grainge
Michael Milford
Indu Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
81
6
0
14 Dec 2023
Weighted Ensemble Models Are Strong Continual Learners
Weighted Ensemble Models Are Strong Continual Learners
Imad Eddine Marouf
Subhankar Roy
Enzo Tartaglione
Stéphane Lathuilière
CLL
109
22
0
14 Dec 2023
Bayes3D: fast learning and inference in structured generative models of
  3D objects and scenes
Bayes3D: fast learning and inference in structured generative models of 3D objects and scenes
Nishad Gothoskar
Matin Ghavami
Eric Li
Aidan Curtis
Michael Noseworthy
...
Brian Patton
William T. Freeman
Joshua B. Tenenbaum
Mirko Klukas
Vikash K. Mansinghka
BDL3DV
64
3
0
14 Dec 2023
LD-SDM: Language-Driven Hierarchical Species Distribution Modeling
LD-SDM: Language-Driven Hierarchical Species Distribution Modeling
Srikumar Sastry
Xin Xing
Aayush Dhakal
Subash Khanal
Adeel Ahmad
Nathan Jacobs
90
6
0
13 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
135
4
0
11 Dec 2023
Cross Domain Generative Augmentation: Domain Generalization with Latent
  Diffusion Models
Cross Domain Generative Augmentation: Domain Generalization with Latent Diffusion Models
S. Hemati
Mahdi Beitollahi
A. Estiri
Bassel Al Omari
Xi Chen
Guojun Zhang
69
7
0
08 Dec 2023
Emergence and Function of Abstract Representations in Self-Supervised
  Transformers
Emergence and Function of Abstract Representations in Self-Supervised Transformers
Quentin RV. Ferry
Joshua Ching
Takashi Kawai
78
3
0
08 Dec 2023
Human Demonstrations are Generalizable Knowledge for Robots
Human Demonstrations are Generalizable Knowledge for Robots
Te Cui
Guangyan Chen
Tianxing Zhou
Zicai Peng
Mengxiao Hu
Haoyang Lu
Haizhou Li
Meiling Wang
Yi Yang
Yufeng Yue
LM&Ro
90
6
0
05 Dec 2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Lukas Schäfer
Logan Jones
Anssi Kanervisto
Yuhan Cao
Tabish Rashid
Raluca Georgescu
David Bignell
Siddhartha Sen
Andrea Trevino Gavito
Sam Devlin
174
3
0
04 Dec 2023
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models
Andrea Caraffa
Davide Boscaini
Amir Hamza
Fabio Poiesi
127
18
0
01 Dec 2023
A Lightweight Clustering Framework for Unsupervised Semantic
  Segmentation
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation
Yau Shing Jonathan Cheung
Xi Chen
Lihe Yang
Hengshuang Zhao
71
1
0
30 Nov 2023
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D
  Features
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features
Thomas Wimmer
Peter Wonka
M. Ovsjanikov
109
13
0
29 Nov 2023
Meta Co-Training: Two Views are Better than One
Meta Co-Training: Two Views are Better than One
Jay C. Rothenberger
Dimitrios I. Diochnos
VLM
164
3
0
29 Nov 2023
On Bringing Robots Home
On Bringing Robots Home
Nur Muhammad (Mahi) Shafiullah
Anant Rai
Haritheja Etukuru
Yiqian Liu
Ishan Misra
Soumith Chintala
Lerrel Pinto
157
87
0
27 Nov 2023
Continual Learning: Applications and the Road Forward
Continual Learning: Applications and the Road Forward
Eli Verwimp
Rahaf Aljundi
Shai Ben-David
Matthias Bethge
Andrea Cossu
...
Joost van de Weijer
Bing Liu
Vincenzo Lomonaco
Tinne Tuytelaars
Gido M. van de Ven
CLL
114
47
0
20 Nov 2023
Event Camera Data Dense Pre-training
Event Camera Data Dense Pre-training
Yan Yang
Liyuan Pan
Liu Liu
65
4
0
20 Nov 2023
DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance
  Fields
DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields
Yu Chi
Fangneng Zhan
Sibo Wu
Christian Theobalt
Adam Kortylewski
75
1
0
18 Nov 2023
On the Out of Distribution Robustness of Foundation Models in Medical
  Image Segmentation
On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation
D. M. Nguyen
Tan Ngoc Pham
Nghiem Tuong Diep
Nghi Quoc Phan
Quang Pham
...
Ngan Hoang Le
Nhat Ho
Pengtao Xie
Daniel Sonntag
Mathias Niepert
VLMUQCVOOD
80
7
0
18 Nov 2023
Challenges in data-based geospatial modeling for environmental research
  and practice
Challenges in data-based geospatial modeling for environmental research and practice
Diana Koldasbayeva
P. Tregubova
M. Gasanov
Alexey Zaytsev
Anna Petrovskaia
Evgeny Burnaev
AI4CE
78
1
0
18 Nov 2023
Multi-entity Video Transformers for Fine-Grained Video Representation Learning
Multi-entity Video Transformers for Fine-Grained Video Representation Learning
Matthew Walmer
Rose Kanjirathinkal
Kai Sheng Tai
Keyur Muzumdar
Taipeng Tian
Abhinav Shrivastava
ViT
84
0
0
17 Nov 2023
PolyMaX: General Dense Prediction with Mask Transformer
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
105
15
0
09 Nov 2023
The Pursuit of Human Labeling: A New Perspective on Unsupervised
  Learning
The Pursuit of Human Labeling: A New Perspective on Unsupervised Learning
Artyom Gadetsky
Maria Brbić
69
7
0
06 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring
OpenForest: A data catalogue for machine learning in forest monitoring
Arthur Ouaknine
T. Kattenborn
Etienne Laliberté
David Rolnick
157
6
0
01 Nov 2023
A High-Resolution Dataset for Instance Detection with Multi-View
  Instance Capture
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture
Qianqian Shen
Yunhan Zhao
Nahyun Kwon
Jeeeun Kim
Yanan Li
Shu Kong
54
2
0
30 Oct 2023
Drive Anywhere: Generalizable End-to-end Autonomous Driving with
  Multi-modal Foundation Models
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Tsun-Hsuan Wang
Alaa Maalouf
Wei Xiao
Yutong Ban
Alexander Amini
Guy Rosman
S. Karaman
Daniela Rus
73
46
0
26 Oct 2023
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous
  Manipulation
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation
Qianxu Wang
Haotong Zhang
Congyue Deng
Yang You
Hao Dong
Yixin Zhu
Leonidas Guibas
82
20
0
25 Oct 2023
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Chengxu Zhuang
Evelina Fedorenko
Jacob Andreas
74
12
0
20 Oct 2023
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity
  Metrics For Science And Machine Learning
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning
Amey P. Pasarkar
Adji Bousso Dieng
106
13
0
19 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang
Yunshuang Li
Osbert Bastani
Abhishek Gupta
Dinesh Jayaraman
Yecheng Jason Ma
Luca Weihs
77
19
0
12 Oct 2023
Advancing Pose-Guided Image Synthesis with Progressive Conditional
  Diffusion Models
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
Fei Shen
Hu Ye
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
DiffM
138
64
0
10 Oct 2023
Adaptive Multi-head Contrastive Learning
Adaptive Multi-head Contrastive Learning
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
107
5
0
09 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale
  Pre-Trained Models
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Chenzhuang Du
Yue Zhao
Chonghua Liao
Jiacheng You
Jie Fu
Hang Zhao
86
2
0
08 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers
Sub-token ViT Embedding via Stochastic Resonance Transformers
Dong Lao
Yangchao Wu
Tian Yu Liu
Alex Wong
Stefano Soatto
VOS
79
4
0
06 Oct 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
  Planning
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Yuanyi Zhong
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
122
189
0
28 Sep 2023
Detect Everything with Few Examples
Detect Everything with Few Examples
Xinyu Zhang
Yuting Wang
Abdeslam Boularias
ObjDVLM
104
14
0
22 Sep 2023
Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based
  Agile Flight
Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight
Jiaxu Xing
L. Bauersfeld
Yunlong Song
Chunwei Xing
Davide Scaramuzza
106
12
0
18 Sep 2023
Revealing the Underlying Patterns: Investigating Dataset Similarity,
  Performance, and Generalization
Revealing the Underlying Patterns: Investigating Dataset Similarity, Performance, and Generalization
Akshit Achara
R. Pandey
SSL
113
0
0
07 Aug 2023
A Parameter-efficient Multi-subject Model for Predicting fMRI Activity
A Parameter-efficient Multi-subject Model for Predicting fMRI Activity
Connor Lane
Gregory Kiar
62
2
0
04 Aug 2023
CNOS: A Strong Baseline for CAD-based Novel Object Segmentation
CNOS: A Strong Baseline for CAD-based Novel Object Segmentation
Van Nguyen Nguyen
Thibault Groueix
Georgy Ponimatkin
Vincent Lepetit
Tomás Hodan
3DPC
95
50
0
20 Jul 2023
Diffusion Models Beat GANs on Image Classification
Diffusion Models Beat GANs on Image Classification
Soumik Mukhopadhyay
M. Gwilliam
Vatsal Agarwal
Namitha Padmanabhan
A. Swaminathan
Srinidhi Hegde
Dinesh Manocha
Abhinav Shrivastava
DiffM
163
48
1
17 Jul 2023
Dual-Query Multiple Instance Learning for Dynamic Meta-Embedding based
  Tumor Classification
Dual-Query Multiple Instance Learning for Dynamic Meta-Embedding based Tumor Classification
Simon Holdenried-Krafft
Peter Somers
Ivonne A. Montes-Majarro
Diana Silimon
Cristina Tarín
F. Fend
Hendrik P. A. Lensch
MedIm
100
3
0
14 Jul 2023
Previous
123...151617
Next