ResearchTrend.AI
DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,193 papers shown
Title
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization
Siyuan Li
Lefei Zhang
Zedong Wang
Juanxi Tian
Cheng Tan
...
Chang Yu
Qingsong Xie
Haonan Lu
Haoqian Wang
Zhen Lei
50
0
0
01 Apr 2025
GECKO: Gigapixel Vision-Concept Contrastive Pretraining in Histopathology
S. Kapse
Pushpak Pati
Srikar Yellapragada
Srijan Das
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
Prateek Prasanna
VLM
48
0
0
01 Apr 2025
Multi-Task Learning for Extracting Menstrual Characteristics from Clinical Notes
Anna Shopova
Christoph Lippert
Leslee J. Shaw
Eugenia Alleva
47
0
0
31 Mar 2025
Detecting Glioma, Meningioma, and Pituitary Tumors, and Normal Brain Tissues based on Yolov11 and Yolov8 Deep Learning Models
Ahmed M. Taha
Salah A. Aly
Mohamed F. Darwish
31
0
0
31 Mar 2025
From Colors to Classes: Emergence of Concepts in Vision Transformers
Teresa Dorszewski
Lenka Tětková
Robert Jenssen
Lars Kai Hansen
Kristoffer Wickstrøm
41
0
0
31 Mar 2025
Leveraging Diffusion Model and Image Foundation Model for Improved Correspondence Matching in Coronary Angiography
Lin Zhao
Xin Yu
Yikang Liu
Xiao Chen
Eric Z. Chen
Terrence Chen
Shanhui Sun
DiffM
MedIm
49
0
0
31 Mar 2025
CBIL: Collective Behavior Imitation Learning for Fish from Real Videos
Yifan Wu
Zhiyang Dou
Yuko Ishiwaka
Shun Ogawa
Yuke Lou
Wenping Wang
Lingjie Liu
Taku Komura
54
3
0
31 Mar 2025
Improved Ear Verification with Vision Transformers and Overlapping Patches
Deeksha Arun
Kagan Öztürk
Kevin W. Bowyer
Patrick Flynn
43
0
0
30 Mar 2025
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Felix Wimbauer
Weirong Chen
Dominik Muhle
Christian Rupprecht
Daniel Cremers
VGen
67
0
0
30 Mar 2025
DASH: Detection and Assessment of Systematic Hallucinations of VLMs
Maximilian Augustin
Yannic Neuhaus
Matthias Hein
VLM
55
1
0
30 Mar 2025
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
Fanding Huang
Jingyan Jiang
Qinting Jiang
Hebei Li
Faisal Nadeem Khan
Zhi Wang
VLM
57
0
0
30 Mar 2025
Diffusion Meets Few-shot Class Incremental Learning
Junsu Kim
Yunhoe Ku
Dongyoon Han
Seungryul Baek
DiffM
CLL
52
0
0
30 Mar 2025
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model
Jannik Endres
Oliver Hahn
Charles Corbière
Simone Schaub-Meyer
Stefan Roth
Alexandre Alahi
MDE
39
0
0
30 Mar 2025
VideoGen-Eval: Agent-based System for Video Generation Evaluation
Yuhang Yang
Ke Fan
Shri Kiran Srinivasan
Hongxiang Li
Ailing Zeng
FeiLin Han
Wei-dong Zhai
Wei Liu
Yang Cao
Zheng-jun Zha
EGVM
VGen
78
0
0
30 Mar 2025
OncoReg: Medical Image Registration for Oncological Challenges
Wiebke Heyer
Yannic Elser
Lennart Berkel
Xinrui Song
Xuanang Xu
...
Christoph Großbröhmer
Lasse Hansen
Alessa Hering
Malte M. Sieren
Mattias P. Heinrich
44
0
0
29 Mar 2025
Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection
Marc-Antoine Lavoie
Anas Mahmoud
Steven Waslander
42
0
0
29 Mar 2025
MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs
Xianglong He
Junyi Chen
Di Huang
Zexiang Liu
Xiaoshui Huang
Wanli Ouyang
C. Yuan
Yangguang Li
DiffM
57
0
0
29 Mar 2025
Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning
Xinlei Shao
Hongruixuan Chen
Fan Zhao
Kirsty Magson
Jundong Chen
Peiran Li
Jingchao Wang
Jun Sasaki
44
0
0
29 Mar 2025
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
Ziyue Huang
Hongxi Yan
Qiqi Zhan
Shuai Yang
Mingming Zhang
Chenkai Zhang
Yiming Lei
Zeming Liu
Qingjie Liu
Yansen Wang
49
0
0
28 Mar 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
41
3
0
28 Mar 2025
Understanding Co-speech Gestures in-the-wild
Sindhu B. Hegde
KR Prajwal
Taein Kwon
Andrew Zisserman
SLR
57
0
0
28 Mar 2025
High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning
Dailan He
Xinyu Wang
Shulun Wang
Guanglu Song
Bingqi Ma
Hao Shao
Y. Liu
Hongsheng Li
DiffM
70
0
0
28 Mar 2025
A Proposal for Networks Capable of Continual Learning
Zeki Doruk Erden
Boi Faltings
CLL
49
0
0
28 Mar 2025
Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data
Juwei Guan
Xiaolin Fang
Donghyun Kim
Haotian Gong
Tongxin Zhu
Zhen Ling
M. Yang
OffRL
38
1
0
28 Mar 2025
Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision
Rulin Zhou
Wenlong He
An Wang
Qiqi Yao
Haijun Hu
Jiankun Wang
Xi Zhang
Hongliang Ren
39
0
0
28 Mar 2025
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization
Martin Kiss
Michal Hradiš
Martina Dvořáková
Václav Jiroušek
Filip Kersch
46
1
0
28 Mar 2025
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
Martin Kiss
Michal Hradiš
39
0
0
28 Mar 2025
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
72
0
0
27 Mar 2025
Dual-Task Learning for Dead Tree Detection and Segmentation with Hybrid Self-Attention U-Nets in Aerial Imagery
Anis Ur Rahman
Einari Heinaro
Mete Ahishali
Samuli Junttila
45
1
0
27 Mar 2025
ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks
Erik Wallin
Fredrik Kahl
Lars Hammarstrand
OODD
54
0
0
27 Mar 2025
Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence
Haolin Liu
Xiaohang Zhan
Zizheng Yan
Zhongjin Luo
Yuxin Wen
Xiaoguang Han
60
0
0
27 Mar 2025
HORT: Monocular Hand-held Objects Reconstruction with Transformers
Zerui Chen
Rolandos Alexandros Potamias
Shizhe Chen
Cordelia Schmid
3DH
48
0
0
27 Mar 2025
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Xianglong He
Zi-Xin Zou
Chia-Hao Chen
Y. Guo
Ding Liang
Chun Yuan
Wanli Ouyang
Yan-Pei Cao
Yangguang Li
58
0
0
27 Mar 2025
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation
Reza Qorbani
Gianluca Villani
Theodoros Panagiotakopoulos
Marc Botet Colomer
Linus Harenstam-Nielsen
...
Pier Luigi Dovesi
Jussi Karlgren
Daniel Cremers
F. Tombari
Matteo Poggi
VLM
52
0
0
27 Mar 2025
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Rabiul Awal
Maximilian Seitzer
E. Gavves
Aishwarya Agrawal
OCL
VLM
89
2
0
27 Mar 2025
UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation
Yehui Shen
Lei Zhang
Qingqiu Li
Xiongwei Zhao
Yanjie Wang
Huimin Lu
Xieyuanli Chen
47
0
0
27 Mar 2025
Online Reasoning Video Segmentation with Just-in-Time Digital Twins
Yiqing Shen
Bohan Liu
Chenjia Li
Lalithkumar Seenivasan
Mathias Unberath
VOS
83
2
0
27 Mar 2025
LOCORE: Image Re-ranking with Long-Context Sequence Modeling
Zilin Xiao
Pavel Suma
Ayush Sachdeva
Hao-Jen Wang
Giorgos Kordopatis-Zilos
Giorgos Tolias
Vicente Ordonez
62
0
0
27 Mar 2025
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Rongyu Zhang
Menghang Dong
Yuan Zhang
Liang Heng
Xiaowei Chi
Gaole Dai
Li Du
Dan Wang
Yuan Du
MoE
84
0
0
26 Mar 2025
MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation
Jinnan Chen
Lingting Zhu
Zeyu Hu
Shengju Qian
Yuxiao Chen
Xin Wang
G. Lee
105
1
0
26 Mar 2025
DINeMo: Learning Neural Mesh Models with no 3D Annotations
Weijie Guo
Guofeng Zhang
Wufei Ma
A. Yuille
3DH
101
0
0
26 Mar 2025
A Large-Scale Vision-Language Dataset Derived from Open Scientific Literature to Advance Biomedical Generalist AI
Alejandro Lozano
Min Woo Sun
James Burgess
Jeffrey Nirschl
Christopher Polzak
...
Xiaohan Wang
Alfred Seunghoon Song
Chiang Chia-Chun
Robert Tibshirani
Serena Yeung-Levy
LM&MA
96
1
0
26 Mar 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
89
0
0
26 Mar 2025
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi
Boyi Li
Han Cai
Yunfan Lu
Sifei Liu
...
Jan Kautz
Enze Xie
Trevor Darrell
Pavlo Molchanov
Hongxu Yin
CLIP
160
0
0
25 Mar 2025
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Zhi Hou
Tianyi Zhang
Yuwen Xiong
Haonan Duan
Hengjun Pu
...
Chengyang Zhao
X. Zhu
Yu Qiao
Jifeng Dai
Yuxiao Chen
59
1
0
25 Mar 2025
ChA-MAEViT: Unifying Channel-Aware Masked Autoencoders and Multi-Channel Vision Transformers for Improved Cross-Channel Learning
Chau Pham
Juan C. Caicedo
Bryan A. Plummer
47
0
0
25 Mar 2025
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Sangwon Beak
Hyeonwoo Kim
Hanbyul Joo
46
0
0
25 Mar 2025
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
Vladan Stojnić
Yannis Kalantidis
Jiří Matas
Giorgos Tolias
VLM
51
0
0
25 Mar 2025
AvatarArtist: Open-Domain 4D Avatarization
Hongyu Liu
Xuan Wang
Bo Liu
Yue Ma
Jingye Chen
Yanbo Fan
Yujun Shen
Yibing Song
Qifeng Chen
41
0
0
25 Mar 2025
SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors
Yiqing Li
Qing Guo
Jiawei Wu
Yikun Ma
Zhi Jin
3DGS
44
0
0
25 Mar 2025