ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07193
  4. Cited By
DINOv2: Learning Robust Visual Features without Supervision

DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL
ArXivPDFHTML

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,220 papers shown
Title
Semantic Alignment of Unimodal Medical Text and Vision Representations
Maxime Di Folco
E. Chan
Marta Hasny
Cosmin I. Bercea
Julia A. Schnabel
68
0
0
06 Mar 2025
In-Context Reverse Classification Accuracy: Efficient Estimation of Segmentation Quality without Ground-Truth
Matias Cosarinsky
Ramiro Billot
Lucas Mansilla
Gabriel Gimenez
Nicolás Gaggion
Guanghui Fu
Enzo Ferrante
61
1
0
06 Mar 2025
LensDFF: Language-enhanced Sparse Feature Distillation for Efficient Few-Shot Dexterous Manipulation
Qian Feng
David S. Martinez Lema
Jianxiang Feng
Zhaopeng Chen
Alois C. Knoll
44
0
0
05 Mar 2025
AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons
AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons
Hongjie Fang
Chenxi Wang
Yiming Wang
J. Chen
Shangning Xia
...
Xinyu Zhan
Lixin Yang
Weiming Wang
Cewu Lu
Hao-Shu Fang
87
1
0
05 Mar 2025
Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters
Julia Hindel
Rohit Mohan
Jelena Bratulić
Daniele Cattaneo
Thomas Brox
Abhinav Valada
3DPC
81
0
0
05 Mar 2025
COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation
Aurelio Noca
Xianmei Lei
Jonathan Becktor
J. Edlund
Anna Sabel
Patrick Spieler
Curtis Padgett
Alexandre Alahi
Deegan Atha
63
0
0
05 Mar 2025
CREStE: Scalable Mapless Navigation with Internet Scale Priors and Counterfactual Guidance
Arthur Zhang
Harshit S. Sikchi
Amy Zhang
Joydeep Biswas
64
1
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
48
0
0
05 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
61
1
0
05 Mar 2025
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Youssef Shoeb
Azarm Nowzad
Hanno Gottschalk
UQCV
87
2
0
04 Mar 2025
TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition
Oliver Grainge
Michael Milford
I. Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
ViT
72
0
0
04 Mar 2025
Label-Efficient LiDAR Panoptic Segmentation
Ahmet Selim Çanakçı
Niclas Vodisch
Kürsat Petek
Wolfram Burgard
Abhinav Valada
3DPC
88
0
0
04 Mar 2025
Personalized Generation In Large Model Era: A Survey
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenjie Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
82
2
0
04 Mar 2025
A dataset-free approach for self-supervised learning of 3D reflectional symmetries
Issac Aguirre
Ivan Sipiran
Gabriel Montañana
53
0
0
04 Mar 2025
Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
Yizhou Huang
Fan Yang
Guoliang Zhu
Gen Li
Hao-miao Shi
Yukun Zuo
Wenrui Chen
Zehan Li
Kailun Yang
64
0
0
04 Mar 2025
MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality
Shuaike Li
Kai Zhang
Qiang Liu
Enhong Chen
KELM
83
1
0
04 Mar 2025
Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views
Yingji Zhong
Kaichen Zhou
Zhihao Li
Lanqing Hong
Zhiyu Li
Dan Xu
59
1
0
04 Mar 2025
Bridging VLM and KMP: Enabling Fine-grained robotic manipulation via Semantic Keypoints Representation
Junjie Zhu
Huayu Liu
Jin Wang
Bangrong Wen
Kaixiang Huang
Xiaofei Li
Haiyun Zhan
Guodong Lu
70
0
0
04 Mar 2025
Deepfake Detection via Knowledge Injection
Tonghui Li
Yuanfang Guo
Zichen Liu
Heqi Peng
Yunhong Wang
59
2
0
04 Mar 2025
Adaptive Camera Sensor for Vision Models
Eunsu Baek
Sunghwan Han
Taesik Gong
Hyung-Sin Kim
VLM
Presented at ResearchTrend Connect | VLM on 28 Mar 2025
164
0
0
04 Mar 2025
One-shot In-context Part Segmentation
Zhenqi Dai
Ting Liu
X. Zhang
Y. X. Wei
Yanning Zhang
VLM
85
1
0
03 Mar 2025
HanDrawer: Leveraging Spatial Information to Render Realistic Hands Using a Conditional Diffusion Model in Single Stage
Qifan Fu
Xu Chen
Muhammad Asad
Shanxin Yuan
Changjae Oh
Gregory Slabaugh
DiffM
62
1
0
03 Mar 2025
Primus: Enforcing Attention Usage for 3D Medical Image Segmentation
Tassilo Wald
Saikat Roy
Fabian Isensee
Constantin Ulrich
Sebastian Ziegler
D. Trofimova
Raphael Stock
Michael Baumgartner
Gregor Köhler
Klaus H. Maier-Hein
ViT
MedIm
42
1
0
03 Mar 2025
Open-source framework for detecting bias and overfitting for large pathology images
Anders Sildnes
N. Shvetsov
M. Tafavvoghi
Vi Ngoc-Nha Tran
Kajsa Møllersen
Lill-ToveRasmussen Busund
Thomas K. Kilvær
L. A. Bongo
VLM
78
0
0
03 Mar 2025
Advancing vision-language models in front-end development via data synthesis
Tong Ge
Yashu Liu
Jieping Ye
Tianyi Li
Chao Wang
78
0
0
03 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
69
0
0
03 Mar 2025
Conditional Electrocardiogram Generation Using Hierarchical Variational Autoencoders
Conditional Electrocardiogram Generation Using Hierarchical Variational Autoencoders
Ivan Sviridov
Konstantin Egorov
DRL
SyDa
55
0
0
03 Mar 2025
MFM-DA: Instance-Aware Adaptor and Hierarchical Alignment for Efficient Domain Adaptation in Medical Foundation Models
Jia-Xuan Jiang
Wenhui Lei
Yifeng Wu
Hongtao Wu
Furong Li
Yining Xie
Xiaofan Zhang
Zhenting Wang
MedIm
34
0
0
02 Mar 2025
Foundation Models Secretly Understand Neural Network Weights: Enhancing Hypernetwork Architectures with Foundation Models
Jeffrey Gu
Serena Yeung-Levy
AI4CE
34
0
0
02 Mar 2025
FunBench: Benchmarking Fundus Reading Skills of MLLMs
Qijie Wei
Kaiheng Qian
Xirong Li
39
1
0
02 Mar 2025
Evaluating and Predicting Distorted Human Body Parts for Generated Images
Lu Ma
Kaibo Cao
Hao Liang
Jiaxin Lin
Zhiyu Li
Yuhong Liu
Jihong Zhang
Wentao Zhang
Bin Cui
MedIm
44
0
0
02 Mar 2025
MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain
Rui Yi Yong
Samuel Picosson
Arnold Wiliem
42
0
0
02 Mar 2025
Solving Instance Detection from an Open-World Perspective
Solving Instance Detection from an Open-World Perspective
Qianqian Shen
Yunhan Zhao
Nahyun Kwon
Jeeeun Kim
Yanan Li
Shu Kong
45
0
0
01 Mar 2025
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions
Haoxin Li
Yingchen Yu
Qilong Wu
Hanwang Zhang
Boyang Li
Song Bai
3DH
VGen
230
0
0
01 Mar 2025
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
Tianyi Wang
Jianan Fan
Dingxin Zhang
Dongnan Liu
Yong-quan Xia
Heng Huang
Weidong Cai
39
0
0
01 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice
Eric Heim
Oren Wright
David Shriver
OOD
FaML
73
0
0
01 Mar 2025
Bring Your Own Grasp Generator: Leveraging Robot Grasp Generation for Prosthetic Grasping
Giuseppe Stracquadanio
Federico Vasile
Elisa Maiettini
Nicoló Boccardo
Lorenzo Natale
32
0
0
01 Mar 2025
Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning
Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning
Léopold Maytié
Roland Bertin Johannet
Rufin VanRullen
OffRL
46
0
0
28 Feb 2025
SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition
SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition
Shanshan Wan
Yingmei Wei
Lai Kang
Tianrui Shen
Haixuan Wang
Yee-Hong Yang
43
0
0
28 Feb 2025
Ext2Gen: Alignment through Unified Extraction and Generation for Robust Retrieval-Augmented Generation
Hwanjun Song
J. Choi
Minseok Kim
RALM
3DV
68
0
0
28 Feb 2025
CNSv2: Probabilistic Correspondence Encoded Neural Image Servo
Anzhe Chen
Hongxiang Yu
Shuxin Li
Yuxi Chen
Zhongxiang Zhou
Wentao Sun
R. Xiong
Yansen Wang
34
0
0
28 Feb 2025
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration
X. J. Yang
Jing Liu
Peng Wang
Guoqing Wang
Yue Yang
H. Shen
ObjD
94
0
0
27 Feb 2025
SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View Segmentation
SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View Segmentation
Zijie Zhou
Zhangshuo Qi
Luqi Cheng
Guangming Xiong
68
1
0
27 Feb 2025
Vector-Quantized Vision Foundation Models for Object-Centric Learning
Vector-Quantized Vision Foundation Models for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
OCL
VLM
284
0
0
27 Feb 2025
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
Luigi Piccinelli
Daniel Gehrig
Yifan Yang
Mattia Segu
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
49
6
0
27 Feb 2025
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation
Kang Liu
Zhuoqi Ma
Xiaolu Kang
Yunan Li
Kun Xie
Zhicheng Jiao
Qiguang Miao
36
3
0
27 Feb 2025
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
Sucheng Ren
Qihang Yu
Ju He
Xiaohui Shen
Alan Yuille
Liang-Chieh Chen
VGen
85
7
0
27 Feb 2025
Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Fan Yang
DongSheng Luo
Wenrui Chen
Jiacheng Lin
Junjie Cai
Kailun Yang
Zehan Li
Yaonan Wang
56
0
0
27 Feb 2025
MITracker: Multi-View Integration for Visual Object Tracking
MITracker: Multi-View Integration for Visual Object Tracking
Mengjie Xu
Yitao Zhu
Haotian Jiang
Jiaming Li
Zhenrong Shen
...
Haolin Huang
Xinyu Wang
Qing Yang
H. Zhang
Qian Wang
48
0
0
27 Feb 2025
ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting
ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting
Dexter Ong
Yuezhan Tao
Varun Murali
Igor Spasojevic
Vijay Kumar
Pratik Chaudhari
3DGS
68
0
0
27 Feb 2025
Previous
123...101112...434445
Next