ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07193
  4. Cited By
DINOv2: Learning Robust Visual Features without Supervision

DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL
ArXivPDFHTML

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,193 papers shown
Title
Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images
Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images
Hicham Talaoubrid
Anissa Mokraoui
Ismail Ben Ayed
Axel Prouvost
Sonimith Hang
Monit Korn
Rémi Harvey
ObjD
60
1
0
08 Apr 2025
OmniSVG: A Unified Scalable Vector Graphics Generation Model
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Yiying Yang
Wei Cheng
Sijin Chen
Xianfang Zeng
Jiaxu Zhang
Liao Wang
Gang Yu
Xingjun Ma
Yu Jiang
VLM
45
0
0
08 Apr 2025
Flash Sculptor: Modular 3D Worlds from Objects
Flash Sculptor: Modular 3D Worlds from Objects
Yujia Hu
Songhua Liu
Xingyi Yang
Xinchao Wang
34
0
0
08 Apr 2025
Hyperbolic Category Discovery
Hyperbolic Category Discovery
Yuanpei Liu
Zhenqi He
Kai Han
28
0
0
08 Apr 2025
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction
Songyan Zhang
Yongtao Ge
Jinyuan Tian
Guangkai Xu
Hao Chen
Chen Lv
Chunhua Shen
3DPC
24
0
0
08 Apr 2025
On the Importance of Conditioning for Privacy-Preserving Data Augmentation
On the Importance of Conditioning for Privacy-Preserving Data Augmentation
Julian Lorenz
K. Ludwig
Valentin Haug
Rainer Lienhart
DiffM
38
0
0
08 Apr 2025
Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation
Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation
Xiao Zhang
Xiangyu Han
Xiwen Lai
Yao Sun
Pei Zhang
Konrad Kording
34
0
0
08 Apr 2025
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis
Tri Ton
Ji Woo Hong
Chang D. Yoo
VGen
24
0
0
08 Apr 2025
DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Sohyun Lee
N. Kim
Juwon Kang
Seong Joon Oh
Suha Kwak
94
0
0
07 Apr 2025
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Heeji Yoon
Heeseong Shin
Eunbeen Hong
Hyunwook Choi
Hansang Cho
Daun Jeong
Seungryong Kim
26
0
0
07 Apr 2025
Training state-of-the-art pathology foundation models with orders of magnitude less data
Training state-of-the-art pathology foundation models with orders of magnitude less data
Mikhail Karasikov
J. Doorn
Nicolas Kanzig
Melis Erdal Cesur
Hugo Mark Horlings
Robert Berke
Fei Tang
Sebastian Otálora
AI4CE
26
0
0
07 Apr 2025
CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation
CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation
Shuai Chen
Fanman Meng
Haoran Wei
Chenhao Wu
Qi Wu
Linfeng Xu
Yiming Li
30
0
0
07 Apr 2025
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
Bingyang Wang
Kaer Huang
Bin Li
Yiqiang Yan
L. Zhang
Huchuan Lu
You He
VLM
37
0
0
07 Apr 2025
URECA: Unique Region Caption Anything
URECA: Unique Region Caption Anything
Sangbeom Lim
J. Kim
Heeji Yoon
Jaewoo Jung
Seungryong Kim
31
0
0
07 Apr 2025
Variational Self-Supervised Learning
Variational Self-Supervised Learning
Mehmet Can Yavuz
Berrin Yanikoglu
SSL
102
0
0
06 Apr 2025
AnomalyHybrid: A Domain-agnostic Generative Framework for General Anomaly Detection
AnomalyHybrid: A Domain-agnostic Generative Framework for General Anomaly Detection
Ying Zhao
23
0
0
06 Apr 2025
Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization
Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization
Yikai Wang
Guangce Liu
Xinzhou Wang
Zilong Chen
Jiafang Li
Xin Liang
F. Sun
J. Zhu
3DGS
VGen
32
0
0
05 Apr 2025
Resilience of Vision Transformers for Domain Generalisation in the Presence of Out-of-Distribution Noisy Images
Resilience of Vision Transformers for Domain Generalisation in the Presence of Out-of-Distribution Noisy Images
Hamza Riaz
Alan F. Smeaton
41
0
0
05 Apr 2025
A Survey of Pathology Foundation Model: Progress and Future Directions
A Survey of Pathology Foundation Model: Progress and Future Directions
Conghao Xiong
Hao Chen
Joseph J. Y. Sung
LM&MA
AI4CE
53
0
0
05 Apr 2025
Dynamic Objective MPC for Motion Planning of Seamless Docking Maneuvers
Dynamic Objective MPC for Motion Planning of Seamless Docking Maneuvers
Oliver Schumann
Michael Buchholz
Klaus C. J. Dietmayer
40
0
0
04 Apr 2025
REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval
REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval
Shabnam Choudhury
Yash Salunkhe
Sarthak Mehrotra
Biplab Banerjee
36
0
0
04 Apr 2025
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Kotaro Ikeda
Masanori Koyama
Jinzhe Zhang
Kohei Hayashi
Kenji Fukumizu
OT
148
0
0
04 Apr 2025
Real-is-Sim: Bridging the Sim-to-Real Gap with a Dynamic Digital Twin for Real-World Robot Policy Evaluation
Real-is-Sim: Bridging the Sim-to-Real Gap with a Dynamic Digital Twin for Real-World Robot Policy Evaluation
Jad Abou-Chakra
Lingfeng Sun
Krishan Rana
Brandon B. May
Karl Schmeckpeper
M. Minniti
Laura Herlant
OffRL
140
0
0
04 Apr 2025
Dexterous Manipulation through Imitation Learning: A Survey
Dexterous Manipulation through Imitation Learning: A Survey
Shan An
Ziyu Meng
Chao Tang
Yue Zhou
Tengyu Liu
...
Yao Mu
Ran Song
Wei Zhang
Zeng-Guang Hou
H. Zhang
51
0
0
04 Apr 2025
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
Jianhao Zheng
Zihan Zhu
Valentin Bieri
Marc Pollefeys
Songyou Peng
Iro Armeni
3DGS
26
0
0
04 Apr 2025
Quantum Speedups for Markov Chain Monte Carlo Methods with Application to Optimization
Quantum Speedups for Markov Chain Monte Carlo Methods with Application to Optimization
Guneykan Ozgul
Xiantao Li
Mehrdad Mahdavi
Chunhao Wang
34
0
0
04 Apr 2025
BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation
BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation
Van Nguyen Nguyen
Stephen Tyree
Andrew Guo
Mederic Fourmy
Anas Gouda
...
Stan Birchfield
Jiri Matas
Yann Labbé
M. Sundermeyer
Tomás Hodan
3DPC
58
1
0
03 Apr 2025
Towards Generalizing Temporal Action Segmentation to Unseen Views
Towards Generalizing Temporal Action Segmentation to Unseen Views
Emad Bahrami
Olga Zatsarynna
Gianpiero Francesca
Juergen Gall
EgoV
46
0
0
03 Apr 2025
Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation
Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation
Chengxi Zeng
Yuxuan Jiang
Fan Zhang
A. Gambaruto
T. Burghardt
MedIm
48
0
0
03 Apr 2025
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
Lihua Liu
Jiehong Lin
Zhenxin Liu
Kui Jia
45
0
0
03 Apr 2025
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
Mateusz Pach
Shyamgopal Karthik
Quentin Bouniot
Serge Belongie
Zeynep Akata
VLM
69
0
0
03 Apr 2025
Scene-Centric Unsupervised Panoptic Segmentation
Scene-Centric Unsupervised Panoptic Segmentation
Oliver Hahn
Christoph Reich
Nikita Araslanov
Daniel Cremers
Christian Rupprecht
Stefan Roth
OCL
62
0
0
02 Apr 2025
Multimodal Reference Visual Grounding
Multimodal Reference Visual Grounding
Yangxiao Lu
Ruosen Li
Liqiang Jing
Jikai Wang
Xinya Du
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
ObjD
78
0
0
02 Apr 2025
Slot-Level Robotic Placement via Visual Imitation from Single Human Video
Slot-Level Robotic Placement via Visual Imitation from Single Human Video
Dandan Shan
Kaichun Mo
Wei Yang
Yu-Wei Chao
David Fouhey
Dieter Fox
Arsalan Mousavian
38
0
0
02 Apr 2025
Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval
Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval
Yuji Nozawa
Yu Lin
Kazumoto Nakamura
Youyang Ng
43
0
0
02 Apr 2025
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
Zixuan Wang
Duo Peng
Feng Chen
Yi Yang
Yinjie Lei
DiffM
79
0
0
02 Apr 2025
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
Chang-Bin Zhang
Jinhong Ni
Yujie Zhong
Kai Han
3DV
VLM
69
0
0
02 Apr 2025
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning
Zheng Yang
Ruoxin Chen
Zhiyuan Yan
Ke-Yue Zhang
Xinghe Fu
...
Xiujun Shu
Taiping Yao
Junchi Yan
Shouhong Ding
Xi Li
31
0
0
02 Apr 2025
A Diffusion-Based Framework for Occluded Object Movement
A Diffusion-Based Framework for Occluded Object Movement
Zheng-Peng Duan
Jiawei Zhang
Siyu Liu
Zheng Lin
Chun-Le Guo
Dongqing Zou
Jimmy S. Ren
Chongyi Li
38
0
0
02 Apr 2025
ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
Shijie Ma
Fei Zhu
Xu-Yao Zhang
Cheng-Lin Liu
37
1
0
02 Apr 2025
Anomaly Detection for Hybrid Butterfly Subspecies via Probability Filtering
Anomaly Detection for Hybrid Butterfly Subspecies via Probability Filtering
Bo-Kai Ruan
Yi-Zeng Fang
Hong-Han Shuai
Juinn-Dar Huang
46
0
0
02 Apr 2025
UniViTAR: Unified Vision Transformer with Native Resolution
UniViTAR: Unified Vision Transformer with Native Resolution
Limeng Qiao
Yiyang Gan
Bairui Wang
Jie Qin
Shuang Xu
Siqi Yang
Lin Ma
57
0
0
02 Apr 2025
Distilling Multi-view Diffusion Models into 3D Generators
Distilling Multi-view Diffusion Models into 3D Generators
Hao Qin
Luyuan Chen
Ming Kong
Mengxu Lu
Qiang Zhu
3DGS
64
0
0
01 Apr 2025
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization
Siyuan Li
L. Zhang
Zedong Wang
Juanxi Tian
Cheng Tan
...
Chang Yu
Qingsong Xie
Haonan Lu
Haoqian Wang
Zhen Lei
48
0
0
01 Apr 2025
DecoFuse: Decomposing and Fusing the "What", "Where", and "How" for Brain-Inspired fMRI-to-Video Decoding
DecoFuse: Decomposing and Fusing the "What", "Where", and "How" for Brain-Inspired fMRI-to-Video Decoding
Chong Li
Jingyang Huo
Weikang Gong
Yanwei Fu
Xiangyang Xue
Jianfeng Feng
43
0
0
01 Apr 2025
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Junyu Xie
Tengda Han
Max Bain
Arsha Nagrani
Eshika Khandelwal
Gül Varol
Weidi Xie
Andrew Zisserman
DiffM
VGen
59
0
0
01 Apr 2025
Spingarn's Method and Progressive Decoupling Beyond Elicitable Monotonicity
Spingarn's Method and Progressive Decoupling Beyond Elicitable Monotonicity
B. Evens
P. Latafat
Panagiotis Patrinos
48
1
0
01 Apr 2025
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Tian-Xing Xu
Xiangjun Gao
Wenbo Hu
Xiaoyu Li
Song-Hai Zhang
Ying Shan
VGen
MDE
60
1
0
01 Apr 2025
GECKO: Gigapixel Vision-Concept Contrastive Pretraining in Histopathology
GECKO: Gigapixel Vision-Concept Contrastive Pretraining in Histopathology
S. Kapse
Pushpak Pati
Srikar Yellapragada
Srijan Das
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
Prateek Prasanna
VLM
48
0
0
01 Apr 2025
Scaling Language-Free Visual Representation Learning
Scaling Language-Free Visual Representation Learning
David Fan
Shengbang Tong
Jiachen Zhu
Koustuv Sinha
Zhuang Liu
...
Michael G. Rabbat
Nicolas Ballas
Yann LeCun
Amir Bar
Saining Xie
CLIP
VLM
64
2
0
01 Apr 2025
Previous
123456...424344
Next