ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07193
  4. Cited By
DINOv2: Learning Robust Visual Features without Supervision

DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL
ArXivPDFHTML

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,242 papers shown
Title
Cross-Architecture Auxiliary Feature Space Translation for Efficient
  Few-Shot Personalized Object Detection
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
52
2
0
01 Jul 2024
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models
Ruinan Jin
Zikang Xu
Yuan Zhong
Qiongsong Yao
Qi Dou
S. Kevin Zhou
Xiaoxiao Li
VLM
57
14
0
01 Jul 2024
Diffusion Models and Representation Learning: A Survey
Diffusion Models and Representation Learning: A Survey
Michael Fuest
Pingchuan Ma
Ming Gui
Johannes S. Fischer
Vincent Tao Hu
Bjorn Ommer
DiffM
60
21
0
30 Jun 2024
Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP
Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP
Ayush Ranjan
Daniel Wen
Karthik Bhat
39
0
0
30 Jun 2024
Multimodal Prototyping for cancer survival prediction
Multimodal Prototyping for cancer survival prediction
Andrew H. Song
Richard J. Chen
Guillaume Jaume
Anurag J. Vaidya
Alexander S. Baras
Faisal Mahmood
43
15
0
28 Jun 2024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li
Cristina Mata
J. Park
Kumara Kahatapitiya
Yoo Sung Jang
...
Kanchana Ranasinghe
R. Burgert
Mu Cai
Yong Jae Lee
Michael S. Ryoo
LM&Ro
77
26
0
28 Jun 2024
SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting
SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting
S. Sabour
Lily Goli
George Kopanas
Mark J. Matthews
Dmitry Lagun
Leonidas Guibas
Alec Jacobson
David J. Fleet
Andrea Tagliasacchi
59
18
0
28 Jun 2024
Odd-One-Out: Anomaly Detection by Comparing with Neighbors
Odd-One-Out: Anomaly Detection by Comparing with Neighbors
A. Bhunia
Changjian Li
Hakan Bilen
75
0
0
28 Jun 2024
What Matters in Detecting AI-Generated Videos like Sora?
What Matters in Detecting AI-Generated Videos like Sora?
Chirui Chang
Zhengzhe Liu
Xiaoyang Lyu
Xiaojuan Qi
DiffM
VGen
93
7
0
27 Jun 2024
Enhancing Continual Learning in Visual Question Answering with
  Modality-Aware Feature Distillation
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
Malvina Nikandrou
Georgios Pantazopoulos
Ioannis Konstas
Alessandro Suglia
44
1
0
27 Jun 2024
Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth
  Map: A Zero-Shot Approach
Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach
Yuxiang Huang
Yuhao Chen
John S. Zelek
MDE
57
2
0
27 Jun 2024
3D Feature Distillation with Object-Centric Priors
3D Feature Distillation with Object-Centric Priors
Georgios Tziafas
Yucheng Xu
Zhibin Li
Hamidreza Kasaei
46
1
0
26 Jun 2024
Towards Human-Level 3D Relative Pose Estimation: Generalizable,
  Training-Free, with Single Reference
Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference
Yuan Gao
Yajing Luo
Junhong Wang
Kui Jia
Gui-Song Xia
3DH
47
0
0
26 Jun 2024
Foundational Models for Pathology and Endoscopy Images: Application for
  Gastric Inflammation
Foundational Models for Pathology and Endoscopy Images: Application for Gastric Inflammation
H. Kerdegari
Kyle Higgins
Dennis Veselkov
I. Laponogov
I. Poļaka
...
Junior Andrea Pescino
M. Leja
M. Dinis-Ribeiro
T. F. Kanonnikoff
Kirill Veselkov
61
3
0
26 Jun 2024
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from
  Monocular Remote Sensing Imagery
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery
Jian Song
Hongruixuan Chen
Weihao Xuan
Junshi Xia
Naoto Yokoya
37
4
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
71
22
0
26 Jun 2024
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation
  Model
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model
Zhuo Zheng
Stefano Ermon
Dongjun Kim
Liangpei Zhang
Yanfei Zhong
DiffM
50
20
0
26 Jun 2024
MotionBooth: Motion-Aware Customized Text-to-Video Generation
MotionBooth: Motion-Aware Customized Text-to-Video Generation
Jianzong Wu
Xiangtai Li
Yanhong Zeng
Jiangning Zhang
Qianyu Zhou
Yining Li
Yunhai Tong
Kai Chen
DiffM
VGen
93
44
0
25 Jun 2024
Depth-Guided Semi-Supervised Instance Segmentation
Depth-Guided Semi-Supervised Instance Segmentation
Xin Chen
Jie Hu
Xiawu Zheng
Jianghang Lin
Liujuan Cao
Rongrong Ji
ISeg
3DV
60
1
0
25 Jun 2024
LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
Aoyang Liu
Qingnan Fan
Shuai Qin
Hong Gu
Yansong Tang
DiffM
63
1
0
25 Jun 2024
MM-SpuBench: Towards Better Understanding of Spurious Biases in
  Multimodal LLMs
MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs
Wenqian Ye
Guangtao Zheng
Yunsheng Ma
Xu Cao
Bolin Lai
James M. Rehg
Aidong Zhang
42
10
0
24 Jun 2024
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
Chongjie Ye
Lingteng Qiu
Xiaodong Gu
Qi Zuo
Yushuang Wu
Zilong Dong
Liefeng Bo
Yuliang Xiu
Xiaoguang Han
DiffM
57
41
0
24 Jun 2024
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
Shengbang Tong
Ellis L Brown
Penghao Wu
Sanghyun Woo
Manoj Middepogu
...
Xichen Pan
Austin Wang
Rob Fergus
Yann LeCun
Saining Xie
3DV
MLLM
62
311
0
24 Jun 2024
WARP: On the Benefits of Weight Averaged Rewarded Policies
WARP: On the Benefits of Weight Averaged Rewarded Policies
Alexandre Ramé
Johan Ferret
Nino Vieillard
Robert Dadashi
Léonard Hussenot
Pierre-Louis Cedoz
Pier Giuseppe Sessa
Sertan Girgin
Arthur Douillard
Olivier Bachem
62
15
0
24 Jun 2024
UNICAD: A Unified Approach for Attack Detection, Noise Reduction and
  Novel Class Identification
UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification
Alvaro Lopez Pellicer
Kittipos Giatgong
Yi Li
N. Suri
Plamen Angelov
AAML
37
3
0
24 Jun 2024
The Hidden Pitfalls of the Cosine Similarity Loss
The Hidden Pitfalls of the Cosine Similarity Loss
Andrew Draganov
Sharvaree P. Vadgama
Erik J. Bekkers
SSL
48
1
0
24 Jun 2024
Feature-prompting GBMSeg: One-Shot Reference Guided Training-Free Prompt
  Engineering for Glomerular Basement Membrane Segmentation
Feature-prompting GBMSeg: One-Shot Reference Guided Training-Free Prompt Engineering for Glomerular Basement Membrane Segmentation
Xueyu Liu
Guangze Shi
Rui Wang
Yexin Lai
Jianan Zhang
...
Quan Yang
Yongfei Wu
MIng Li
Weixia Han
Wen-Xin Zheng
VLM
52
2
0
24 Jun 2024
Breaking the Frame: Image Retrieval by Visual Overlap Prediction
Breaking the Frame: Image Retrieval by Visual Overlap Prediction
Tong Wei
Philipp Lindenberger
Jirí Matas
Dániel Baráth
70
0
0
23 Jun 2024
HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image
  Analysis
HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image Analysis
Guillaume Jaume
Paul Doucet
Andrew H. Song
Ming Y. Lu
Cristina Almagro-Pérez
...
Anurag J. Vaidya
Richard J. Chen
Drew F. K. Williamson
Ahrong Kim
Faisal Mahmood
56
30
0
23 Jun 2024
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control
Delin Qu
Qizhi Chen
Pingrui Zhang
Xianqiang Gao
Bin Zhao
Bin Zhao
Dong Wang
Xuelong Li
AI4CE
54
8
0
23 Jun 2024
Beyond the Doors of Perception: Vision Transformers Represent Relations
  Between Objects
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
Michael A. Lepori
Alexa R. Tartaglini
Wai Keen Vong
Thomas Serre
Brenden M. Lake
Ellie Pavlick
49
3
0
22 Jun 2024
PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection
PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection
Alvaro Lopez Pellcier
Yi Li
Plamen Angelov
DiffM
48
9
0
22 Jun 2024
SEDMamba: Enhancing Selective State Space Modelling with Bottleneck
  Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in
  Robot-Assisted Surgery
SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery
Jialang Xu
Nazir Sirajudeen
M. Boal
Nader K Francis
Danail Stoyanov
E. Mazomenos
Mamba
45
2
0
22 Jun 2024
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Akshita Gupta
Aditya Arora
Sanath Narayan
Salman Khan
Fahad Shahbaz Khan
Graham W. Taylor
46
3
0
21 Jun 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim
Zhuoxiao Chen
Mahsa Baktashmotlagh
Zhi Chen
Xin Yu
Zi Huang
Yadan Luo
VLM
ObjD
86
1
0
21 Jun 2024
Consistency Models Made Easy
Consistency Models Made Easy
Zhengyang Geng
Ashwini Pokle
William Luo
Justin Lin
J. Zico Kolter
52
29
0
20 Jun 2024
Predicting Probabilities of Error to Combine Quantization and Early
  Exiting: QuEE
Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE
Florence Regol
Joud Chataoui
Bertrand Charpentier
Mark Coates
Pablo Piantanida
Stephan Gunnemann
69
0
0
20 Jun 2024
Automatic Labels are as Effective as Manual Labels in Biomedical Images
  Classification with Deep Learning
Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning
Niccolo Marini
S. Marchesin
Lluis Borras Ferris
Simon Püttmann
Marek Wodzinski
...
Filippo Fraggetta
Iris Nagtegaal
Gianmaria Silvello
Manfredo Atzori
Henning Muller
30
1
0
20 Jun 2024
Latent Functional Maps
Latent Functional Maps
Marco Fumero
Marco Pegoraro
Valentino Maiorca
Francesco Locatello
Emanuele Rodolà
55
0
0
20 Jun 2024
Splatter a Video: Video Gaussian Representation for Versatile Processing
Splatter a Video: Video Gaussian Representation for Versatile Processing
Yang-tian Sun
Yi-Hua Huang
Lin Ma
Xiaoyang Lyu
Yan-Pei Cao
Xiaojuan Qi
3DGS
49
5
0
19 Jun 2024
You can't handle the (dirty) truth: Data-centric insights improve
  pseudo-labeling
You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
Nabeel Seedat
Nicolas Huynh
F. Imrie
Mihaela van der Schaar
53
2
0
19 Jun 2024
Controlling Forgetting with Test-Time Data in Continual Learning
Controlling Forgetting with Test-Time Data in Continual Learning
Vaibhav Singh
Rahaf Aljundi
Eugene Belilovsky
CLL
VLM
KELM
53
3
0
19 Jun 2024
Is AI fun? HumorDB: a curated dataset and benchmark to investigate
  graphical humor
Is AI fun? HumorDB: a curated dataset and benchmark to investigate graphical humor
Veedant Jain
Felipe dos Santos Alves Feitosa
Gabriel Kreiman
VLM
71
2
0
19 Jun 2024
4K4DGen: Panoramic 4D Generation at 4K Resolution
4K4DGen: Panoramic 4D Generation at 4K Resolution
Renjie Li
Panwang Pan
Bangbang Yang
Dejia Xu
Shijie Zhou
Xuanyang Zhang
Zeming Li
A. Kadambi
Zhangyang Wang
Zhiwen Fan
VGen
68
17
0
19 Jun 2024
Large-Scale Dataset Pruning in Adversarial Training through Data
  Importance Extrapolation
Large-Scale Dataset Pruning in Adversarial Training through Data Importance Extrapolation
Bjorn Nieth
Thomas Altstidl
Leo Schwinn
Björn Eskofier
AAML
53
2
0
19 Jun 2024
ChangeViT: Unleashing Plain Vision Transformers for Change Detection
ChangeViT: Unleashing Plain Vision Transformers for Change Detection
Duowang Zhu
Xiaohu Huang
Haiyan Huang
Zhenfeng Shao
Q. Cheng
59
8
0
18 Jun 2024
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation
  Models
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Yongtao Ge
Guangkai Xu
Zhiyue Zhao
Libo Sun
Zheng Huang
Yanlong Sun
Hao Chen
Chunhua Shen
MDE
42
3
0
18 Jun 2024
Cycle-Correspondence Loss: Learning Dense View-Invariant Visual Features
  from Unlabeled and Unordered RGB Images
Cycle-Correspondence Loss: Learning Dense View-Invariant Visual Features from Unlabeled and Unordered RGB Images
David B. Adrian
A. Kupcsik
Markus Spies
Heiko Neumann
SSL
39
0
0
18 Jun 2024
The Wisdom of a Crowd of Brains: A Universal Brain Encoder
The Wisdom of a Crowd of Brains: A Universal Brain Encoder
Roman Beliy
Navve Wasserman
Amit Zalcher
Michal Irani
45
2
0
18 Jun 2024
DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by
  Distilling Neural Fields and Foundation Model Features
DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
Letian Wang
Seung Wook Kim
Jiawei Yang
Cunjun Yu
Boris Ivanovic
Steven Waslander
Yue Wang
Sanja Fidler
Marco Pavone
Peter Karkus
48
8
0
17 Jun 2024
Previous
123...262728...434445
Next