ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07193
  4. Cited By
DINOv2: Learning Robust Visual Features without Supervision

DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL
ArXivPDFHTML

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,220 papers shown
Title
Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and
  Challenges
Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and Challenges
Zhe Jiang
28
0
0
30 Oct 2023
Adversarial Attacks and Defenses in Large Language Models: Old and New
  Threats
Adversarial Attacks and Defenses in Large Language Models: Old and New Threats
Leo Schwinn
David Dobre
Stephan Günnemann
Gauthier Gidel
AAML
ELM
29
39
0
30 Oct 2023
A Survey on Knowledge Editing of Neural Networks
A Survey on Knowledge Editing of Neural Networks
Vittorio Mazzia
Alessandro Pedrani
Andrea Caciolai
Kay Rottmann
Davide Bernardi
KELM
20
25
0
30 Oct 2023
HyPE: Attention with Hyperbolic Biases for Relative Positional Encoding
HyPE: Attention with Hyperbolic Biases for Relative Positional Encoding
Giorgio Angelotti
16
0
0
30 Oct 2023
Are Natural Domain Foundation Models Useful for Medical Image
  Classification?
Are Natural Domain Foundation Models Useful for Medical Image Classification?
Joana Palés Huix
Adithya Raju Ganeshan
Johan Fredin Haslum
Magnus P Soderberg
Christos Matsoukas
Kevin Smith
OOD
MedIm
VLM
26
30
0
30 Oct 2023
Few-shot Hybrid Domain Adaptation of Image Generators
Few-shot Hybrid Domain Adaptation of Image Generators
Hengjia Li
Yang Liu
Linxuan Xia
Yuqi Lin
Tu Zheng
Zheng Yang
Wenxiao Wang
Xiaohui Zhong
Xiaobo Ren
Xiaofei He
22
2
0
30 Oct 2023
A High-Resolution Dataset for Instance Detection with Multi-View
  Instance Capture
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture
Qianqian Shen
Yunhan Zhao
Nahyun Kwon
Jeeeun Kim
Yanan Li
Shu Kong
28
2
0
30 Oct 2023
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
Zitong S. Chen
Chau Pham
Siqi Wang
Michael Doron
Nikita Moshkov
Bryan A. Plummer
Juan C. Caicedo
30
11
0
30 Oct 2023
Patch-Wise Self-Supervised Visual Representation Learning: A
  Fine-Grained Approach
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach
Ali Javidani
Mohammad Amin Sadeghi
Babak N. Araabi
30
0
0
28 Oct 2023
One-shot Localization and Segmentation of Medical Images with Foundation
  Models
One-shot Localization and Segmentation of Medical Images with Foundation Models
Deepa Anand
Gurunath Reddy
Vanika Singhal
D. Shanbhag
KS Shriram
...
Dawei Gui
R. Mullick
Avinash Gopal
Parminder Bhatia
Taha A. Kass-Hout
MedIm
52
13
0
28 Oct 2023
Drive Anywhere: Generalizable End-to-end Autonomous Driving with
  Multi-modal Foundation Models
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Tsun-Hsuan Wang
Alaa Maalouf
Wei Xiao
Yutong Ban
Alexander Amini
Guy Rosman
S. Karaman
Daniela Rus
27
42
0
26 Oct 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic
  Matching
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li
Jingyi Lu
Kai Han
V. Prisacariu
DiffM
30
19
0
26 Oct 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar
Three Pillars improving Vision Foundation Model Distillation for Lidar
Gilles Puy
Spyros Gidaris
Alexandre Boulch
Oriane Siméoni
Corentin Sautier
Patrick Pérez
Andrei Bursuc
Renaud Marlet
107
18
0
26 Oct 2023
Attribute Based Interpretable Evaluation Metrics for Generative Models
Attribute Based Interpretable Evaluation Metrics for Generative Models
Dongkyun Kim
Mingi Kwon
Youngjung Uh
EGVM
40
2
0
26 Oct 2023
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous
  Manipulation
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation
Qianxu Wang
Haotong Zhang
Congyue Deng
Yang You
Hao Dong
Yixin Zhu
Leonidas J. Guibas
29
18
0
25 Oct 2023
Open-NeRF: Towards Open Vocabulary NeRF Decomposition
Open-NeRF: Towards Open Vocabulary NeRF Decomposition
Hao Zhang
Fang Li
Narendra Ahuja
35
12
0
25 Oct 2023
Integrating View Conditions for Image Synthesis
Integrating View Conditions for Image Synthesis
Jinbin Bai
Zhen Dong
Aosong Feng
Xiao Zhang
Tian-Chun Ye
Kaicheng Zhou
67
13
0
24 Oct 2023
Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic
  Gaussian Mixture Models
Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models
Iman Nematollahi
Kirill Yankov
Wolfram Burgard
Tim Welschehold
31
0
0
23 Oct 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D
  Representations
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations
Yifeng Zhu
Zhenyu Jiang
Peter Stone
Yuke Zhu
3DPC
29
45
0
22 Oct 2023
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method
  and Application
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application
Bo Yuan
Danpei Zhao
3DV
CLL
38
10
0
22 Oct 2023
SILC: Improving Vision Language Pretraining with Self-Distillation
SILC: Improving Vision Language Pretraining with Self-Distillation
Muhammad Ferjad Naeem
Yongqin Xian
Xiaohua Zhai
Lukas Hoyer
Luc Van Gool
F. Tombari
VLM
30
33
0
20 Oct 2023
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Chengxu Zhuang
Evelina Fedorenko
Jacob Andreas
22
10
0
20 Oct 2023
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity
  Metrics For Science And Machine Learning
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning
Amey P. Pasarkar
Adji Bousso Dieng
27
11
0
19 Oct 2023
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A
  Survey
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey
Oriane Siméoni
Éloi Zablocki
Spyros Gidaris
Gilles Puy
Patrick Pérez
31
10
0
19 Oct 2023
An Image is Worth Multiple Words: Discovering Object Level Concepts
  using Multi-Concept Prompt Learning
An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning
Chen Jin
Ryutaro Tanno
Amrutha Saseendran
Tom Diethe
Philip Teare
21
2
0
18 Oct 2023
Functional Invariants to Watermark Large Transformers
Functional Invariants to Watermark Large Transformers
Pierre Fernandez
Guillaume Couairon
Teddy Furon
Matthijs Douze
19
8
0
17 Oct 2023
Tracking and Mapping in Medical Computer Vision: A Review
Tracking and Mapping in Medical Computer Vision: A Review
Adam Schmidt
Omid Mohareri
S. DiMaio
Michael C. Yip
Septimiu E. Salcudean
47
34
0
17 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt
  Foundation Models
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
18
8
0
17 Oct 2023
Prototype-oriented Unsupervised Change Detection for Disaster Management
Prototype-oriented Unsupervised Change Detection for Disaster Management
Youngtack Oh
Minseok Seo
Do-Yun Kim
Junghoon Seo
41
0
0
15 Oct 2023
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language
  Models
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Dongsheng Jiang
Yuchen Liu
Songlin Liu
Jiné Zhao
Hao Zhang
Zhen Gao
Xiaopeng Zhang
Jin Li
Hongkai Xiong
MLLM
VLM
41
34
0
13 Oct 2023
Is ImageNet worth 1 video? Learning strong image encoders from 1 long
  unlabelled video
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Shashanka Venkataramanan
Mamshad Nayeem Rizve
João Carreira
Yuki M. Asano
Yannis Avrithis
SSL
39
18
0
12 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang
Yunshuang Li
Osbert Bastani
Abhishek Gupta
Dinesh Jayaraman
Yecheng Jason Ma
Luca Weihs
37
17
0
12 Oct 2023
Causal Unsupervised Semantic Segmentation
Causal Unsupervised Semantic Segmentation
Junho Kim
Byung-Kwan Lee
Yonghyun Ro
36
18
0
11 Oct 2023
Computational Pathology at Health System Scale -- Self-Supervised
  Foundation Models from Three Billion Images
Computational Pathology at Health System Scale -- Self-Supervised Foundation Models from Three Billion Images
Gabriele Campanella
Ricky Kwan
Eugene Fluder
Jennifer Zeng
A. Stock
...
Adam J. Schoenfeld
Chad M. Vanderbilt
P. Kovatch
Carlos Cordon-Cardo
Thomas J. Fuchs
MedIm
63
25
0
10 Oct 2023
Self-supervised Object-Centric Learning for Videos
Self-supervised Object-Centric Learning for Videos
Görkay Aydemir
Weidi Xie
Fatma Guney
OCL
VOS
SSL
33
24
0
10 Oct 2023
A General Protocol to Probe Large Vision Models for 3D Physical
  Understanding
A General Protocol to Probe Large Vision Models for 3D Physical Understanding
Guanqi Zhan
Chuanxia Zheng
Weidi Xie
Andrew Zisserman
DiffM
26
14
0
10 Oct 2023
AttributionLab: Faithfulness of Feature Attribution Under Controllable
  Environments
AttributionLab: Faithfulness of Feature Attribution Under Controllable Environments
Yang Zhang
Yawei Li
Hannah Brown
Mina Rezaei
Bernd Bischl
Philip Torr
Ashkan Khakzar
Kenji Kawaguchi
OOD
55
1
0
10 Oct 2023
Advancing Pose-Guided Image Synthesis with Progressive Conditional
  Diffusion Models
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
Fei Shen
Hu Ye
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
DiffM
48
56
0
10 Oct 2023
Adaptive Multi-head Contrastive Learning
Adaptive Multi-head Contrastive Learning
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
41
4
0
09 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale
  Pre-Trained Models
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Chenzhuang Du
Yue Zhao
Chonghua Liao
Jiacheng You
Jie Fu
Hang Zhao
47
2
0
08 Oct 2023
Sub-token ViT Embedding via Stochastic Resonance Transformers
Sub-token ViT Embedding via Stochastic Resonance Transformers
Dong Lao
Yangchao Wu
Tian Yu Liu
Alex Wong
Stefano Soatto
VOS
36
4
0
06 Oct 2023
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained
  Diffusion Models and Monocular Depth Estimators
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
Haiping Wang
Yuan Liu
Bing Wang
Yujing Sun
Zhenchao Dong
Wenping Wang
Bisheng Yang
DiffM
38
11
0
05 Oct 2023
Efficient-3DiM: Learning a Generalizable Single-image Novel-view
  Synthesizer in One Day
Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day
Yi Ding
Hao Tang
Jen-Hao Rick Chang
Liangchen Song
Zhangyang Wang
Liangliang Cao
DiffM
43
10
0
04 Oct 2023
Active Visual Localization for Multi-Agent Collaboration: A Data-Driven
  Approach
Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach
Matthew Hanlon
Boyang Sun
Marc Pollefeys
Hermann Blum
20
5
0
04 Oct 2023
NOLA: Compressing LoRA using Linear Combination of Random Basis
NOLA: Compressing LoRA using Linear Combination of Random Basis
Soroush Abbasi Koohpayegani
K. Navaneet
Parsa Nooralinejad
Soheil Kolouri
Hamed Pirsiavash
40
12
0
04 Oct 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive
  Zero-shot Semantic Segmentation
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Xu Zheng
Hiroshi Murase
VLM
19
9
0
03 Oct 2023
LEAP: Liberate Sparse-view 3D Modeling from Camera Poses
LEAP: Liberate Sparse-view 3D Modeling from Camera Poses
Hanwen Jiang
Zhenyu Jiang
Yue Zhao
Qixing Huang
34
37
0
02 Oct 2023
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to
  Video
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
Xinhao Li
Yuhan Zhu
Limin Wang
VLM
35
8
0
02 Oct 2023
HyMNet: a Multimodal Deep Learning System for Hypertension
  Classification using Fundus Photographs and Cardiometabolic Risk Factors
HyMNet: a Multimodal Deep Learning System for Hypertension Classification using Fundus Photographs and Cardiometabolic Risk Factors
Mohammed Baharoon
Hessa Almatar
Reema Alduhayan
Tariq Aldebasi
Badr O. Alahmadi
Yahya Bokhari
M. Alawad
A. Almazroa
Abdulrhman Aljouie
31
0
0
02 Oct 2023
LoCUS: Learning Multiscale 3D-consistent Features from Posed Images
LoCUS: Learning Multiscale 3D-consistent Features from Posed Images
Dominik A. Kloepfer
Dylan Campbell
João F. Henriques
3DPC
3DV
45
0
0
02 Oct 2023
Previous
123...404142434445
Next