ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07193
  4. Cited By
DINOv2: Learning Robust Visual Features without Supervision

DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL
ArXivPDFHTML

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,220 papers shown
Title
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing
  Attack Detection
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection
Laurent Colbois
S´ebastien Marcel
AAML
43
0
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
49
4
0
22 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5%
  Parameters and 90% Performance
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
77
25
0
21 Oct 2024
Granularity Matters in Long-Tail Learning
Granularity Matters in Long-Tail Learning
Shizhen Zhao
Xin Wen
Jiaheng Liu
Chuofan Ma
Chun Yuan
Xiaojuan Qi
34
0
0
21 Oct 2024
Visual Motif Identification: Elaboration of a Curated Comparative
  Dataset and Classification Methods
Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods
Adam Phillips
Daniel Grandes Rodriguez
Miriam Sánchez-Manzano
Alan Salvadó
Manuel Garin
G. Haro
C. Ballester
34
0
0
21 Oct 2024
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts
Xumeng Han
Longhui Wei
Zhiyang Dou
Zipeng Wang
Chenhui Qiang
Xin He
Yingfei Sun
Zhenjun Han
Qi Tian
MoE
50
3
0
21 Oct 2024
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
Jiayi Liu
Denys Iliash
Angel X. Chang
Manolis Savva
Ali Mahdavi-Amiri
67
8
0
21 Oct 2024
Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images
Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images
Yiming Li
Hanchi Ren
Yue Yang
Jingjing Deng
Xianghua Xie
41
0
0
21 Oct 2024
TIPS: Text-Image Pretraining with Spatial awareness
TIPS: Text-Image Pretraining with Spatial awareness
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
40
3
0
21 Oct 2024
Dynamic Contrastive Learning for Time Series Representation
Dynamic Contrastive Learning for Time Series Representation
Abdul-Kazeem Shamba
Kerstin Bach
Gavin Taylor
AI4TS
31
0
0
20 Oct 2024
Upsampling DINOv2 features for unsupervised vision tasks and weakly
  supervised materials segmentation
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation
Ronan Docherty
Antonis Vamvakeros
Samuel J. Cooper
49
1
0
20 Oct 2024
A Survey of Hallucination in Large Visual Language Models
A Survey of Hallucination in Large Visual Language Models
Wei Lan
Wenyi Chen
Qingfeng Chen
Shirui Pan
Huiyu Zhou
Yi-Lun Pan
LRM
38
4
0
20 Oct 2024
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
Junwei Zhou
Xueting Li
Lu Qi
Ming-Hsuan Yang
DiffM
44
2
0
20 Oct 2024
A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and
  Future Trends
A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
Junjun Jiang
Zengyuan Zuo
Gang Wu
Kui Jiang
Xianming Liu
56
11
0
19 Oct 2024
CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic
  Manipulation
CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation
Shangning Xia
Hongjie Fang
Hao-Shu Fang
Cewu Lu
CML
42
5
0
19 Oct 2024
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher
  in One Step
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
Mingyuan Zhou
Huangjie Zheng
Yi Gu
Zhendong Wang
Hai Huang
DiffM
61
7
0
19 Oct 2024
How Do Training Methods Influence the Utilization of Vision Models?
How Do Training Methods Influence the Utilization of Vision Models?
Paul Gavrikov
Shashank Agnihotri
Margret Keuper
J. Keuper
39
2
0
18 Oct 2024
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes
Juliette Marrie
Romain Menegaux
Michael Arbel
Diane Larlus
Julien Mairal
3DGS
46
1
0
18 Oct 2024
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Yuxiang Lu
Shengcao Cao
Yu-xiong Wang
60
1
0
18 Oct 2024
A Survey on Computational Solutions for Reconstructing Complete Objects by Reassembling Their Fractured Parts
A Survey on Computational Solutions for Reconstructing Complete Objects by Reassembling Their Fractured Parts
Jiaxin Lu
Yongqing Liang
Huijun Han
Jiacheng Hua
Junfeng Jiang
Xin Li
Qixing Huang
3DV
53
1
0
18 Oct 2024
On Partial Prototype Collapse in the DINO Family of Self-Supervised
  Methods
On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods
Hariprasath Govindarajan
Per Sidén
Jacob Roll
Fredrik Lindsten
37
2
0
17 Oct 2024
Improving Multi-modal Large Language Model through Boosting Vision
  Capabilities
Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Yanpeng Sun
Han Zhang
Qiang Chen
Xinyu Zhang
Nong Sang
Gang Zhang
Jingdong Wang
Zechao Li
36
5
0
17 Oct 2024
DepthSplat: Connecting Gaussian Splatting and Depth
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
65
31
0
17 Oct 2024
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models
Donghao Zhou
Jiancheng Huang
J. Bai
Jiaze Wang
Hao Chen
Guangyong Chen
Xiaowei Hu
Pheng Ann Heng
50
5
0
17 Oct 2024
Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery
Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery
Chuyu Zhang
Peiyan Gu
Xueyang Yu
Xuming He
37
0
0
17 Oct 2024
Towards Zero-Shot Camera Trap Image Categorization
Towards Zero-Shot Camera Trap Image Categorization
Jiří Vyskočil
Lukas Picek
VLM
28
0
0
16 Oct 2024
DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
Jiabao Wei
Zhiyuan Ma
DiffM
45
0
0
16 Oct 2024
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified
  Perspective
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Yongxin Zhu
Bing Li
Hang Zhang
Xin Li
Linli Xu
Lidong Bing
DiffM
47
9
0
16 Oct 2024
SAM-Guided Masked Token Prediction for 3D Scene Understanding
SAM-Guided Masked Token Prediction for 3D Scene Understanding
Zhimin Chen
Liang Yang
Yingwei Li
Longlong Jing
Bing Li
45
3
0
16 Oct 2024
In-Context Learning Enables Robot Action Prediction in LLMs
In-Context Learning Enables Robot Action Prediction in LLMs
Yida Yin
Zekai Wang
Yuvan Sharma
Dantong Niu
Trevor Darrell
Roei Herzig
LM&Ro
120
2
0
16 Oct 2024
MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained
  Vision-Language Understanding
MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding
Yue Cao
Yangzhou Liu
Zhe Chen
Guangchen Shi
Wenhai Wang
Danhuai Zhao
Tong Lu
60
7
0
15 Oct 2024
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual
  Entities
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Hsin-Ping Huang
Xinyu Wang
Yonatan Bitton
Hagai Taitelbaum
Gaurav Singh Tomar
...
Xuhui Jia
Kelvin Chan
Hexiang Hu
Yu-Chuan Su
Ming-Hsuan Yang
EGVM
75
4
0
15 Oct 2024
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jiaxin Lu
Gang Hua
Qixing Huang
44
2
0
15 Oct 2024
Visual Fixation-Based Retinal Prosthetic Simulation
Visual Fixation-Based Retinal Prosthetic Simulation
Yuli Wu
Do Dinh Tan Nguyen
Henning Konermann
Rüveyda Yilmaz
Peter Walter
Johannes Stegmaier
23
0
0
15 Oct 2024
Hairmony: Fairness-aware hairstyle classification
Hairmony: Fairness-aware hairstyle classification
Givi Meishvili
James Clemoes
Charlie Hewitt
Zafiirah Hosenie
Xian Xiao
...
T. Baltrušaitis
Antonio Criminisi
Chyna McRae
Nina Jablonski
Marta Wilczkowiak
21
1
0
15 Oct 2024
MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian
  Radiance Fields
MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields
Yuru Xiao
Deming Zhai
Wenbo Zhao
Kui Jiang
Junjun Jiang
Xianming Liu
3DGS
32
0
0
15 Oct 2024
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
Yingjun Shen
Haizhao Dai
Qihe Chen
Yan Zeng
Jiakai Zhang
Yuan Pei
Jingyi Yu
26
0
0
15 Oct 2024
Evolutionary Retrofitting
Evolutionary Retrofitting
Mathurin Videau
M. Zameshina
Alessandro Leite
Laurent Najman
Marc Schoenauer
O. Teytaud
46
0
0
15 Oct 2024
Automatically Generating Visual Hallucination Test Cases for Multimodal
  Large Language Models
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models
Zhongye Liu
Hongbin Liu
Yuepeng Hu
Zedian Shao
Neil Zhenqiang Gong
VLM
MLLM
26
0
0
15 Oct 2024
Multiview Scene Graph
Multiview Scene Graph
Juexiao Zhang
Gao Zhu
Sihang Li
Xinhao Liu
Haorui Song
Xinran Tang
Chen Feng
3DV
31
1
0
15 Oct 2024
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography
A. Amadou
Wenjie Qu
Sebastien Piat
Paul Klein
Ingo Schmuecking
Tiziano Passerini
Puneet Sharma
25
5
0
14 Oct 2024
Browsing without Third-Party Cookies: What Do You See?
Browsing without Third-Party Cookies: What Do You See?
Maxwell Lin
Shihan Lin
Helen Wu
Karen Wang
Xiaowei Yang
BDL
59
0
0
14 Oct 2024
Towards Reliable Verification of Unauthorized Data Usage in Personalized
  Text-to-Image Diffusion Models
Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models
Boheng Li
Yanhao Wei
Yankai Fu
Ziyi Wang
Yiming Li
Jie Zhang
Run Wang
Tianwei Zhang
DiffM
AAML
40
9
0
14 Oct 2024
Spatial-Aware Efficient Projector for MLLMs via Multi-Layer Feature
  Aggregation
Spatial-Aware Efficient Projector for MLLMs via Multi-Layer Feature Aggregation
Shun Qian
Bingquan Liu
Chengjie Sun
Zhen Xu
Baoxun Wang
36
0
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
54
0
0
14 Oct 2024
DINTR: Tracking via Diffusion-based Interpolation
DINTR: Tracking via Diffusion-based Interpolation
Pha Nguyen
Ngan Le
J. Cothren
Alper Yilmaz
Khoa Luu
DiffM
48
0
0
14 Oct 2024
Exploring Semi-Supervised Learning for Online Mapping
Exploring Semi-Supervised Learning for Online Mapping
Adam Lilja
Erik Wallin
Junsheng Fu
Lars Hammarstrand
SSL
59
1
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
74
4
0
14 Oct 2024
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
Linshan Wu
Jiaxin Zhuang
Hao Chen
41
5
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion
  Models
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
35
1
0
13 Oct 2024
Previous
123...171819...434445
Next