Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.07193
Cited By
DINOv2: Learning Robust Visual Features without Supervision
14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINOv2: Learning Robust Visual Features without Supervision"
50 / 2,220 papers shown
Title
Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection
Laurent Colbois
S´ebastien Marcel
AAML
43
0
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
49
4
0
22 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
77
25
0
21 Oct 2024
Granularity Matters in Long-Tail Learning
Shizhen Zhao
Xin Wen
Jiaheng Liu
Chuofan Ma
Chun Yuan
Xiaojuan Qi
34
0
0
21 Oct 2024
Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods
Adam Phillips
Daniel Grandes Rodriguez
Miriam Sánchez-Manzano
Alan Salvadó
Manuel Garin
G. Haro
C. Ballester
34
0
0
21 Oct 2024
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts
Xumeng Han
Longhui Wei
Zhiyang Dou
Zipeng Wang
Chenhui Qiang
Xin He
Yingfei Sun
Zhenjun Han
Qi Tian
MoE
50
3
0
21 Oct 2024
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
Jiayi Liu
Denys Iliash
Angel X. Chang
Manolis Savva
Ali Mahdavi-Amiri
67
8
0
21 Oct 2024
Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images
Yiming Li
Hanchi Ren
Yue Yang
Jingjing Deng
Xianghua Xie
41
0
0
21 Oct 2024
TIPS: Text-Image Pretraining with Spatial awareness
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
40
3
0
21 Oct 2024
Dynamic Contrastive Learning for Time Series Representation
Abdul-Kazeem Shamba
Kerstin Bach
Gavin Taylor
AI4TS
31
0
0
20 Oct 2024
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation
Ronan Docherty
Antonis Vamvakeros
Samuel J. Cooper
49
1
0
20 Oct 2024
A Survey of Hallucination in Large Visual Language Models
Wei Lan
Wenyi Chen
Qingfeng Chen
Shirui Pan
Huiyu Zhou
Yi-Lun Pan
LRM
38
4
0
20 Oct 2024
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
Junwei Zhou
Xueting Li
Lu Qi
Ming-Hsuan Yang
DiffM
44
2
0
20 Oct 2024
A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
Junjun Jiang
Zengyuan Zuo
Gang Wu
Kui Jiang
Xianming Liu
56
11
0
19 Oct 2024
CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation
Shangning Xia
Hongjie Fang
Hao-Shu Fang
Cewu Lu
CML
42
5
0
19 Oct 2024
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
Mingyuan Zhou
Huangjie Zheng
Yi Gu
Zhendong Wang
Hai Huang
DiffM
61
7
0
19 Oct 2024
How Do Training Methods Influence the Utilization of Vision Models?
Paul Gavrikov
Shashank Agnihotri
Margret Keuper
J. Keuper
39
2
0
18 Oct 2024
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes
Juliette Marrie
Romain Menegaux
Michael Arbel
Diane Larlus
Julien Mairal
3DGS
46
1
0
18 Oct 2024
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Yuxiang Lu
Shengcao Cao
Yu-xiong Wang
60
1
0
18 Oct 2024
A Survey on Computational Solutions for Reconstructing Complete Objects by Reassembling Their Fractured Parts
Jiaxin Lu
Yongqing Liang
Huijun Han
Jiacheng Hua
Junfeng Jiang
Xin Li
Qixing Huang
3DV
53
1
0
18 Oct 2024
On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods
Hariprasath Govindarajan
Per Sidén
Jacob Roll
Fredrik Lindsten
37
2
0
17 Oct 2024
Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Yanpeng Sun
Han Zhang
Qiang Chen
Xinyu Zhang
Nong Sang
Gang Zhang
Jingdong Wang
Zechao Li
36
5
0
17 Oct 2024
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
65
31
0
17 Oct 2024
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models
Donghao Zhou
Jiancheng Huang
J. Bai
Jiaze Wang
Hao Chen
Guangyong Chen
Xiaowei Hu
Pheng Ann Heng
50
5
0
17 Oct 2024
Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery
Chuyu Zhang
Peiyan Gu
Xueyang Yu
Xuming He
37
0
0
17 Oct 2024
Towards Zero-Shot Camera Trap Image Categorization
Jiří Vyskočil
Lukas Picek
VLM
28
0
0
16 Oct 2024
DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
Jiabao Wei
Zhiyuan Ma
DiffM
45
0
0
16 Oct 2024
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Yongxin Zhu
Bing Li
Hang Zhang
Xin Li
Linli Xu
Lidong Bing
DiffM
47
9
0
16 Oct 2024
SAM-Guided Masked Token Prediction for 3D Scene Understanding
Zhimin Chen
Liang Yang
Yingwei Li
Longlong Jing
Bing Li
45
3
0
16 Oct 2024
In-Context Learning Enables Robot Action Prediction in LLMs
Yida Yin
Zekai Wang
Yuvan Sharma
Dantong Niu
Trevor Darrell
Roei Herzig
LM&Ro
120
2
0
16 Oct 2024
MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding
Yue Cao
Yangzhou Liu
Zhe Chen
Guangchen Shi
Wenhai Wang
Danhuai Zhao
Tong Lu
60
7
0
15 Oct 2024
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Hsin-Ping Huang
Xinyu Wang
Yonatan Bitton
Hagai Taitelbaum
Gaurav Singh Tomar
...
Xuhui Jia
Kelvin Chan
Hexiang Hu
Yu-Chuan Su
Ming-Hsuan Yang
EGVM
75
4
0
15 Oct 2024
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jiaxin Lu
Gang Hua
Qixing Huang
44
2
0
15 Oct 2024
Visual Fixation-Based Retinal Prosthetic Simulation
Yuli Wu
Do Dinh Tan Nguyen
Henning Konermann
Rüveyda Yilmaz
Peter Walter
Johannes Stegmaier
23
0
0
15 Oct 2024
Hairmony: Fairness-aware hairstyle classification
Givi Meishvili
James Clemoes
Charlie Hewitt
Zafiirah Hosenie
Xian Xiao
...
T. Baltrušaitis
Antonio Criminisi
Chyna McRae
Nina Jablonski
Marta Wilczkowiak
21
1
0
15 Oct 2024
MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields
Yuru Xiao
Deming Zhai
Wenbo Zhao
Kui Jiang
Junjun Jiang
Xianming Liu
3DGS
32
0
0
15 Oct 2024
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
Yingjun Shen
Haizhao Dai
Qihe Chen
Yan Zeng
Jiakai Zhang
Yuan Pei
Jingyi Yu
26
0
0
15 Oct 2024
Evolutionary Retrofitting
Mathurin Videau
M. Zameshina
Alessandro Leite
Laurent Najman
Marc Schoenauer
O. Teytaud
46
0
0
15 Oct 2024
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models
Zhongye Liu
Hongbin Liu
Yuepeng Hu
Zedian Shao
Neil Zhenqiang Gong
VLM
MLLM
26
0
0
15 Oct 2024
Multiview Scene Graph
Juexiao Zhang
Gao Zhu
Sihang Li
Xinhao Liu
Haorui Song
Xinran Tang
Chen Feng
3DV
31
1
0
15 Oct 2024
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography
A. Amadou
Wenjie Qu
Sebastien Piat
Paul Klein
Ingo Schmuecking
Tiziano Passerini
Puneet Sharma
25
5
0
14 Oct 2024
Browsing without Third-Party Cookies: What Do You See?
Maxwell Lin
Shihan Lin
Helen Wu
Karen Wang
Xiaowei Yang
BDL
59
0
0
14 Oct 2024
Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models
Boheng Li
Yanhao Wei
Yankai Fu
Ziyi Wang
Yiming Li
Jie Zhang
Run Wang
Tianwei Zhang
DiffM
AAML
40
9
0
14 Oct 2024
Spatial-Aware Efficient Projector for MLLMs via Multi-Layer Feature Aggregation
Shun Qian
Bingquan Liu
Chengjie Sun
Zhen Xu
Baoxun Wang
36
0
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
54
0
0
14 Oct 2024
DINTR: Tracking via Diffusion-based Interpolation
Pha Nguyen
Ngan Le
J. Cothren
Alper Yilmaz
Khoa Luu
DiffM
48
0
0
14 Oct 2024
Exploring Semi-Supervised Learning for Online Mapping
Adam Lilja
Erik Wallin
Junsheng Fu
Lars Hammarstrand
SSL
59
1
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
74
4
0
14 Oct 2024
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
Linshan Wu
Jiaxin Zhuang
Hao Chen
41
5
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
35
1
0
13 Oct 2024
Previous
1
2
3
...
17
18
19
...
43
44
45
Next