Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.07193
Cited By
DINOv2: Learning Robust Visual Features without Supervision
14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINOv2: Learning Robust Visual Features without Supervision"
50 / 2,220 papers shown
Title
Counterfactual World Modeling for Physical Dynamics Understanding
Rahul Venkatesh
Honglin Chen
Kevin T. Feigelis
Daniel M. Bear
Khaled Jedoui
...
Wanhee Lee
Sherry Liu
Kevin A. Smith
Judith E. Fan
Daniel L. K. Yamins
VGen
45
1
0
11 Dec 2023
SkyScenes: A Synthetic Dataset for Aerial Scene Understanding
Sahil Khose
Anisha Pal
Aayushi Agarwal
Deepanshi
Judy Hoffman
Prithvijit Chattopadhyay
VGen
29
3
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
63
4
0
11 Dec 2023
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding
Karlo Koledić
Luka V. Petrović
Ivan Petrović
Ivan Marković
MDE
28
1
0
10 Dec 2023
NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers?
Ran Zhang
Aida Kostikova
Christoph Leiter
Jonas Belouadi
Daniil Larionov
Yanran Chen
Vivian Fresen
Steffen Eger
53
0
0
09 Dec 2023
Transformer as Linear Expansion of Learngene
Shiyu Xia
Miaosen Zhang
Xu Yang
Ruiming Chen
Haokun Chen
Xin Geng
46
6
0
09 Dec 2023
Cross Domain Generative Augmentation: Domain Generalization with Latent Diffusion Models
S. Hemati
Mahdi Beitollahi
A. Estiri
Bassel Al Omari
Xi Chen
Guojun Zhang
27
6
0
08 Dec 2023
Emergence and Function of Abstract Representations in Self-Supervised Transformers
Quentin RV. Ferry
Joshua Ching
Takashi Kawai
32
2
0
08 Dec 2023
Adapting Vision Transformer for Efficient Change Detection
Yang Zhao
Yuxiang Zhang
Yanni Dong
Bo Du
VLM
51
2
0
08 Dec 2023
Rapid Motor Adaptation for Robotic Manipulator Arms
Yichao Liang
Kevin Ellis
Joao Henriques
35
4
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
34
37
0
07 Dec 2023
Multi-View Unsupervised Image Generation with Cross Attention Guidance
L. Cerkezi
A. Davtyan
Sepehr Sameni
Paolo Favaro
DiffM
30
0
0
07 Dec 2023
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Zhixiang Wei
Lin Chen
Yi Jin
Xiaoxiao Ma
Tianle Liu
Pengyang Lin
Ben Wang
H. Chen
Jinjin Zheng
45
42
0
07 Dec 2023
Fine-tuning vision foundation model for crack segmentation in civil infrastructures
Kang Ge
Chen Wang
Yutao Guo
Yansong Tang
Zhenzhong Hu
Hongbing Chen
VLM
33
22
0
07 Dec 2023
Instance Tracking in 3D Scenes from Egocentric Videos
Yunhan Zhao
Haoyu Ma
Shu Kong
Charless C. Fowlkes
3DPC
41
4
0
07 Dec 2023
Diffusion Illusions: Hiding Images in Plain Sight
R. Burgert
Xiang Li
Abe Leite
Kanchana Ranasinghe
Michael S. Ryoo
58
17
0
06 Dec 2023
Low-shot Object Learning with Mutual Exclusivity Bias
Anh Thai
Ahmad Humayun
Stefan Stojanov
Zixuan Huang
Bikram Boote
James M. Rehg
40
2
0
06 Dec 2023
Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation
Haojie Zhang
Yongyi Su
Xun Xu
Kui Jia
OOD
VLM
42
21
0
06 Dec 2023
Understanding Representations Pretrained with Auxiliary Losses for Embodied Agent Planning
Samrudhdhi B. Rangrej
James J. Clark
SSL
42
0
0
06 Dec 2023
LivePhoto: Real Image Animation with Text-guided Motion Control
Xi Chen
Zhiheng Liu
Mengting Chen
Yutong Feng
Yu Liu
Yujun Shen
Hengshuang Zhao
VGen
DiffM
47
30
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
49
158
0
05 Dec 2023
Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs
Camillo Quattrocchi
Antonino Furnari
Daniele Di Mauro
M. Giuffrida
G. Farinella
36
8
0
05 Dec 2023
GeNIe: Generative Hard Negative Images Through Diffusion
Soroush Abbasi Koohpayegani
Anuj Singh
K. Navaneet
Hadi Jamali Rad
Hamed Pirsiavash
VLM
DiffM
46
4
0
05 Dec 2023
Human Demonstrations are Generalizable Knowledge for Robots
Te Cui
Guangyan Chen
Tianxing Zhou
Zicai Peng
Mengxiao Hu
Haoyang Lu
Haizhou Li
Meiling Wang
Yi Yang
Yufeng Yue
LM&Ro
41
6
0
05 Dec 2023
Class-Discriminative Attention Maps for Vision Transformers
L. Brocki
Jakub Binda
N. C. Chung
MedIm
38
3
0
04 Dec 2023
Instant Uncertainty Calibration of NeRFs Using a Meta-calibrator
Niki Amini-Naieni
Tomas Jakab
Andrea Vedaldi
Ronald Clark
34
1
0
04 Dec 2023
Steerers: A framework for rotation equivariant keypoint descriptors
Georg Bökman
Johan Edstedt
Michael Felsberg
Fredrik Kahl
LLMSV
31
10
0
04 Dec 2023
Bootstrapping SparseFormers from Vision Foundation Models
Ziteng Gao
Zhan Tong
K. Lin
Joya Chen
Mike Zheng Shou
41
0
0
04 Dec 2023
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection
Zhenxin Li
Shiyi Lan
Jose M. Alvarez
Zuxuan Wu
41
16
0
04 Dec 2023
Multi-task Image Restoration Guided By Robust DINO Features
Xin Lin
Chao Ren
Kelvin C. K. Chan
Lu Qi
Jinshan Pan
Ming-Hsuan Yang
44
2
0
04 Dec 2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Lukas Schäfer
Logan Jones
Anssi Kanervisto
Yuhan Cao
Tabish Rashid
Raluca Georgescu
David Bignell
Siddhartha Sen
Andrea Trevino Gavito
Sam Devlin
93
3
0
04 Dec 2023
SANeRF-HQ: Segment Anything for NeRF in High Quality
Yichen Liu
Benran Hu
Chi-Keung Tang
Yu-Wing Tai
41
11
0
03 Dec 2023
Brain Decodes Deep Nets
Huzheng Yang
James C. Gee
Jianbo Shi
38
7
0
03 Dec 2023
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models
Andrea Caraffa
Davide Boscaini
Amir Hamza
Fabio Poiesi
61
15
0
01 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
40
66
0
01 Dec 2023
The role of interface design on prompt-mediated creativity in Generative AI
M. Torricelli
Mauro Martino
Andrea Baronchelli
L. Aiello
35
7
0
30 Nov 2023
FoundPose: Unseen Object Pose Estimation with Foundation Features
Evin Pınar Örnek
Yann Labbé
Bugra Tekin
Lingni Ma
Cem Keskin
Christian Forster
Tomás Hodan
35
48
0
30 Nov 2023
BioCLIP: A Vision Foundation Model for the Tree of Life
Samuel Stevens
Jiaman Wu
Matthew J Thompson
Elizabeth G Campolongo
Chan Hee Song
...
Wasila M Dahdul
Charles V. Stewart
Tanya Berger-Wolf
Wei-Lun Chao
Yu-Chuan Su
49
64
0
30 Nov 2023
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation
Yau Shing Jonathan Cheung
Xi Chen
Lihe Yang
Hengshuang Zhao
37
1
0
30 Nov 2023
Perceptual Group Tokenizer: Building Perception with Iterative Grouping
Zhiwei Deng
Ting Chen
Yang Li
ViT
VLM
29
2
0
30 Nov 2023
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
Raviteja Vemulapalli
Hadi Pouransari
Fartash Faghri
Sachin Mehta
Mehrdad Farajtabar
Mohammad Rastegari
Oncel Tuzel
43
7
0
30 Nov 2023
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features
Thomas Wimmer
Peter Wonka
M. Ovsjanikov
49
9
0
29 Nov 2023
Meta Co-Training: Two Views are Better than One
Jay C. Rothenberger
Dimitrios I. Diochnos
VLM
50
2
0
29 Nov 2023
Do text-free diffusion models learn discriminative visual representations?
Soumik Mukhopadhyay
M. Gwilliam
Yosuke Yamaguchi
Vatsal Agarwal
Namitha Padmanabhan
Archana Swaminathan
Dinesh Manocha
Abhinav Shrivastava
DiffM
37
12
1
29 Nov 2023
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
Shuangrui Ding
Rui Qian
Haohang Xu
Dahua Lin
Hongkai Xiong
VOS
46
4
0
29 Nov 2023
One-Shot Open Affordance Learning with Foundation Models
Gen Li
Deqing Sun
Laura Sevilla-Lara
Varun Jampani
VLM
76
23
0
29 Nov 2023
Federated Fine-Tuning of Foundation Models via Probabilistic Masking
Vasileios Tsouvalas
Yuki M. Asano
Aaqib Saeed
FedML
87
3
0
29 Nov 2023
Adversarial Diffusion Distillation
Axel Sauer
Dominik Lorenz
A. Blattmann
Robin Rombach
138
339
0
28 Nov 2023
Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Eric Chen
Varun Jampani
Deqing Sun
Ming-Hsuan Yang
28
37
0
28 Nov 2023
On Bringing Robots Home
Nur Muhammad (Mahi) Shafiullah
Anant Rai
Haritheja Etukuru
Yiqian Liu
Ishan Misra
Soumith Chintala
Lerrel Pinto
43
77
0
27 Nov 2023
Previous
1
2
3
...
38
39
40
...
43
44
45
Next