ResearchTrend.AI
DINOv2: Learning Robust Visual Features without Supervision

14 April 2023
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
Vasil Khalidov
Pierre Fernandez
Daniel Haziza
Francisco Massa
Alaaeldin El-Nouby
Mahmoud Assran
Nicolas Ballas
Wojciech Galuba
Russ Howes
Po-Yao (Bernie) Huang
Shang-Wen Li
Ishan Misra
Michael G. Rabbat
Vasu Sharma
Gabriel Synnaeve
Huijiao Xu
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
    VLM
    CLIP
    SSL

Papers citing "DINOv2: Learning Robust Visual Features without Supervision"

50 / 2,220 papers shown
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
Xi Yang
Xu Gu
Xingyilang Yin
Xinbo Gao
49
0
0
06 Nov 2024
Classification Done Right for Vision-Language Pre-Training
Zilong Huang
Qinghao Ye
Bingyi Kang
Jiashi Feng
Haoqi Fan
CLIP
VLM
58
2
0
05 Nov 2024
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
Wenhao Wang
Yue Yang
VGen
60
3
0
05 Nov 2024
Learning Few-Shot Object Placement with Intra-Category Transfer
Adrian Rofer
Russell Buchanan
Max Argus
S. Vijayakumar
Abhinav Valada
53
0
0
05 Nov 2024
Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting
Adrian B. Chłopowiec
Adam R. Chłopowiec
Krzysztof Galus
Wojciech Cebula
Martin Tabakov
MedIm
38
0
0
05 Nov 2024
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective
Qishuai Wen
Chun-Guang Li
ViT
37
0
0
05 Nov 2024
Multi-Transmotion: Pre-trained Model for Human Motion Prediction
Yang Gao
Po-Chien Luan
Alexandre Alahi
46
6
0
04 Nov 2024
Grouped Discrete Representation for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
BDL
OCL
39
1
0
04 Nov 2024
FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training
Ruihong Yin
V. Yugay
Yue Li
Sezer Karaoglu
Theo Gevers
3DGS
50
2
0
04 Nov 2024
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Yuxin Xiao
Chaoqun Wan
Yonggang Zhang
Wenxiao Wang
Binbin Lin
Xiaofei He
Xu Shen
Jieping Ye
29
0
0
04 Nov 2024
Silver medal Solution for Image Matching Challenge 2024
Yian Wang
3DV
3DPC
44
0
0
04 Nov 2024
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Jie Yang
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chen Qian
Ruimao Zhang
MLLM
70
2
0
04 Nov 2024
Task-Oriented Hierarchical Object Decomposition for Visuomotor Control
Jianing Qian
Yunshuang Li
Bernadette Bucher
Dinesh Jayaraman
OCL
49
0
0
02 Nov 2024
AI-EDI-SPACE: A Co-designed Dataset for Evaluating the Quality of Public Spaces
Shreeyash Gowaikar
Hugo Berard
Rashid Mushkani
Emmanuel Beaudry Marchand
Toumadher Ammar
Shin Koseki
35
1
0
01 Nov 2024
CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision
Gi-Cheon Kang
Junghyun Kim
Kyuhwan Shim
Jun Ki Lee
Byoung-Tak Zhang
LM&Ro
118
1
1
01 Nov 2024
Label Noise: Ignorance Is Bliss
Yilun Zhu
Jianxin Zhang
Aditya Gangrade
Clayton Scott
NoLa
52
2
0
31 Oct 2024
Sparsh: Self-supervised touch representations for vision-based tactile sensing
Carolina Higuera
Akash Sharma
Chaithanya Krishna Bodduluri
Taosha Fan
Patrick E. Lancaster
...
Michael Kaess
Byron Boots
Mike Lambeta
Tingfan Wu
Mustafa Mukadam
52
13
0
31 Oct 2024
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
Mathilde Caron
Alireza Fathi
Cordelia Schmid
Ahmet Iscen
47
1
0
31 Oct 2024
ResiDual Transformer Alignment with Spectral Decomposition
Lorenzo Basile
Valentino Maiorca
Luca Bortolussi
Emanuele Rodolà
Francesco Locatello
63
1
0
31 Oct 2024
FRoundation: Are Foundation Models Ready for Face Recognition?
Tahar Chettaoui
Naser Damer
Fadi Boutros
CVBM
43
5
0
31 Oct 2024
FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training
Tejaswini Medi
Steffen Jung
Margret Keuper
AAML
46
3
0
30 Oct 2024
Decoupling Semantic Similarity from Spatial Alignment for Neural Networks
Tassilo Wald
Constantin Ulrich
Gregor Köhler
David Zimmerer
Stefan Denner
Michael Baumgartner
Fabian Isensee
Priyank Jaini
Klaus H. Maier-Hein
45
0
0
30 Oct 2024
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
Qianxu Wang
Congyue Deng
Tyler Ga Wei Lum
Yuanpei Chen
Yaodong Yang
Jeannette Bohg
Yixin Zhu
Leonidas J. Guibas
52
4
0
30 Oct 2024
NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery
Xuesong Li
Zeeshan Hayder
Ali Zia
Connor Cassidy
Shiming Liu
W. Stiller
Eric A. Stone
Warren C. Conaty
Lars Petersson
V. Rolland
40
0
0
30 Oct 2024
Revisiting MAE pre-training for 3D medical image segmentation
Tassilo Wald
Constantin Ulrich
Stanislav Lukyanenko
Andrei Goncharov
Alberto Paderno
Leander Maerkisch
Paul F. Jäger
Klaus Maier-Hein
56
2
0
30 Oct 2024
Dreaming Out Loud: A Self-Synthesis Approach For Training Vision-Language Models With Developmentally Plausible Data
Badr AlKhamissi
Yingtian Tang
Abdülkadir Gökce
Johannes Mehrer
Martin Schrimpf
VLM
59
0
0
29 Oct 2024
A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Zhong Ji
Steve Yang
Jingren Liu
Yanwei Pang
Jungong Han
41
0
0
29 Oct 2024
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
Ruihao Xia
Yu Liang
Peng-Tao Jiang
Hao Zhang
Bo Li
Yang Tang
Pan Zhou
DiffM
VLM
49
1
0
29 Oct 2024
AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery
Yuxun Qu
Yongqiang Tang
Chenyang Zhang
Wensheng Zhang
36
0
0
29 Oct 2024
OFER: Occluded Face Expression Reconstruction
Pratheba Selvaraju
Victoria Fernandez-Abrevaya
Timo Bolkart
Rick Akkerman
Tianyu Ding
F. Amjadi
Ilya Zharkov
DiffM
CVBM
3DH
40
0
0
29 Oct 2024
NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments
Taiyi Pan
Junyang He
Chao-Yeh Chen
Yiming Li
Chen Feng
43
2
0
28 Oct 2024
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization
Wanhua Li
Zibin Meng
Jiawei Zhou
D. Wei
Chuang Gan
Hanspeter Pfister
LRM
VLM
29
5
0
28 Oct 2024
FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space
Yiyang Guo
Ruizhe Li
Mude Hui
Hanzhong Guo
Chen Zhang
Chuangjian Cai
Le Wan
Shangfei Wang
29
0
0
28 Oct 2024
Novel Object Synthesis via Adaptive Text-Image Harmony
Zeren Xiong
Zedong Zhang
Zikun Chen
Shuo Chen
Xianrui Li
Gan Sun
Jian Yang
Jun Li
DiffM
53
4
0
28 Oct 2024
Multi-modal AI for comprehensive breast cancer prognostication
Jan Witowski
Ken Zeng
Joseph Cappadona
Jailan Elayoubi
Elena Diana Chiru
...
Adam Brufsky
Francisco J. Esteva
Lajos Pusztai
Yann LeCun
Krzysztof J. Geras
25
1
0
28 Oct 2024
Accelerating Augmentation Invariance Pretraining
Jinhong Lin
Cheng-En Wu
Yibing Wei
Pedro Morgado
ViT
33
1
0
27 Oct 2024
Neural Fields in Robotics: A Survey
Muhammad Zubair Irshad
Mauro Comi
Yen-Chen Lin
Nick Heppert
Abhinav Valada
Rares Andrei Ambrus
Z. Kira
Jonathan Tremblay
AI4CE
60
4
0
26 Oct 2024
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
Xuanchi Ren
Y. Lu
Hanxue Liang
Zhangjie Wu
Huan Ling
Mike Chen
Sanja Fidler
Francis Williams
Jiahui Huang
3DGS
50
8
0
26 Oct 2024
Model merging with SVD to tie the Knots
George Stoica
Pratik Ramesh
B. Ecsedi
Leshem Choshen
Judy Hoffman
MoMe
44
9
0
25 Oct 2024
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
VLM
49
3
0
25 Oct 2024
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
Rajat Modi
Vibhav Vineet
Yogesh S Rawat
42
1
0
25 Oct 2024
Brain-like Functional Organization within Large Language Models
Haiyang Sun
Lin Zhao
Zihao Wu
Xiaohui Gao
Yutao Hu
Mengfei Zuo
Wenbo Zhang
Junwei Han
Tianming Liu
X. Hu
36
0
0
25 Oct 2024
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
Ondrej Biza
Thomas Weng
Lingfeng Sun
Karl Schmeckpeper
Tarik Kelestemur
Yecheng Jason Ma
Robert Platt
Jan-Willem van de Meent
Lawson L. S. Wong
OffRL
52
0
0
25 Oct 2024
SegLLM: Multi-round Reasoning Segmentation
XuDong Wang
Shaolun Zhang
Shufan Li
Konstantinos Kallidromitis
Kehan Li
Yusuke Kato
Kazuki Kozuka
Trevor Darrell
VLM
LRM
58
2
0
24 Oct 2024
From Efficiency to Equity: Measuring Fairness in Preference Learning
Shreeyash Gowaikar
Hugo Berard
Rashid Mushkani
Shin Koseki
33
0
0
24 Oct 2024
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
67
30
0
24 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
33
0
0
23 Oct 2024
SRA: A Novel Method to Improve Feature Embedding in Self-supervised Learning for Histopathological Images
Hamid Manoochehri
Bodong Zhang
Beatrice Knudsen
Tolga Tasdizen
34
1
0
23 Oct 2024
X-MOBILITY: End-To-End Generalizable Navigation via World Modeling
Wei Liu
Huihua Zhao
Chenran Li
Joydeep Biswas
Billy Okal
Pulkit Goyal
Yan Chang
Soha Pouya
29
4
0
23 Oct 2024
SigCLR: Sigmoid Contrastive Learning of Visual Representations
Ömer Veysel Çağatan
29
0
0
22 Oct 2024