ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.10891
  4. Cited By
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

19 January 2024
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
    VLM
ArXivPDFHTML

Papers citing "Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data"

50 / 146 papers shown
Title
Rethinking Encoder-Decoder Flow Through Shared Structures
Rethinking Encoder-Decoder Flow Through Shared Structures
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
45
0
0
24 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Jianing Yang
Alexander Sax
Kevin J Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
73
16
0
23 Jan 2025
Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks
Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks
Alessio Quercia
Erenus Yildiz
Zhuo Cao
Kai Krajsek
Abigail Morrison
Ira Assent
Hanno Scharr
56
0
0
22 Jan 2025
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Yiyang Wang
Xi Chen
Xiaogang Xu
S. Ji
Y. Liu
Yujun Shen
Hengshuang Zhao
DiffM
49
0
0
21 Jan 2025
Survey on Monocular Metric Depth Estimation
Survey on Monocular Metric Depth Estimation
Jiuling Zhang
VLM
69
0
0
21 Jan 2025
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Shuang Cui
Yi Li
Jiangmeng Li
Xiongxin Tang
Bing-Huang Su
Fanjiang Xu
Hui Xiong
53
0
0
15 Jan 2025
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
Xingyi He He
Hao Yu
Sida Peng
Dongli Tan
Zehong Shen
Hujun Bao
Xiaowei Zhou
48
4
0
13 Jan 2025
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Ziyang Xie
Zhizheng Liu
Zhenghao Peng
Wayne Wu
Bolei Zhou
VGen
48
3
0
12 Jan 2025
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
Rahul Sajnani
Jeroen Vanbaar
Jie Min
Kapil D. Katyal
Srinath Sridhar
DiffM
59
11
0
03 Jan 2025
NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment
NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment
Andrea Dunn Beltran
Daniel Rho
Marc Niethammer
Roni Sengupta
Roni Sengupta
90
2
0
17 Dec 2024
RoMeO: Robust Metric Visual Odometry
RoMeO: Robust Metric Visual Odometry
JunDa Cheng
Z. Cai
Zhaoxing Zhang
Wei Yin
Matthias Müller
Michael Paulitsch
Xin Yang
93
0
0
16 Dec 2024
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
Ruijie Lu
Yixin Chen
Junfeng Ni
Baoxiong Jia
Yu Liu
Diwen Wan
Gang Zeng
Siyuan Huang
DiffM
127
4
0
16 Dec 2024
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
Yi Feng
Yu Han
Xijing Zhang
Tanghui Li
Yanting Zhang
Rui Fan
114
3
0
15 Dec 2024
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
94
0
0
01 Dec 2024
T-3DGS: Removing Transient Objects for 3D Scene Reconstruction
T-3DGS: Removing Transient Objects for 3D Scene Reconstruction
Vadim Pryadilshchikov
Alexander Markin
Artem Komarichev
Ruslan Rakhimov
Peter Wonka
Evgeny Burnaev
3DGS
79
1
0
29 Nov 2024
PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors
PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors
Guangshun Wei
Yuan Feng
Long Ma
Chen Wang
Yuanfeng Zhou
Changjian Li
192
0
0
28 Nov 2024
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Mehdi Zayene
Jannik Endres
Albias Havolli
Charles Corbière
Salim Cherkaoui
Alexandre Kontouli
Alexandre Alahi
MDE
147
1
0
27 Nov 2024
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Sudarshan Rajagopalan
Nithin Gopalakrishnan Nair
Jay N. Paranjape
Vishal M. Patel
DiffM
90
0
0
26 Nov 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
153
3
0
26 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
108
2
0
24 Nov 2024
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Zhen Lv
Yangqi Long
Congzhentao Huang
Cao Li
Chengfei Lv
Hao Ren
Dian Zheng
DiffM
VGen
MDE
114
5
0
18 Nov 2024
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution
Huan Zheng
Wencheng Han
Jianbing Shen
65
1
0
05 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for
  driving scenarios
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
M. Tomizuka
W. Zhan
DiffM
41
2
0
02 Nov 2024
Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes
Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes
Shaohua Liu
Junzhe Lu
Zuoya Gu
Jiajun Li
Yue Deng
3DGS
41
0
0
31 Oct 2024
Local Policies Enable Zero-shot Long-horizon Manipulation
Local Policies Enable Zero-shot Long-horizon Manipulation
Murtaza Dalal
Min Liu
Walter Talbott
Chen Chen
Deepak Pathak
Jian Zhang
Ruslan Salakhutdinov
38
3
0
29 Oct 2024
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
60
30
0
24 Oct 2024
TIPS: Text-Image Pretraining with Spatial awareness
TIPS: Text-Image Pretraining with Spatial awareness
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
35
3
0
21 Oct 2024
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Yuxiang Lu
Shengcao Cao
Yu-xiong Wang
49
1
0
18 Oct 2024
DepthSplat: Connecting Gaussian Splatting and Depth
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
52
29
0
17 Oct 2024
A Simple Approach to Unifying Diffusion-based Conditional Generation
A Simple Approach to Unifying Diffusion-based Conditional Generation
Xirui Li
Charles Herrmann
Kelvin C.K. Chan
Yinxiao Li
Deqing Sun
Chao Ma
Ming Yang
DiffM
VLM
43
1
0
15 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
71
5
0
14 Oct 2024
Browsing without Third-Party Cookies: What Do You See?
Browsing without Third-Party Cookies: What Do You See?
Maxwell Lin
Shihan Lin
Helen Wu
Karen Wang
Xiaowei Yang
BDL
53
0
0
14 Oct 2024
Surgical Depth Anything: Depth Estimation for Surgical Scenes using
  Foundation Models
Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models
Ange Lou
Yamin Li
Yike Zhang
Jack Noble
MedIm
31
4
0
09 Oct 2024
Refinement of Monocular Depth Maps via Multi-View Differentiable
  Rendering
Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering
Laura Fink
Linus Franke
Joachim Keinert
Marc Stamminger
MDE
3DV
35
0
0
04 Oct 2024
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang
Charles Herrmann
Junhwa Hur
Varun Jampani
Trevor Darrell
Forrester Cole
Deqing Sun
Ming Yang
VGen
89
70
0
04 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through
  Language Descriptions
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
20
7
0
03 Oct 2024
SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole
  Segmentation
SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation
Osher Rafaeli
T. Svoray
Ariel Nahlieli
28
0
0
02 Oct 2024
Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker
Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker
Xinlong Hou
Sen Shen
Xueshen Li
Xinran Gao
Ziyi Huang
Steven J. Holiday
Matthew R. Cribbet
Susan W. White
Edward Sazonov
Yu Gan
34
0
0
02 Oct 2024
GTransPDM: A Graph-embedded Transformer with Positional Decoupling for Pedestrian Crossing Intention Prediction
GTransPDM: A Graph-embedded Transformer with Positional Decoupling for Pedestrian Crossing Intention Prediction
Chen Xie
Ciyun Lin
Xiaoyu Zheng
Bowen Gong
Dayong Wu
ViT
41
0
0
30 Sep 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
46
40
0
26 Sep 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed
  Modeling
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
29
18
0
24 Sep 2024
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Safe Navigation for Robotic Digestive Endoscopy via Human Intervention-based Reinforcement Learning
Min Tan
Yushun Tao
Boyun Zheng
GaoSheng Xie
Lijuan Feng
Zeyang Xia
Jing Xiong
27
0
0
24 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
57
10
0
23 Sep 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia
Karim Abou Zeid
Christian Schmidt
Daan de Geus
Alexander Hermans
Bastian Leibe
39
24
0
17 Sep 2024
Towards Real-Time Generation of Delay-Compensated Video Feeds for Outdoor Mobile Robot Teleoperation
Towards Real-Time Generation of Delay-Compensated Video Feeds for Outdoor Mobile Robot Teleoperation
Neeloy Chakraborty
Yixiao Fang
Andre Schreiber
Tianchen Ji
Zhe Huang
Aganze Mihigo
Cassidy Wall
Abdulrahman Almana
Katherine Driggs-Campbell
33
0
0
16 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
38
3
0
16 Sep 2024
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Vitor Campagnolo Guizilini
P. Tokmakov
Achal Dave
Rares Ambrus
DiffM
28
2
0
15 Sep 2024
Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy
Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy
Bojian Li
Bo Liu
Jinghua Yue
F. Zhou
Fugen Zhou
MedIm
MDE
45
2
0
12 Sep 2024
Towards Generalizable Scene Change Detection
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
48
0
0
10 Sep 2024
EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels
EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels
Qingyao Tian
Zhen Chen
Huai Liao
Xinyan Huang
Lujie Li
Sebastien Ourselin
Hongbin Liu
95
1
0
09 Sep 2024
Previous
123
Next