Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.12288
Cited By
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
23 February 2023
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
VLM
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth"
50 / 321 papers shown
Title
Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting
Paul Engstler
Andrea Vedaldi
Iro Laina
Christian Rupprecht
MDE
37
9
0
30 Apr 2024
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Gabriel H. Sarch
Sahil Somani
Raghav Kapoor
Michael J. Tarr
Katerina Fragkiadaki
LM&Ro
LLMAG
34
3
0
29 Apr 2024
The Third Monocular Depth Estimation Challenge
Jaime Spencer
Fabio Tosi
Matteo Poggi
Ripudaman Singh Arora
Chris Russell
...
Albert Luginov
Muhammad Shahzad
Seyed Hosseini
Aleksander Trajcevski
James H. Elder
MDE
38
7
0
25 Apr 2024
G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition
Kaikai Deng
Dong Zhao
Wenxin Zheng
Yue Ling
Kangwen Yin
Huadong Ma
28
1
0
23 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
38
1
0
21 Apr 2024
SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation
M. Lavrenyuk
MDE
37
2
0
18 Apr 2024
Food Portion Estimation via 3D Object Scaling
Gautham Vinod
Jiangpeng He
Zeman Shao
F. Zhu
27
5
0
18 Apr 2024
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Zhiheng Liu
Ouyang Hao
Qiuyu Wang
Ka Leong Cheng
Jie Xiao
Kai Zhu
Nan Xue
Yu Liu
Yujun Shen
Yang Cao
DiffM
3DGS
45
20
0
17 Apr 2024
Predicting Long-horizon Futures by Conditioning on Geometry and Time
Tarasha Khurana
Deva Ramanan
AI4TS
49
0
0
17 Apr 2024
Taming Latent Diffusion Model for Neural Radiance Field Inpainting
C. Lin
Changil Kim
Jia-Bin Huang
Qinbo Li
Chih-Yao Ma
Johannes Kopf
Ming-Hsuan Yang
Hung-Yu Tseng
AI4CE
DiffM
26
10
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li-Na Song
Wenjun Zhang
Zhiwu Huang
MLLM
44
0
0
15 Apr 2024
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani
Amit Raj
Kevis-Kokitsi Maninis
Abhishek Kar
Yuanzhen Li
Michael Rubinstein
Deqing Sun
Leonidas J. Guibas
Justin Johnson
Varun Jampani
40
79
0
12 Apr 2024
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception
Hefeng Wang
Jiale Cao
Jin Xie
Aiping Yang
Yanwei Pang
VLM
DiffM
50
2
0
11 Apr 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
75
55
0
10 Apr 2024
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
Axel Barroso-Laguna
Sowmya P. Munukutla
V. Prisacariu
Eric Brachmann
3DV
42
12
0
09 Apr 2024
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Yuxi Xiao
Qianqian Wang
Shangzhan Zhang
Nan Xue
Sida Peng
Yujun Shen
Xiaowei Zhou
19
53
0
05 Apr 2024
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
Rui Li
Tobias Fischer
Mattia Segu
Marc Pollefeys
Luc Van Gool
Federico Tombari
21
8
0
04 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
64
4
0
04 Apr 2024
SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
Junyan Ye
Qiyan Luo
Jinhua Yu
Huaping Zhong
Zhimeng Zheng
Conghui He
Weijia Li
34
12
0
03 Apr 2024
TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving
Cheng Zhao
Su Sun
Ruoyu Wang
Yuliang Guo
Jun-Jun Wan
Zhou Huang
Xinyu Huang
Yingjie Victor Chen
Liu Ren
3DGS
45
4
0
03 Apr 2024
SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects
Avinash Ummadisingu
Jongkeum Choi
Koki Yamane
Shimpei Masuda
Naoki Fukaya
Kuniyuki Takahashi
55
2
0
28 Mar 2024
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Christos Sakaridis
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLM
MDE
78
128
0
27 Mar 2024
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Suraj Patni
Aradhye Agarwal
Chetan Arora
VLM
DiffM
MDE
33
26
0
27 Mar 2024
Track Everything Everywhere Fast and Robustly
Yunzhou Song
Jiahui Lei
ZiYun Wang
Lingjie Liu
Kostas Daniilidis
27
5
0
26 Mar 2024
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
Matias Turkulainen
Xuqian Ren
Iaroslav Melekhov
Otto Seiskari
Esa Rahtu
Arno Solin
3DGS
48
57
0
26 Mar 2024
MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors
He Zhang
Shenghao Ren
Haolei Yuan
Jianhui Zhao
Fan Li
Shuangpeng Sun
Zhenghao Liang
Tao Yu
Qiu Shen
Xun Cao
40
4
0
26 Mar 2024
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
Yufu Wang
ZiYun Wang
Lingjie Liu
Kostas Daniilidis
48
25
0
26 Mar 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
54
116
0
22 Mar 2024
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui
Johannes S. Fischer
Ulrich Prestel
Pingchuan Ma
Dmytro Kotovenko
Olga Grebenkova
S. A. Baumann
Vincent Tao Hu
Bjorn Ommer
MDE
36
52
0
20 Mar 2024
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
Chenyang Ma
Kai Lu
Ta-Ying Cheng
Niki Trigoni
Andrew Markham
LRM
37
7
0
18 Mar 2024
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang
Jianfeng Xiang
Jiaolong Yang
Xin Tong
DiffM
37
4
0
18 Mar 2024
Touch-GS: Visual-Tactile Supervised 3D Gaussian Splatting
Aiden Swann
Matthew Strong
Won Kyung Do
Gadiel Sznaier Camps
Mac Schwager
Monroe Kennedy
3DGS
36
9
0
14 Mar 2024
3D-VLA: A 3D Vision-Language-Action Generative World Model
Haoyu Zhen
Xiaowen Qiu
Peihao Chen
Jincheng Yang
Xin Yan
Yilun Du
Yining Hong
Chuang Gan
LM&Ro
VGen
PINN
39
90
0
14 Mar 2024
3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Frank Zhang
Yibo Zhang
Quan Zheng
R. Ma
W. Hua
Hujun Bao
Weiwei Xu
Changqing Zou
54
9
0
14 Mar 2024
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue
Zichao Dong
Bowen Pang
Xufeng Huang
Hang Ji
Xin Zhan
Junbo Chen
3DPC
37
0
0
08 Mar 2024
Scene Depth Estimation from Traditional Oriental Landscape Paintings
Sungho Kang
Yeonghyeon Park
H. Park
Juneho Yi
52
0
0
06 Mar 2024
OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Haochen Shi
Zhiyuan Sun
Xingdi Yuan
Marc-Alexandre Côté
Bang Liu
LLMAG
37
10
0
05 Mar 2024
Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps
Timothy Chen
O. Shorinwa
Joseph Bruno
Javier Yu
Weijia Zeng
Weijia Zeng
Keiko Nagami
Mac Schwager
Mac Schwager
3DGS
40
31
0
05 Mar 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
63
54
0
20 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
36
28
0
19 Feb 2024
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Shoubin Yu
Jaehong Yoon
Mohit Bansal
77
4
0
08 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Guohao Li
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
24
35
0
07 Feb 2024
MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction
Heng Zhou
Zhetao Guo
Shuhong Liu
Lechen Zhang
Qihao Wang
Yuxiang Ren
Mingrui Li
MDE
36
13
0
06 Feb 2024
Extreme Two-View Geometry From Object Poses with Diffusion Models
Yujing Sun
Caiyi Sun
Yuan Liu
Yuexin Ma
S. Yiu
34
2
0
05 Feb 2024
RIDERS: Radar-Infrared Depth Estimation for Robust Sensing
Han Li
Yukai Ma
Yuehao Huang
Yaqing Gu
Weihua Xu
Yong-Jin Liu
Xingxing Zuo
26
4
0
03 Feb 2024
Geometry Transfer for Stylizing Radiance Fields
Hyunyoung Jung
Seonghyeon Nam
N. Sarafianos
Sungjoo Yoo
Alexander Sorkine-Hornung
Rakesh Ranjan
45
10
0
01 Feb 2024
Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM
Zhenzhen Weng
Jingyuan Liu
Hao Tan
Zhan Xu
Yang Zhou
Serena Yeung-Levy
Jimei Yang
3DH
37
8
0
22 Jan 2024
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
Boyuan Chen
Zhuo Xu
Sean Kirmani
Brian Ichter
Danny Driess
Pete Florence
Dorsa Sadigh
Leonidas J. Guibas
Fei Xia
LRM
ReLM
52
206
0
22 Jan 2024
General Flow as Foundation Affordance for Scalable Robot Learning
Chengbo Yuan
Chuan Wen
Tong Zhang
Yang Gao
AI4CE
21
31
0
21 Jan 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
155
709
0
19 Jan 2024
Previous
1
2
3
4
5
6
7
Next