Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.12387
Cited By
Continuous 3D Perception Model with Persistent State
21 January 2025
Qianqian Wang
Yifei Zhang
Aleksander Holyñski
Alexei A. Efros
Angjoo Kanazawa
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Continuous 3D Perception Model with Persistent State"
27 / 27 papers shown
Title
Toward Memory-Aided World Models: Benchmarking via Spatial Consistency
Kewei Lian
Shaofei Cai
Yilun Du
Yitao Liang
40
0
0
29 May 2025
Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping
Justin Lazarow
Kai Kang
Afshin Dehghan
3DPC
30
0
0
29 May 2025
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
Boyang Wang
Xuweiyi Chen
Matheus Gadelha
Zezhou Cheng
DiffM
VGen
67
0
0
27 May 2025
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Runsen Xu
Weiyao Wang
Hao Tang
Xingyu Chen
Xiaodong Wang
Fu-Jen Chu
Dahua Lin
Matt Feiszli
Kevin J. Liang
LRM
85
1
0
22 May 2025
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
Dominic Maggio
Hyungtae Lim
Luca Carlone
58
0
0
18 May 2025
Visual Imitation Enables Contextual Humanoid Control
Arthur Allshire
Hongsuk Choi
Junyi Zhang
David McAllister
Anthony Zhang
Chung Min Kim
Trevor Darrell
Pieter Abbeel
Jitendra Malik
Angjoo Kanazawa
LM&Ro
366
0
0
06 May 2025
RayZer: A Self-supervised Large View Synthesis Model
Hanwen Jiang
Hao Tan
Peng Wang
Haian Jin
Yue Zhao
...
Kai Zhang
Fujun Luan
Kalyan Sunkavalli
Qixing Huang
Georgios Pavlakos
105
0
0
01 May 2025
Dynamic Camera Poses and Where to Find Them
C. Rockwell
Joseph Tung
Nayeon Lee
Xuan Li
David Fouhey
Chen-Hsuan Lin
88
0
0
24 Apr 2025
Towards Understanding Camera Motions in Any Video
Zhiqiu Lin
Siyuan Cen
Daniel Jiang
Jay Karhade
Hewei Wang
...
Rushikesh Zawar
Xue Bai
Yilun Du
Chuang Gan
Deva Ramanan
VGen
78
1
0
21 Apr 2025
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
Wenyu Li
Sidun Liu
Peng Qiao
Yong Dou
82
0
0
18 Apr 2025
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World
Haiwen Feng
Junyi Zhang
Qianqian Wang
Yufei Ye
Pengcheng Yu
Michael J. Black
Trevor Darrell
Angjoo Kanazawa
VGen
3DV
102
2
0
17 Apr 2025
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Jiaxin Huang
Sheng Miao
BangBnag Yang
Yuewen Ma
Yiyi Liao
VGen
MDE
121
0
0
15 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
96
0
0
09 Apr 2025
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction
Songyan Zhang
Yongtao Ge
Jinyuan Tian
Guangkai Xu
Hao Chen
Chen Lv
Chunhua Shen
3DPC
67
0
0
08 Apr 2025
3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS
Zhisheng Huang
Peng Wang
Jingdong Zhang
Yuan Liu
Xin Li
Wenping Wang
3DGS
115
0
0
05 Apr 2025
Can Test-Time Scaling Improve World Foundation Model?
Wenyan Cong
Hanqing Zhu
Peihao Wang
Bangya Liu
Dejia Xu
Kevin Wang
David Z. Pan
Yan Wang
Zhiwen Fan
Ziyi Wang
109
1
0
31 Mar 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Xingyu Chen
Yue Chen
Yuliang Xiu
Andreas Geiger
Anpei Chen
3DPC
VGen
88
5
0
31 Mar 2025
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion
Jangho Park
Taesung Kwon
Jong Chul Ye
VGen
84
1
0
28 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGen
MDE
133
0
0
27 Mar 2025
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Yanjie Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffM
VGen
102
5
0
24 Mar 2025
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image
Jerred Chen
Ronald Clark
86
1
0
21 Mar 2025
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
Edgar Sucar
Zihang Lai
Eldar Insafutdinov
Andrea Vedaldi
70
0
0
20 Mar 2025
A Recipe for Generating 3D Worlds From a Single Image
Katja Schwarz
Denys Rozumnyi
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
VGen
114
3
0
20 Mar 2025
VGGT: Visual Geometry Grounded Transformer
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
106
29
0
14 Mar 2025
Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression
Thibaut Loiseau
Guillaume Bourmaud
Vincent Lepetit
99
0
0
10 Mar 2025
From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs
Ang Cao
Sergio Arnaud
Oleksandr Maksymets
Jianing Yang
Ayush Jain
...
Aravind Rajeswaran
Franziska Meier
Justin Johnson
Jeong Joon Park
Alexander Sax
104
0
0
27 Feb 2025
L4P: Low-Level 4D Vision Perception Unified
Abhishek Badki
Hang Su
Bowen Wen
Orazio Gallo
VLM
125
1
0
18 Feb 2025
1