ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.03572
  4. Cited By
Navigation World Models
v1v2 (latest)

Navigation World Models

4 December 2024
Amir Bar
G. Zhou
Danny Tran
Trevor Darrell
Yann LeCun
    VGenEgoV
ArXiv (abs)PDFHTML

Papers citing "Navigation World Models"

50 / 82 papers shown
Title
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
Weiliang Chen
Wenzhao Zheng
Yu Zheng
Lei Chen
Jie Zhou
Jiwen Lu
Yueqi Duan
VGen
140
0
0
12 Jun 2025
TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy
TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy
Héctor Carrión
Yutong Bai
Víctor A. Hernández Castro
Kishan Panaganti
Ayush Zenith
Matthew Trang
Tony Zhang
Pietro Perona
Jitendra Malik
VGen
30
0
0
12 Jun 2025
Efficient Generation of Diverse Cooperative Agents with World Models
Efficient Generation of Diverse Cooperative Agents with World Models
Yi Loo
Akshunn Trivedi
Malika Meghjani
34
0
0
09 Jun 2025
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model
Hongyan Zhi
Peihao Chen
Siyuan Zhou
Yubo Dong
Quanxi Wu
Lei Han
Mingkui Tan
90
0
0
06 Jun 2025
WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning
Delong Chen
Willy Chung
Yejin Bang
Ziwei Ji
Pascale Fung
VGenLM&Ro
81
0
0
04 Jun 2025
Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation
Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation
Joonkyung Kim
Joonyeol Sim
Woojun Kim
Katia Sycara
Changjoo Nam
103
0
0
04 Jun 2025
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
Yijun Yang
Zhao-Yang Wang
Qiuping Liu
Shuwen Sun
Kang Wang
...
Zongwei Zhou
Alan Yuille
Lei Zhu
Yu Zhang
Jieneng Chen
35
0
0
02 Jun 2025
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
P. Zhang
Yifei Su
Pengyuan Wu
Dong An
Li Zhang
Zhigang Wang
Dong Wang
Yan Ding
Bin Zhao
Xuelong Li
LM&Ro
90
0
0
27 May 2025
WorldEval: World Model as Real-World Robot Policies Evaluator
WorldEval: World Model as Real-World Robot Policies Evaluator
Yaxuan Li
Yichen Zhu
Junjie Wen
Chaomin Shen
Yi Xu
OffRLVGen
41
0
0
25 May 2025
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
Zizhang Li
Hong-Xing Yu
Wei Liu
Yin Yang
Charles Herrmann
Gordon Wetzstein
Jiajun Wu
VGen
92
0
0
23 May 2025
Vid2World: Crafting Video Diffusion Models to Interactive World Models
Vid2World: Crafting Video Diffusion Models to Interactive World Models
Siqiao Huang
Jialong Wu
Qixing Zhou
Shangchen Miao
Mingsheng Long
VGen
64
0
0
20 May 2025
A Survey of Interactive Generative Video
A Survey of Interactive Generative Video
Jiwen Yu
Yiran Qin
Haoxuan Che
Quande Liu
Xinyu Wang
Pengfei Wan
Di Zhang
Kun Gai
Hao Chen
Xihui Liu
VGen
121
3
0
30 Apr 2025
Learned Perceptive Forward Dynamics Model for Safe and Platform-aware Robotic Navigation
Learned Perceptive Forward Dynamics Model for Safe and Platform-aware Robotic Navigation
Pascal Roth
Jonas Frey
Cesar Cadena
Marco Hutter
114
0
0
27 Apr 2025
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
Mengchen Zhang
Tong Wu
Jing Tan
Ziwei Liu
Gordon Wetzstein
Dahua Lin
VGen
113
0
0
09 Apr 2025
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
Xindi Yang
Baolu Li
Yanzhe Zhang
Zhenfei Yin
Lei Bai
...
Zhiyong Wang
Jianfei Cai
Tien-Tsin Wong
Huchuan Lu
Xu Jia
DiffMVGen
150
0
0
30 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
193
8
0
24 Mar 2025
Multi-Agent LLM Actor-Critic Framework for Social Robot Navigation
Weizheng Wang
Ike Obi
Byung-Cheol Min
LLMAG
171
2
0
12 Mar 2025
Fake It To Make It: Virtual Multiviews to Enhance Monocular Indoor Semantic Scene Completion
Anith Selvakumar
Manasa Bharadwaj
108
0
0
07 Mar 2025
GSplatVNM: Point-of-View Synthesis for Visual Navigation Models Using Gaussian Splatting
Kohei Honda
Takeshi Ishita
Yasuhiro Yoshimura
Ryo Yonetani
3DGS
196
1
0
07 Mar 2025
Four Principles for Physically Interpretable World Models
Four Principles for Physically Interpretable World Models
Jordan Peper
Zhenjiang Mao
Yuang Geng
Siyuan Pan
Ivan Ruchkin
158
1
0
04 Mar 2025
Discrete Codebook World Models for Continuous Control
Aidan Scannell
Mohammadreza Nakhaei
Kalle Kujanpää
Yi Zhao
Kevin Sebastian Luck
Dieter Büchler
Joni Pajarinen
OffRL
95
2
0
01 Mar 2025
Strengthening Generative Robot Policies through Predictive World Modeling
Strengthening Generative Robot Policies through Predictive World Modeling
Han Qi
Haocheng Yin
Aris Zhu
Yilun Du
Heng Yang
204
6
0
02 Feb 2025
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
G. Zhou
Hengkai Pan
Yann LeCun
Lerrel Pinto
VGenLM&RoOffRL
107
32
0
07 Nov 2024
VEDIT: Latent Prediction Architecture For Procedural Video
  Representation Learning
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Han Lin
Tushar Nagarajan
Nicolas Ballas
Mido Assran
Mojtaba Komeili
Joey Tianyi Zhou
Koustuv Sinha
AI4TS
114
5
0
04 Oct 2024
Diffusion Models Are Real-Time Game Engines
Diffusion Models Are Real-Time Game Engines
Dani Valevski
Yaniv Leviathan
Moab Arar
Shlomi Fruchter
DiffMVGenAI4CE
147
91
0
27 Aug 2024
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
Basile Van Hoorick
Rundi Wu
Ege Ozguroglu
Kyle Sargent
Ruoshi Liu
P. Tokmakov
Achal Dave
Changxi Zheng
Carl Vondrick
DiffMVGen
127
35
0
23 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
Franccois Fleuret
115
69
0
20 May 2024
Genie: Generative Interactive Environments
Genie: Generative Interactive Environments
Jake Bruce
Michael Dennis
Ashley D. Edwards
Jack Parker-Holder
Yuge Shi
...
Konrad Zolna
Jeff Clune
Nando de Freitas
Satinder Singh
Tim Rocktaschel
VGenVLM
161
188
0
23 Feb 2024
Lumiere: A Space-Time Diffusion Model for Video Generation
Lumiere: A Space-Time Diffusion Model for Video Generation
Omer Bar-Tal
Hila Chefer
Omer Tov
Charles Herrmann
Roni Paiss
...
T. Michaeli
Oliver Wang
Deqing Sun
Tali Dekel
Inbar Mosseri
VGen
220
259
0
23 Jan 2024
VideoPoet: A Large Language Model for Zero-Shot Video Generation
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
155
273
0
21 Dec 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
345
1,190
0
25 Nov 2023
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
155
6
0
22 Nov 2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image
  Conditioning
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
DiffMVGen
143
209
0
17 Nov 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
131
159
0
25 Oct 2023
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration
A. Sridhar
Dhruv Shah
Catherine Glossop
Sergey Levine
112
129
0
11 Oct 2023
Learning Interactive Real-World Simulators
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&RoPINN
90
215
0
09 Oct 2023
Learning to Model the World with Language
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&RoSyDa
135
55
0
31 Jul 2023
ViNT: A Foundation Model for Visual Navigation
ViNT: A Foundation Model for Visual Navigation
Dhruv Shah
A. Sridhar
Nitish Dashora
Kyle Stachowicz
Kevin Black
Noriaki Hirose
Sergey Levine
LM&Ro
83
147
0
26 Jun 2023
DreamSim: Learning New Dimensions of Human Visual Similarity using
  Synthetic Data
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data
Stephanie Fu
Netanel Y. Tamir
Shobhita Sundaram
Lucy Chai
Richard Y. Zhang
Tali Dekel
Phillip Isola
EGVM
100
123
0
15 Jun 2023
SACSoN: Scalable Autonomous Control for Social Navigation
SACSoN: Scalable Autonomous Control for Social Navigation
Noriaki Hirose
Dhruv Shah
A. Sridhar
Sergey Levine
108
33
0
02 Jun 2023
Video Prediction Models as Rewards for Reinforcement Learning
Video Prediction Models as Rewards for Reinforcement Learning
Alejandro Escontrela
Ademi Adeniji
Wilson Yan
Ajay Jain
Xue Bin Peng
Ken Goldberg
Youngwoon Lee
Danijar Hafner
Pieter Abbeel
110
59
0
23 May 2023
Fast Traversability Estimation for Wild Visual Navigation
Fast Traversability Estimation for Wild Visual Navigation
Jonas Frey
Matías Mattamala
Nived Chebrolu
Cesar Cadena
Maurice F. Fallon
Marco Hutter
129
68
0
15 May 2023
Generative Novel View Synthesis with 3D-Aware Diffusion Models
Generative Novel View Synthesis with 3D-Aware Diffusion Models
E. R. Chan
Koki Nagano
Matthew A. Chan
Alexander W. Bergman
Jeong Joon Park
Axel Levy
M. Aittala
Shalini De Mello
Tero Karras
Gordon Wetzstein
DiffM
107
241
0
05 Apr 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
161
1,113
0
20 Mar 2023
Scalable Diffusion Models with Transformers
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
198
2,440
0
19 Dec 2022
MAGVIT: Masked Generative Video Transformer
MAGVIT: Masked Generative Video Transformer
Lijun Yu
Yong Cheng
Kihyuk Sohn
José Lezama
Han Zhang
...
Alexander G. Hauptmann
Ming-Hsuan Yang
Yuan Hao
Irfan Essa
Lu Jiang
DiffMVGen
121
249
0
10 Dec 2022
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained
  Transformers
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Elias Frantar
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
218
1,015
0
31 Oct 2022
GNM: A General Navigation Model to Drive Any Robot
GNM: A General Navigation Model to Drive Any Robot
Dhruv Shah
A. Sridhar
Arjun Bhorkar
Noriaki Hirose
Sergey Levine
134
120
0
07 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
210
1,549
0
05 Oct 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
345
2,445
0
29 Sep 2022
12
Next