2309.09777
Cited By
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
18 September 2023
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
Papers citing "DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving"
50 / 52 papers shown
ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos
Xiaodong Wang
Peixi Peng
VGen
1.1K
0
0
24 May 2025
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
Shuang Zeng
Xinyuan Chang
Mengwei Xie
Xinran Liu
Yifan Bai
Zheng Pan
Mu Xu
Xing Wei
LRM
105
0
0
23 May 2025
Challenger: Affordable Adversarial Driving Video Generation
Zhiyuan Xu
Bohan Li
Huan-ang Gao
Mingju Gao
Yong Chen
Ming-Yuan Liu
Chenxu Yan
Hang Zhao
Shuo Feng
Hao Zhao
AAML
VGen
236
1
0
21 May 2025
End-to-End Driving with Online Trajectory Evaluation via BEV World Model
Yingyan Li
Yuqi Wang
Yang Liu
Jiawei He
Lue Fan
Zhaoxiang Zhang
OffRL
423
2
0
02 Apr 2025
Generating Multimodal Driving Scenes via Next-Scene Prediction
Yanhao Wu
Haoyang Zhang
Tianwei Lin
Lichao Huang
Shujie Luo
Rui Wu
Congpei Qiu
Wei Ke
Tong Zhang
VGen
81
0
0
19 Mar 2025
Advances in 4D Generation: A Survey
Qiaowei Miao
Kehan Li
Jinsheng Quan
Zhiyuan Min
Shaojie Ma
Yichao Xu
Yi Yang
Yawei Luo
126
1
0
18 Mar 2025
Unlock the Power of Unlabeled Data in Language Driving Model
Chaoqun Wang
Jie Yang
Xiaobin Hong
Ruimao Zhang
106
0
0
13 Mar 2025
A Survey of World Models for Autonomous Driving
Tuo Feng
Wenguan Wang
Yue Yang
VGen
124
7
0
20 Jan 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong Liu
119
14
0
20 Jan 2025
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Ziyang Xie
Zhizheng Liu
Zhenghao Peng
Wayne Wu
Bolei Zhou
VGen
121
5
0
12 Jan 2025
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes
Yiyuan Liang
Zhiying Yan
Liqun Chen
Jiahuan Zhou
Luxin Yan
Sheng Zhong
Xu Zou
DiffM
VGen
69
1
0
31 Dec 2024
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
104
13
0
31 Dec 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen
DiffM
105
3
0
06 Sep 2024
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
95
77
0
11 Oct 2023
CityDreamer: Compositional Generative Model of Unbounded 3D Cities
Haozhe Xie
Zhaoxi Chen
Fangzhou Hong
Ziwei Liu
67
39
0
01 Sep 2023
BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Kairui Yang
Enhui Ma
Jibing Peng
Qing Guo
Di Lin
Kaicheng Yu
DiffM
66
60
0
03 Aug 2023
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&Ro
SyDa
65
54
0
31 Jul 2023
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen
Yuxiao Chen
Ningxin Jiao
Kui Jia
DiffM
89
583
0
24 Mar 2023
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou
Xintao Wang
Liangbin Xie
Yanze Wu
Shuai Liu
Zhongang Qi
Ying Shan
Xiaohu Qie
DiffM
60
1,010
0
16 Feb 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
103
4,074
1
10 Feb 2023
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
100
1,518
0
05 Oct 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
74
1,399
0
29 Sep 2022
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
Shengchao Hu
Li Chen
Peng Wu
Hongyang Li
Junchi Yan
Dacheng Tao
68
239
0
15 Jul 2022
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
135
148
0
28 Jun 2022
DayDreamer: World Models for Physical Robot Learning
Philipp Wu
Alejandro Escontrela
Danijar Hafner
Ken Goldberg
Pieter Abbeel
97
293
0
28 Jun 2022
Diffusion Models for Video Prediction and Infilling
Tobias Höppe
Arash Mehrjou
Stefan Bauer
Didrik Nielsen
Andrea Dittadi
DiffM
VGen
82
132
0
15 Jun 2022
Deep Hierarchical Planning from Pixels
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
60
94
0
08 Jun 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
71
40
0
27 May 2022
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
146
903
0
26 May 2022
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian D. Weilbach
Frank Wood
DiffM
BDL
VGen
214
295
0
23 May 2022
BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving
Yunpeng Zhang
Zheng Hua Zhu
Wenzhao Zheng
Junjie Huang
Guan Huang
Jie Zhou
Jiwen Lu
73
192
0
19 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
339
6,830
0
13 Apr 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
115
266
0
16 Mar 2022
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
286
3,571
0
20 Dec 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
136
1,213
0
30 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
178
7,765
0
11 May 2021
DriveGAN: Towards a Controllable High-Quality Neural Simulation
S. Kim
Jonah Philion
Antonio Torralba
Sanja Fidler
50
113
0
30 Apr 2021
FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection
Tai Wang
Xinge Zhu
Jiangmiao Pang
Dahua Lin
3DPC
66
596
0
22 Apr 2021
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
285
3,648
0
18 Feb 2021
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
93
849
0
05 Oct 2020
Learning to Simulate Dynamic Environments with GameGAN
Seung Wook Kim
Yuhao Zhou
Jonah Philion
Antonio Torralba
Sanja Fidler
GAN
66
102
0
25 May 2020
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
B. Mildenhall
Pratul P. Srinivasan
Matthew Tancik
Jonathan T. Barron
R. Ramamoorthi
Ren Ng
114
2,597
0
19 Mar 2020
Stochastic Latent Residual Video Prediction
Jean-Yves Franceschi
E. Delasalles
Mickaël Chen
Sylvain Lamprier
Patrick Gallinari
VGen
55
159
0
21 Feb 2020
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
108
1,349
0
03 Dec 2019
Exploring the Limitations of Behavior Cloning for Autonomous Driving
Felipe Codevilla
Eder Santana
Antonio M. López
Adrien Gaidon
46
542
0
18 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
271
5,705
0
26 Mar 2019
Learning to Decompose and Disentangle Representations for Video Prediction
Jun-Ting Hsieh
Bingbin Liu
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
DRL
171
306
0
11 Jun 2018
Stochastic Video Generation with a Learned Prior
Emily L. Denton
Rob Fergus
VGen
80
525
0
21 Feb 2018
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing
E. Uchibe
Kenji Doya
112
1,717
0
10 Feb 2017
Generating Videos with Scene Dynamics
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
GAN
VGen
174
1,468
0
08 Sep 2016