ResearchTrend.AI
arXiv: 2504.18576
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment

22 April 2025
Xuzhao Li
Chenming Wu
Zhao Yang
Zhihao Xu
Dingkang Liang
Yanzhe Zhang
Ji Wan
Jiadong Wang
    VGen

Papers citing "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment"

32 / 32 papers shown
Title
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Dingkang Liang
Dingyuan Zhang
Xin Zhou
Sifan Tu
Tianrui Feng
Xiaofan Li
Yumeng Zhang
Mingyang Du
Xiao Tan
Xiang Bai
104
3
0
17 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM, VGen
117
2
0
05 Mar 2025
The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey
Sifan Tu
Xin Zhou
Dingkang Liang
Xingyu Jiang
Yumeng Zhang
Xiaofan Li
Xiang Bai
VGen
100
5
0
14 Feb 2025
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
152
14
0
31 Dec 2024
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Guosheng Zhao
Chaojun Ni
Xiaofeng Wang
Zheng Zhu
Xinming Zhang
...
Xinze Chen
Boyuan Wang
Youyi Zhang
Wenjun Mei
Xingang Wang
VGen
162
32
0
17 Oct 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffM, VGen
245
565
0
12 Aug 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
146
103
0
27 May 2024
DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
Chen Min
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Xinli Xu
...
Yulan Guo
Junliang Xing
Liping Jing
Yiming Nie
Bin Dai
VGen, VLM
69
35
0
07 May 2024
World Models for Autonomous Driving: An Initial Survey
Yanchen Guan
Haicheng Liao
Zhenning Li
Jia Hu
Runze Yuan
Yunjian Li
Guohui Zhang
Chengzhong Xu
148
41
0
05 Mar 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM, VGen
277
278
0
05 Jan 2024
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Yu-Quan Wang
Jiawei He
Lue Fan
Hongxin Li
Yuntao Chen
Zhaoxiang Zhang
VGen
140
143
0
29 Nov 2023
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving
Wenzhao Zheng
Weiliang Chen
Yuanhui Huang
Borui Zhang
Yueqi Duan
Jiwen Lu
VGen
127
90
0
27 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM, VGen
125
231
0
07 Nov 2023
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Xinyuan Chen
Yaohui Wang
Lingjun Zhang
Shaobin Zhuang
Xin Ma
Jiashuo Yu
Yali Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen, DiffM
75
146
0
31 Oct 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
119
252
0
29 Sep 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
116
166
0
18 Sep 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
83
576
0
23 Mar 2023
Compositional 3D Scene Generation using Locally Conditioned Diffusion
Ryan Po
Gordon Wetzstein
DiffM
90
89
0
21 Mar 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
124
2,436
0
19 Dec 2022
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan
Yi Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
B. Guo
DiffM, VGen
119
191
0
19 Dec 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
316
631
0
29 May 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM, VGen
230
1,642
0
07 Apr 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
520
15,788
0
20 Dec 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL, AI4TS, AI4CE, ALM, AIMat
522
10,563
0
17 Jun 2021
DriveGAN: Towards a Controllable High-Quality Neural Simulation
S. Kim
Jonah Philion
Antonio Torralba
Sanja Fidler
94
119
0
30 Apr 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
VGen
176
1,190
0
01 Apr 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM, DiffM
304
7,500
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
782
18,408
0
19 Jun 2020
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Pei Sun
Henrik Kretzschmar
Xerxes Dotiwalla
Aurelien Chouard
Vijaysai Patnaik
...
Shuyang Cheng
Yu Zhang
Jonathon Shlens
Zhifeng Chen
Dragomir Anguelov
152
2,910
0
10 Dec 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
306
5,790
0
26 Mar 2019
World Models
David R Ha
Jürgen Schmidhuber
SyDa
164
1,102
0
27 Mar 2018
ORB-SLAM: a Versatile and Accurate Monocular SLAM System
Raul Mur-Artal
José M.M. Montiel
Juan D. Tardós
133
6,421
0
03 Feb 2015