DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving

25 May 2025
Chen Shi
Shaoshuai Shi
Kehua Sheng
Bo Zhang
Li Jiang
    VGen
arXiv: 2505.19239

Papers citing "DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving"

23 / 23 papers shown
Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving
Xiang Li
Pengfei Li
Yupeng Zheng
Wei Sun
Yan Wang
Yilun Chen
3DPC
105
2
0
11 Feb 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong Liu
114
13
0
20 Jan 2025
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
Bencheng Liao
Shaoyu Chen
Haoran Yin
Bo Jiang
Cheng Wang
...
Xinbang Zhang
Xiangyu Li
Y. Zhang
Qian Zhang
Xinggang Wang
150
29
0
22 Nov 2024
Diffusion Models Are Real-Time Game Engines
Dani Valevski
Yaniv Leviathan
Moab Arar
Shlomi Fruchter
DiffM
VGen
AI4CE
61
72
0
27 Aug 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffM
VGen
117
453
0
12 Aug 2024
Enhancing End-to-End Autonomous Driving with Latent World Model
Yingyan Li
Lue Fan
Jiawei He
Yuqi Wang
Yuntao Chen
Zhaoxiang Zhang
Tieniu Tan
99
16
0
12 Jun 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
114
86
0
27 May 2024
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Ruiyuan Gao
Kai Chen
Zhihao Li
Lanqing Hong
Zhenguo Li
Qiang Xu
VGen
59
30
0
23 May 2024
DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
Chen Min
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Xinli Xu
...
Yulan Guo
Junliang Xing
Liping Jing
Yiming Nie
Bin Dai
VGen
VLM
53
30
0
07 May 2024
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving
Wenzhao Zheng
Weiliang Chen
Yuanhui Huang
Borui Zhang
Yueqi Duan
Jiwen Lu
VGen
88
77
0
27 Nov 2023
GAIA-1: A Generative World Model for Autonomous Driving
Anthony Hu
Lloyd Russell
Hudson Yeo
Zak Murez
George Fedoseev
Alex Kendall
Jamie Shotton
Gianluca Corrado
VGen
72
230
0
29 Sep 2023
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision
Mingjie Pan
Jiaming Liu
Renrui Zhang
Peixiang Huang
Xiaoqi Li
Bing Wang
Hongwei Xie
Li Liu
Shanghang Zhang
74
84
0
18 Sep 2023
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
98
677
0
10 Nov 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
277
585
0
29 May 2022
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian D. Weilbach
Frank Wood
DiffM
BDL
VGen
190
293
0
23 May 2022
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Yu Qiao
Jifeng Dai
98
1,269
0
31 Mar 2022
Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks
Benedikt Mersch
Xieyuanli Chen
Jens Behley
C. Stachniss
3DPC
68
56
0
28 Sep 2021
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
166
4,993
0
08 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
81
834
0
05 Oct 2020
Center-based 3D Object Detection and Tracking
Tianwei Yin
Xingyi Zhou
Philipp Krähenbühl
3DPC
68
1,589
0
19 Jun 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
299
17,550
0
19 Jun 2020
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yu Pan
G. Baldan
Oscar Beijbom
3DPC
251
5,653
0
26 Mar 2019
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
418
21,951
0
09 Dec 2016