ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.07338
  4. Cited By
Temporal Triplane Transformers as Occupancy World Models

Temporal Triplane Transformers as Occupancy World Models

10 March 2025
Haoran Xu
Peixi Peng
Guang Tan
Yiqian Chang
Yisen Zhao
Yonghong Tian
ArXivPDFHTML

Papers citing "Temporal Triplane Transformers as Occupancy World Models"

50 / 51 papers shown
Title
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework
Junliang Chen
Huaiyuan Xu
Yi Wang
Lap-Pui Chau
87
1
0
24 Feb 2025
Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models
Tianshuo Xu
Hao Lu
Xu Yan
Yingjie Cai
Bingbing Liu
Yingcong Chen
51
4
0
10 Feb 2025
A Survey of World Models for Autonomous Driving
A Survey of World Models for Autonomous Driving
Tuo Feng
Wenguan Wang
Yue Yang
VGen
124
7
0
20 Jan 2025
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for
  Autonomous Driving
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
107
7
0
02 Dec 2024
EventGPT: Event Stream Understanding with Multimodal Large Language
  Models
EventGPT: Event Stream Understanding with Multimodal Large Language Models
Shaoyu Liu
Jianing Li
Guanghui Zhao
Yize Zhang
Xin Meng
Fei Richard Yu
Xiangyang Ji
Ming Li
MLLM
73
1
0
01 Dec 2024
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy
  Forecasting
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting
Jingyi Xu
Xieyuanli Chen
Junyi Ma
Jiawei Huang
Jintao Xu
Yanjie Wang
Ling Pei
80
1
0
21 Nov 2024
RenderWorld: World Model with Self-Supervised 3D Label
RenderWorld: World Model with Self-Supervised 3D Label
Ziyang Yan
Wenzhen Dong
Yihua Shao
Yuhang Lu
Liu Haiyang
...
Haozhe Wang
Zhe Wang
Yan Wang
Fabio Remondino
Yuexin Ma
3DV
VGen
91
14
0
17 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
48
11
0
14 Sep 2024
OccLLaMA: An Occupancy-Language-Action Generative World Model for
  Autonomous Driving
OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving
Julong Wei
Shanshuai Yuan
Pengfei Li
Qingda Hu
Zhongxue Gan
Wenchao Ding
VLM
60
20
0
05 Sep 2024
Vista: A Generalizable Driving World Model with High Fidelity and
  Versatile Controllability
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
114
86
0
27 May 2024
ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D
  Occupancy Perception via View-Guided Transformers
ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
Jinke Li
Xiao He
Chonghua Zhou
Xiaoqiang Cheng
Yang Wen
Dan Zhang
ViT
64
14
0
07 May 2024
Volumetric Environment Representation for Vision-Language Navigation
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
61
26
0
21 Mar 2024
SemCity: Semantic Scene Generation with Triplane Diffusion
SemCity: Semantic Scene Generation with Triplane Diffusion
Jumin Lee
Sebin Lee
Changho Jo
Woobin Im
Juhyeong Seon
Sung-eui Yoon
DiffM
75
19
0
12 Mar 2024
Tri$^{2}$-plane: Thinking Head Avatar via Feature Pyramid
Tri2^{2}2-plane: Thinking Head Avatar via Feature Pyramid
Luchuan Song
Pinxin Liu
Lele Chen
Guojun Yin
Chenliang Xu
3DH
60
7
0
17 Jan 2024
TriNeRFLet: A Wavelet Based Triplane NeRF Representation
TriNeRFLet: A Wavelet Based Triplane NeRF Representation
Rajaei Khatib
Raja Giryes
49
3
0
11 Jan 2024
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy
  Prediction
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction
Qihang Ma
Xin Tan
Yanyun Qu
Lizhuang Ma
Zhizhong Zhang
Yuan Xie
64
39
0
04 Dec 2023
Driving into the Future: Multiview Visual Forecasting and Planning with
  World Model for Autonomous Driving
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Yu-Quan Wang
Jiawei He
Lue Fan
Hongxin Li
Yuntao Chen
Zhaoxiang Zhang
VGen
94
125
0
29 Nov 2023
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in
  Autonomous Driving Applications
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications
Junyi Ma
Xieyuanli Chen
Jiawei Huang
Jingyi Xu
Zhen Luo
Jintao Xu
Weihao Gu
Rui Ai
Hesheng Wang
52
23
0
29 Nov 2023
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving
Wenzhao Zheng
Weiliang Chen
Yuanhui Huang
Borui Zhang
Yueqi Duan
Jiwen Lu
VGen
88
77
0
27 Nov 2023
ADriver-I: A General World Model for Autonomous Driving
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
68
66
0
22 Nov 2023
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View
  Transformation
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
Zhiqi Li
Zhiding Yu
David Austin
Mingsheng Fang
Shiyi Lan
Jan Kautz
J. Álvarez
69
105
0
04 Jul 2023
Scene as Occupancy
Scene as Occupancy
Chonghao Sima
Wenwen Tong
Tai Wang
Li Chen
Silei Wu
...
Yingshuang Gu
Lewei Lu
Ping Luo
Dahua Lin
Hongyang Li
56
146
0
05 Jun 2023
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large
  Language Models
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou
Yicong Hong
Qi Wu
ELM
LM&Ro
LLMAG
LRM
76
146
0
26 May 2023
Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous
  Driving
Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving
Xiaoyu Tian
Tao Jiang
Longfei Yun
Yucheng Mao
Huitong Yang
Yue Wang
Yilun Wang
Hang Zhao
3DPC
3DV
91
213
0
27 Apr 2023
Text2Performer: Text-Driven Human Video Generation
Text2Performer: Text-Driven Human Video Generation
Yuming Jiang
Shuai Yang
Tong Liang Koh
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
VGen
67
49
0
17 Apr 2023
TriVol: Point Cloud Rendering via Triple Volumes
TriVol: Point Cloud Rendering via Triple Volumes
T. Hu
Xiaogang Xu
Ruihang Chu
Jiaya Jia
3DPC
63
16
0
29 Mar 2023
VAD: Vectorized Scene Representation for Efficient Autonomous Driving
VAD: Vectorized Scene Representation for Efficient Autonomous Driving
Bo Jiang
Shaoyu Chen
Qing Xu
Bencheng Liao
Jiajie Chen
Helong Zhou
Qian Zhang
Wenyu Liu
Chang Huang
Xinggang Wang
124
214
0
21 Mar 2023
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
Yi Wei
Linqing Zhao
Wenzhao Zheng
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
3DPC
62
219
0
16 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
907
12,840
0
27 Feb 2023
Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting
Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting
Tarasha Khurana
Peiyun Hu
David Held
Deva Ramanan
3DPC
52
48
0
25 Feb 2023
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
Yuan-Ko Huang
Wenzhao Zheng
Yunpeng Zhang
Jie Zhou
Jiwen Lu
3DPC
56
288
0
15 Feb 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
77
4,015
1
10 Feb 2023
Planning-oriented Autonomous Driving
Planning-oriented Autonomous Driving
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
69
613
0
20 Dec 2022
3D Neural Field Generation using Triplane Diffusion
3D Neural Field Generation using Triplane Diffusion
J. Shue
E. R. Chan
Ryan Po
Zachary Ankner
Jiajun Wu
Gordon Wetzstein
DiffM
84
235
0
30 Nov 2022
Differentiable Raycasting for Self-supervised Occupancy Forecasting
Differentiable Raycasting for Self-supervised Occupancy Forecasting
Tarasha Khurana
Peiyun Hu
Achal Dave
Jason Ziglar
David Held
Deva Ramanan
35
70
0
04 Oct 2022
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal
  Feature Learning
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
Shengchao Hu
Li Chen
Peng Wu
Hongyang Li
Junchi Yan
Dacheng Tao
59
235
0
15 Jul 2022
Hierarchical Latent Structure for Multi-Modal Vehicle Trajectory
  Forecasting
Hierarchical Latent Structure for Multi-Modal Vehicle Trajectory Forecasting
Dooseop Choi
Kyoung‐Wook Min
73
20
0
11 Jul 2022
Self-supervised Point Cloud Prediction Using 3D Spatio-temporal
  Convolutional Networks
Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks
Benedikt Mersch
Xieyuanli Chen
Jens Behley
C. Stachniss
3DPC
68
56
0
28 Sep 2021
End-to-end Interpretable Neural Motion Planner
End-to-end Interpretable Neural Motion Planner
Wenyuan Zeng
Wenjie Luo
Simon Suo
Abbas Sadat
Binh Yang
Sergio Casas
R. Urtasun
3DV
56
403
0
17 Jan 2021
Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning
  Contextual Shape Priors from Scene Completion
Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion
Xu Yan
Jiantao Gao
Jie Li
Ruimao Zhang
Zhen Li
Rui Huang
Shuguang Cui
3DPC
59
266
0
07 Dec 2020
MoNet: Motion-based Point Cloud Prediction Network
MoNet: Motion-based Point Cloud Prediction Network
Fan Lu
Guang Chen
Yinlong Liu
Zhijun Li
Sanqing Qu
Tianpei Zou
3DPC
59
34
0
21 Nov 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
526
41,106
0
28 May 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
244
10,591
0
17 Feb 2020
Generating Diverse High-Fidelity Images with VQ-VAE-2
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
108
1,788
0
02 Jun 2019
nuScenes: A multimodal dataset for autonomous driving
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
253
5,653
0
26 Mar 2019
Recurrent World Models Facilitate Policy Evolution
Recurrent World Models Facilitate Policy Evolution
David R Ha
Jürgen Schmidhuber
SyDa
TPM
109
930
0
04 Sep 2018
World Models
World Models
David R Ha
Jürgen Schmidhuber
SyDa
113
1,062
0
27 Mar 2018
Neural Discrete Representation Learning
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
173
4,928
0
02 Nov 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
501
129,831
0
12 Jun 2017
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.5K
192,638
0
10 Dec 2015
12
Next