ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.07573
  4. Cited By
MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for
  End-to-End Autonomous Driving

MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving

13 May 2024
Yiqun Duan
Xianda Guo
Zheng Zhu
Zhen Wang
Yu-Kai Wang
Chin-Teng Lin
ArXiv (abs)PDFHTML

Papers citing "MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving"

38 / 38 papers shown
Title
GenAD: Generative End-to-End Autonomous Driving
GenAD: Generative End-to-End Autonomous Driving
Wenzhao Zheng
Ruiqi Song
Xianda Guo
Chenming Zhang
Long Chen
105
67
0
18 Feb 2024
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion
  Transformer
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Hao Shao
Letian Wang
Ruobing Chen
Hongsheng Li
Y. Liu
91
207
0
28 Jul 2022
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal
  Feature Learning
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
Shengchao Hu
Li Chen
Peng Wu
Hongyang Li
Junchi Yan
Dacheng Tao
90
250
0
15 Jul 2022
MMFN: Multi-Modal-Fusion-Net for End-to-End Driving
MMFN: Multi-Modal-Fusion-Net for End-to-End Driving
Qingwen Zhang
Mingkai Tang
R. Geng
Feiyi Chen
Ren Xin
Lujia Wang
80
36
0
01 Jul 2022
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object
  Detection
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
Yinhao Li
Zheng Ge
Guanyi Yu
Jinrong Yang
Zengran Wang
Yukang Shi
Jian‐Yuan Sun
Zeming Li
MDE
82
615
0
21 Jun 2022
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View
  Representation
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
153
908
0
26 May 2022
Learning from All Vehicles
Learning from All Vehicles
Dian Chen
Philipp Krahenbuhl
97
185
0
22 Mar 2022
BEVDet: High-performance Multi-camera 3D Object Detection in
  Bird-Eye-View
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View
Junjie Huang
Guan Huang
Zheng Zhu
Yun Ye
Dalong Du
3DPC
100
704
0
22 Dec 2021
Uncertainty Estimation via Response Scaling for Pseudo-mask Noise
  Mitigation in Weakly-supervised Semantic Segmentation
Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation
Yi Li
Yiqun Duan
Zhanghui Kuang
Yimin Chen
Wayne Zhang
Xiaomeng Li
69
73
0
14 Dec 2021
GRI: General Reinforced Imitation and its Application to Vision-Based
  Autonomous Driving
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
75
61
0
16 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViTTPM
467
7,814
0
11 Nov 2021
NEAT: Neural Attention Fields for End-to-End Autonomous Driving
NEAT: Neural Attention Fields for End-to-End Autonomous Driving
Kashyap Chitta
Aditya Prakash
Andreas Geiger
3DPC
90
213
0
09 Sep 2021
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
Aljavz Bovzivc
Pablo Rodríguez Palafox
Justus Thies
Angela Dai
Matthias Nießner
ViT
89
138
0
05 Jul 2021
RoadMap: A Light-Weight Semantic Map for Visual Localization towards
  Autonomous Driving
RoadMap: A Light-Weight Semantic Map for Visual Localization towards Autonomous Driving
Tong Qin
Yucai Zheng
Tongqing Chen
Yilun Chen
Qing Su
44
106
0
04 Jun 2021
FIERY: Future Instance Prediction in Bird's-Eye View from Surround
  Monocular Cameras
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
Anthony Hu
Zak Murez
Nikhil C. Mohan
Sofía Dudas
Jeffrey Hawke
Vijay Badrinarayanan
R. Cipolla
Alex Kendall
212
261
0
21 Apr 2021
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Aditya Prakash
Kashyap Chitta
Andreas Geiger
ViT
108
531
0
19 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
458
21,439
0
25 Mar 2021
Ground-aware Monocular 3D Object Detection for Autonomous Driving
Ground-aware Monocular 3D Object Detection for Autonomous Driving
Yuxuan Liu
Yuan Yixuan
Ming-Yuan Liu
3DPC
80
139
0
01 Feb 2021
End-to-end Contextual Perception and Prediction with Interaction
  Transformer
End-to-end Contextual Perception and Prediction with Interaction Transformer
Lingyun Luke Li
Binh Yang
Ming Liang
Wenyuan Zeng
Mengye Ren
Sean Segal
R. Urtasun
ViT
73
119
0
13 Aug 2020
VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized
  Representation
VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Jiyang Gao
Chen Sun
Hang Zhao
Yi Shen
Dragomir Anguelov
Congcong Li
Cordelia Schmid
126
814
0
08 May 2020
Lidar for Autonomous Driving: The principles, challenges, and trends for
  automotive lidar and perception systems
Lidar for Autonomous Driving: The principles, challenges, and trends for automotive lidar and perception systems
You Li
J. Ibañez-Guzmán
61
555
0
17 Apr 2020
PiP: Planning-informed Trajectory Prediction for Autonomous Driving
PiP: Planning-informed Trajectory Prediction for Autonomous Driving
Haoran Song
Wenchao Ding
Yuxuan Chen
Shaojie Shen
M. Y. Wang
Qifeng Chen
93
140
0
25 Mar 2020
Monocular Depth Estimation Based On Deep Learning: An Overview
Monocular Depth Estimation Based On Deep Learning: An Overview
Chaoqiang Zhao
Qiyu Sun
Chongzhen Zhang
Yang Tang
Feng Qian
MDE
201
254
0
14 Mar 2020
A Survey of End-to-End Driving: Architectures and Training Methods
A Survey of End-to-End Driving: Architectures and Training Methods
Ardi Tampuu
Maksym Semikin
Naveed Muhammad
D. Fishman
Tambet Matiisen
3DV
67
236
0
13 Mar 2020
3DSSD: Point-based 3D Single Stage Object Detector
3DSSD: Point-based 3D Single Stage Object Detector
Zetong Yang
Yanan Sun
Shu Liu
Jiaya Jia
3DPC
127
944
0
24 Feb 2020
Learning by Cheating
Learning by Cheating
Dian Chen
Brady Zhou
V. Koltun
Philipp Krahenbuhl
SSL
110
515
0
27 Dec 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using
  Implicit Affordances
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
134
208
0
25 Nov 2019
Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints
Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints
Yan Xu
Xinge Zhu
Jianping Shi
Guofeng Zhang
Hujun Bao
Hongsheng Li
3DV
58
226
0
15 Oct 2019
End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point
  Clouds
End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds
Yin Zhou
Pei Sun
Yu Zhang
Dragomir Anguelov
J. Gao
Tom Y. Ouyang
James Guo
Jiquan Ngiam
Vijay Vasudevan
3DPC
83
351
0
15 Oct 2019
CenterNet: Keypoint Triplets for Object Detection
CenterNet: Keypoint Triplets for Object Detection
Kaiwen Duan
S. Bai
Lingxi Xie
H. Qi
Qingming Huang
Q. Tian
NoLa
119
2,698
0
17 Apr 2019
LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous
  Driving
LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving
Gregory P. Meyer
A. Laddha
E. Kee
Carlos Vallespi-Gonzalez
Carl K. Wellington
3DPC
76
338
0
20 Mar 2019
Learning to Drive in a Day
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
102
657
0
01 Jul 2018
YOLOv3: An Incremental Improvement
YOLOv3: An Incremental Improvement
Joseph Redmon
Ali Farhadi
ObjD
126
21,470
0
08 Apr 2018
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Yin Zhou
Oncel Tuzel
3DPC
112
3,731
0
17 Nov 2017
End-to-end Driving via Conditional Imitation Learning
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
131
1,066
0
06 Oct 2017
Multi-View 3D Object Detection Network for Autonomous Driving
Multi-View 3D Object Detection Network for Autonomous Driving
Xiaozhi Chen
Huimin Ma
Ji Wan
Bo Li
Tian Xia
3DPC
192
2,780
0
23 Nov 2016
End to End Learning for Self-Driving Cars
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
100
4,175
0
25 Apr 2016
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence
  Modeling
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
593
12,734
0
11 Dec 2014
1