ResearchTrend.AI

arXiv: 2104.09224
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving


19 April 2021
Aditya Prakash
Kashyap Chitta
Andreas Geiger
    ViT

Papers citing "Multi-Modal Fusion Transformer for End-to-End Autonomous Driving"

50 / 256 papers shown
PaaS: Planning as a Service for reactive driving in CARLA Leaderboard
Nhat Hao Truong
Huu Thien Mai
T. Anh
M. Tran
Duc Duy Nguyen
Ngoc Viet Phuong Pham
26
2
0
17 Apr 2023
β-Variational autoencoders and transformers for reduced-order modelling of fluid flows
Alberto Solera-Rico
Carlos Sanmiguel Vila
Miguel Gómez-López
Yuning Wang
Abdulrahman Almashjary
Scott T. M. Dawson
Ricardo Vinuesa
DRL
16
74
0
07 Apr 2023
Integrated Behavior Planning and Motion Control for Autonomous Vehicles with Traffic Rules Compliance
Haichao Liu
Kai Chen
Yuling Li
Zhen Huang
Jianghua Duan
Jun Ma
27
11
0
03 Apr 2023
VAD: Vectorized Scene Representation for Efficient Autonomous Driving
Bo Jiang
Shaoyu Chen
Qing Xu
Bencheng Liao
Jiajie Chen
Helong Zhou
Qian Zhang
Wenyu Liu
Chang Huang
Xinggang Wang
110
194
0
21 Mar 2023
Penalty-Based Imitation Learning With Cross Semantics Generation Sensor Fusion for Autonomous Driving
Hongkuan Zhou
Aifen Sui
Letian Shi
Yinxian Li
27
2
0
21 Mar 2023
V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative Perception
Runsheng Xu
Xin Xia
Jinlong Li
Hanzhao Li
Shuo Zhang
...
Xiaoyu Dong
Rui Song
Hongkai Yu
Bolei Zhou
Jiaqi Ma
59
148
0
14 Mar 2023
Towards Driving Policies with Personality: Modeling Behavior and Style in Risky Scenarios via Data Collection in Virtual Reality
L. Zheng
Julio Poveda
James Mullen
Shreelekha Revankar
Ming-Chyuan Lin
15
1
0
08 Mar 2023
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion
Xin Li
Tengyu Ma
Yuenan Hou
Botian Shi
Yucheng Yang
...
Xingjiao Wu
Qingsheng Chen
Yikang Li
Yu Qiao
Liangbo He
3DPC
40
83
0
07 Mar 2023
Delivering Arbitrary-Modal Semantic Segmentation
Jiaming Zhang
R. Liu
Haowen Shi
Kailun Yang
Simon Reiß
Kunyu Peng
Haodong Fu
Kaiwei Wang
Rainer Stiefelhagen
VLM
51
88
0
02 Mar 2023
From Prediction to Planning With Goal Conditioned Lane Graph Traversals
Marcel Hallgarten
Martin Stoll
A. Zell
AI4CE
28
31
0
15 Feb 2023
DualStreamFoveaNet: A Dual Stream Fusion Architecture with Anatomical Awareness for Robust Fovea Localization
Sifan Song
Jinfeng Wang
Zilong Wang
Jionglong Su
Xiucai Ding
K. Dang
MedIm
27
0
0
14 Feb 2023
SwinCross: Cross-modal Swin Transformer for Head-and-Neck Tumor Segmentation in PET/CT Images
Gary Y. Li
Junyu Chen
Se-In Jang
Kuang Gong
Quanzheng Li
ViT
MedIm
46
14
0
08 Feb 2023
Scaling Vision-based End-to-End Driving with Multi-View Attention Learning
Yi Xiao
Felipe Codevilla
Diego Porres
Antonio M. López
VLM
16
4
0
07 Feb 2023
Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling
Peng Wu
Li Chen
Hongyang Li
Xiaosong Jia
Junchi Yan
Yu Qiao
92
28
0
03 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
27
25
0
29 Dec 2022
AVstack: An Open-Source, Reconfigurable Platform for Autonomous Vehicle Development
R. S. Hallyburton
Shucheng Zhang
Miroslav Pajic
14
10
0
28 Dec 2022
Planning-oriented Autonomous Driving
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
52
589
0
20 Dec 2022
SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance
Chenyang Zhang
Zhiqiang Lou
Yan Di
F. Tombari
Xiangyang Ji
27
6
0
13 Dec 2022
Weakly Supervised 3D Multi-person Pose Estimation for Large-scale Scenes based on Monocular Camera and Single LiDAR
Peishan Cong
Yiteng Xu
Yiming Ren
Juze Zhang
Lan Xu
Jingya Wang
Jingyi Yu
Yuexin Ma
3DH
27
27
0
30 Nov 2022
A Transformer Framework for Data Fusion and Multi-Task Learning in Smart Cities
Alexander C. DeRieux
Walid Saad
W. Zuo
R. Budiarto
M. D. Koerniawan
D. Novitasari
20
1
0
18 Nov 2022
LAPTNet: LiDAR-Aided Perspective Transform Network
M. Diaz-Zapata
Özgür Erkent
Christian Laugier
J. Dibangoye
D. S. González
3DPC
21
0
0
14 Nov 2022
RCDPT: Radar-Camera fusion Dense Prediction Transformer
Chen-Chou Lo
P. Vandewalle
ViT
MDE
30
13
0
04 Nov 2022
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving
Resul Dagdanov
Feyza Eksen
Halil Durmus
Ferhat Yurdakul
N. K. Üre
6
3
0
29 Oct 2022
Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems
Fitash Ul Haq
Donghwan Shin
Lionel C. Briand
OffRL
44
39
0
27 Oct 2022
PlanT: Explainable Planning Transformers via Object-Level Representations
Katrin Renz
Kashyap Chitta
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
Andreas Geiger
ViT
36
94
0
25 Oct 2022
Scratching Visual Transformer's Back with Uniform Attention
Nam Hyeon-Woo
Kim Yu-Ji
Byeongho Heo
Dongyoon Han
Seong Joon Oh
Tae-Hyun Oh
364
23
0
16 Oct 2022
Model-Based Imitation Learning for Urban Driving
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
112
135
0
14 Oct 2022
Exploring Contextual Representation and Multi-Modality for End-to-End Autonomous Driving
Shoaib Azam
Farzeen Munir
Ville Kyrki
M. Jeon
Witold Pedrycz
56
1
0
13 Oct 2022
Traffic-Aware Autonomous Driving with Differentiable Traffic Simulation
L. Zheng
Sanghyun Son
Ming-Chyuan Lin
35
3
0
07 Oct 2022
Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search
Thanh Vu
Yan-Quan Zhou
Chun-Yung Wen
Yueqi Li
Jan-Michael Frahm
39
4
0
04 Oct 2022
Husformer: A Multi-Modal Transformer for Multi-Modal Human State Recognition
Ruiqi Wang
Wonse Jo
Dezhong Zhao
Weizheng Wang
B. Yang
Guohua Chen
Byung-Cheol Min
HAI
26
28
0
30 Sep 2022
Physical Adversarial Attack meets Computer Vision: A Decade Survey
Hui Wei
Hao Tang
Xuemei Jia
Zhixiang Wang
Han-Bing Yu
Zhubo Li
Shin'ichi Satoh
Luc Van Gool
Zheng Wang
AAML
29
43
0
30 Sep 2022
InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Mu Yuan
Lan Zhang
Fengxiang He
Xueting Tong
Miao-Hui Song
Zhengyuan Xu
Xiang-Yang Li
32
2
0
28 Sep 2022
From One to Many: Dynamic Cross Attention Networks for LiDAR and Camera Fusion
Rui Wan
Shuangjie Xu
Wei Wu
Xiaoyi Zou
Tongyi Cao
3DPC
20
4
0
25 Sep 2022
Multimodal Channel-Mixing: Channel and Spatial Masked AutoEncoder on Facial Action Unit Detection
Xiang Zhang
Huiyuan Yang
Taoyue Wang
Xiaotian Li
L. Yin
19
7
0
25 Sep 2022
Graph Reasoning Transformer for Image Parsing
Dong Zhang
Jinhui Tang
Kwang-Ting Cheng
ViT
24
16
0
20 Sep 2022
FFPA-Net: Efficient Feature Fusion with Projection Awareness for 3D Object Detection
Chaokang Jiang
Guangming Wang
Jinxing Wu
Yanzi Miao
Hesheng Wang
3DPC
29
5
0
15 Sep 2022
EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing
Qihua Feng
Peiya Li
Zhixun Lu
Chaozhuo Li
Zefang Wang
Zhiquan Liu
Chunhui Duan
Feiran Huang
ViT
29
10
0
31 Aug 2022
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous Driving
Haochen Liu
Zhiyu Huang
Xiaoyu Mo
Chen Lv
ViT
OffRL
33
33
0
24 Aug 2022
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Hao Shao
Letian Wang
Ruobing Chen
Hongsheng Li
Y. Liu
47
195
0
28 Jul 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
25
23
0
21 Jul 2022
DeepIPC: Deeply Integrated Perception and Control for an Autonomous Vehicle in Real Environments
Oskar Natan
J. Miura
32
1
0
20 Jul 2022
ANTI-CARLA: An Adversarial Testing Framework for Autonomous Vehicles in CARLA
Shreyas Ramakrishna
Baiting Luo
Christopher B. Kuhn
G. Karsai
Abhishek Dubey
AAML
28
17
0
19 Jul 2022
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
Shengchao Hu
Li Chen
Peng Wu
Hongyang Li
Junchi Yan
Dacheng Tao
31
226
0
15 Jul 2022
MMFN: Multi-Modal-Fusion-Net for End-to-End Driving
Qingwen Zhang
Mingkai Tang
R. Geng
Feiyi Chen
Ren Xin
Lujia Wang
35
34
0
01 Jul 2022
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Tim Broedermann
Christos Sakaridis
Dengxin Dai
Luc Van Gool
52
31
0
30 Jun 2022
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming
Chuan Wen
Jianing Qian
Jierui Lin
Jiaye Teng
Dinesh Jayaraman
Yang Gao
AAML
26
17
0
22 Jun 2022
Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot
Li Chen
Tutian Tang
Zhitian Cai
Yang Li
Peng Wu
Hongyang Li
Jianping Shi
Junchi Yan
Yu Qiao
VLM
36
13
0
16 Jun 2022
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline
Peng Wu
Xiaosong Jia
Li Chen
Junchi Yan
Hongyang Li
Yu Qiao
32
182
0
16 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
66
527
0
13 Jun 2022