Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.13542
Cited By
v1
v2 (latest)
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
26 May 2022
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
Re-assign community
ArXiv (abs)
PDF
HTML
Github (2624★)
Papers citing
"BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation"
50 / 533 papers shown
Title
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
87
6
0
01 Jul 2025
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
Jinhua Zhang
Hualian Sheng
Sijia Cai
Bing Deng
Qiao Liang
Wen Li
Ying Fu
Jieping Ye
Shuhang Gu
DiffM
83
2
0
01 Jul 2025
ParkFormer: A Transformer-Based Parking Policy with Goal Embedding and Pedestrian-Aware Control
Jun Fu
Bin Tian
Haonan Chen
Shi Meng
Tingting Yao
ViT
24
0
0
20 Jun 2025
X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
Yu Yang
Alan Liang
Jianbiao Mei
Yukai Ma
Yong-Jin Liu
Gim Hee Lee
VGen
32
0
0
16 Jun 2025
GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction
Ke Song
Yunhe Wu
Chunchit Siu
Huiyuan Xiong
30
0
0
13 Jun 2025
DINO-CoDT: Multi-class Collaborative Detection and Tracking with Vision Foundation Models
Xunjie He
Christina Dao Wen Lee
Meiling Wang
Chengran Yuan
Zefan Huang
Yufeng Yue
Marcelo H. Ang Jr
31
0
0
09 Jun 2025
BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations
Weiduo Yuan
Jerry Li
Justin Yue
Divyank Shah
Konstantinos Karydis
Hang Qiu
67
0
0
03 Jun 2025
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation
Yichen Xie
Runsheng Xu
Tong He
Jyh-Jing Hwang
Katie Luo
...
Letian Chen
Yiren Lu
Zhaoqi Leng
Dragomir Anguelov
Mingxing Tan
VLM
LRM
46
0
0
30 May 2025
RefAV: Towards Planning-Centric Scenario Mining
Cainan Davidson
Deva Ramanan
Neehar Peri
89
2
0
27 May 2025
Echo Planning for Autonomous Driving: From Current Observations to Future Trajectories and Back
Jintao Sun
Hu Zhang
Gangyi Ding
Zhedong Zheng
62
0
0
25 May 2025
Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection
Shichao Li
Peiliang Li
Qing Lian
Peng Yun
Xiaozhi Chen
3DPC
3DV
62
0
0
21 May 2025
InstanceBEV: Unifying Instance and BEV Representation for Global Modeling
Feng Li
Kun Xu
Zhaoyue Wang
Yunduan Cui
Mohammad Masum Billah
Jia Liu
68
0
0
20 May 2025
Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving
Jingzheng Li
Tiancheng Wang
Xingyu Peng
Jiasi Chen
Zhijun Chen
Bing Li
Xianglong Liu
ELM
78
0
0
20 May 2025
MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection
Xiangyuan Peng
Yu Wang
Miao Tang
Bierzynski Kay
Lorenzo Servadei
Robert Wille
3DPC
72
0
0
14 May 2025
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
Mingqian Ji
Jian Yang
Shanshan Zhang
3DPC
MDE
79
0
0
12 May 2025
BETTY Dataset: A Multi-modal Dataset for Full-Stack Autonomy
Micah Nye
Ayoub Raji
Andrew Saba
Eidan Erlich
Robert Exley
...
Ritesh Misra
Matthew Sivaprakasam
Marko Bertogna
Deva Ramanan
Sebastian A. Scherer
131
0
0
12 May 2025
RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation
Zhiwen Zeng
Yunfei Yin
Zheng Yuan
Argho Dey
Xianjian Bao
116
0
0
10 May 2025
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding
Henry Zheng
Hao Shi
Qihang Peng
Yong Xien Chng
Rui Huang
Yepeng Weng
Zhongchao Shi
Gao Huang
114
2
0
08 May 2025
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
Zhenxing Ming
J. S. Berrio
Mao Shan
Yaoqi Huang
Hongyu Lyu
Nguyen Hoang Khoi Tran
Tzu-Yun Tseng
Stewart Worrall
3DPC
106
0
0
06 May 2025
Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation
Hubert Padusinski
Christian Steinhauser
Christian Scherl
Julian Gaal
Jacob Langner
3DPC
99
0
0
05 May 2025
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion
Haoteng Li
Zhao Yang
Zezhong Qian
Gongpeng Zhao
Yuqi Huang
Jun-chen Yu
Huazheng Zhou
Longjun Liu
272
2
0
03 May 2025
Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation
Dimitrios Dagdilelis
Panagiotis Grigoriadis
R. Galeazzi
3DPC
441
0
0
02 May 2025
Is Intermediate Fusion All You Need for UAV-based Collaborative Perception?
Jiuwu Hao
Liguo Sun
Yuting Wan
Yueyang Wu
Ti Xiang
Haolin Song
Pin Lv
433
0
0
30 Apr 2025
DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer
Junpeng Jiang
Gangyi Hong
Miao Zhang
Hengtong Hu
Kun Zhan
Rui Shao
Liqiang Nie
VGen
92
3
0
28 Apr 2025
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
Carlo Sgaravatti
Roberto Basla
Riccardo Pieroni
Matteo Corno
S. Savaresi
Luca Magri
Giacomo Boracchi
3DPC
98
0
0
25 Apr 2025
A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes
Nicolas Münger
M. Ronecker
Xavier Diaz
Michael Karner
Daniel Watzenig
Jan Skaloud
3DPC
490
0
0
25 Apr 2025
S3MOT: Monocular 3D Object Tracking with Selective State Space Model
Zhuohao Yan
Shaoquan Feng
Xingxing Li
Yuxuan Zhou
Chunxi Xia
Shengyu Li
VOT
136
0
0
25 Apr 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
Haotian Dong
Xinze Wang
Dahua Lin
Yipeng Wu
Qin Chen
R. Liu
Kairui Yang
Ping Li
Qing Guo
VGen
81
0
0
25 Apr 2025
SignX: The Foundation Model for Sign Recognition
Sen Fang
Chunyu Sui
Hongwei Yi
C. Neidle
Dimitris N. Metaxas
SLR
80
0
0
22 Apr 2025
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction
Yushen He
Lei Zhao
Tianchen Deng
Zipeng Fang
Weidong Chen
69
0
0
18 Apr 2025
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Shumin Wang
Zhuoran Yang
Liwen Wang
ZhiPeng Tang
Heng Li
Lehan Pan
Sha Zhang
Jie Peng
Jianmin Ji
Y. Zhang
3DPC
102
0
0
17 Apr 2025
E2E Parking Dataset: An Open Benchmark for End-to-End Autonomous Parking
Kejia Gao
Liguo Zhou
Mingjun Liu
Alois C. Knoll
66
0
0
15 Apr 2025
FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird's Eye View
Yuting Zhao
Yuheng Ji
Xiaoshuai Hao
Shuxiao Li
71
0
0
13 Apr 2025
InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement
Zhaoliang Zheng
Yize Zhang
Zongling Meng
Johnson Liu
Xin Xia
Jiaqi Ma
106
0
0
11 Apr 2025
Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection
Zhenxing Ming
J. S. Berrio
Mao Shan
Stewart Worrall
3DPC
95
3
0
07 Apr 2025
SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection
Bonan Ding
J. Xie
Jing Nie
Jiale Cao
107
0
0
07 Apr 2025
ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving
Sheng Yang
Tong Zhan
Shichen Qiao
Jicheng Gong
Qing Yang
Jian Wang
Yanfeng Lu
3DPC
123
0
0
04 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
126
4
0
03 Apr 2025
MinkOcc: Towards real-time label-efficient semantic occupancy prediction
Samuel Sze
Daniele De Martini
Lars Kunze
3DPC
106
0
0
03 Apr 2025
Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting
Shu-Wei Lu
Yi-Hsuan Tsai
Yi-Ting Chen
117
2
0
02 Apr 2025
Cal or No Cal? -- Real-Time Miscalibration Detection of LiDAR and Camera Sensors
Ilir Tahiraj
Jeremialie Swadiryus
F. Fent
Markus Lienkamp
82
0
0
31 Mar 2025
A Benchmark for Vision-Centric HD Mapping by V2I Systems
Miao Fan
Shanshan Yu
Shengtong Xu
Kun Jiang
Haoyi Xiong
Xiangzeng Liu
3DV
94
0
0
31 Mar 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
94
0
0
27 Mar 2025
Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
Konyul Park
Yecheol Kim
Daehun Kim
Jun-Won Choi
112
0
0
25 Mar 2025
GAA-TSO: Geometry-Aware Assisted Depth Completion for Transparent and Specular Objects
Yuhang Liu
Tong Jia
Da Cai
Hao Wang
Dongyue Chen
101
1
0
21 Mar 2025
FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene
Lili Yang
Mengshuai Chang
Xiao Guo
Yuxin Feng
Yiwen Mei
Caicong Wu
3DPC
111
0
0
18 Mar 2025
Efficient Multimodal 3D Object Detector via Instance-Level Contrastive Distillation
Zhuoqun Su
Huimin Lu
Shuaifeng Jiao
Junhao Xiao
Yanjie Wang
Xieyuanli Chen
3DPC
108
0
0
17 Mar 2025
Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation
Xianming Zeng
Sicong Du
Qifeng Chen
Lizhe Liu
Haoyu Shu
...
Peng Chen
Yapeng Xue
Chunming Zhao
Sheng Yang
Qiang Li
3DGS
106
0
0
14 Mar 2025
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Hongbin Lin
Zilu Guo
Yiming Zhang
Shuaicheng Niu
Yafeng Li
Ruiyi Zhang
Shuguang Cui
Zhen Li
DiffM
82
1
0
14 Mar 2025
Active Learning from Scene Embeddings for End-to-End Autonomous Driving
Wenhao Jiang
Duo Li
Menghan Hu
Chao Ma
Ke Wang
Zhipeng Zhang
129
0
0
14 Mar 2025
1
2
3
4
...
9
10
11
Next