Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.01256
Cited By
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
2 June 2022
Yingfei Liu
Junjie Yan
Fan Jia
Shuailin Li
Q. Gao
Tiancai Wang
X. Zhang
Jian-jun Sun
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images"
50 / 88 papers shown
Title
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
49
0
0
13 May 2025
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Zhimin Liao
Ping Wei
Shuaijia Chen
Haoxuan Wang
Ziyang Ren
100
0
0
28 Apr 2025
Towards Latency-Aware 3D Streaming Perception for Autonomous Driving
Jiaqi Peng
Tai Wang
Jiangmiao Pang
Yuan Shen
44
0
0
27 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
42
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Tao Luo
Xin Zhan
Junbo Chen
3DPC
44
0
0
17 Apr 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
71
0
0
18 Mar 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Lizhen Xu
Xiuxiu Bai
Xiaojun Jia
Jianwu Fang
Shanmin Pang
61
0
0
13 Mar 2025
HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking
Jing Yang
Sen Yang
Xiao Tan
Hanli Wang
53
1
0
13 Mar 2025
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
Weiyi Xiong
Zean Zou
Qiuchi Zhao
Fengchun He
Bing Zhu
66
1
0
21 Feb 2025
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Goodarz Mehr
A. Eskandarian
63
1
0
04 Feb 2025
SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera
Yuhang He
Sangyun Shin
Anoop Cherian
Niki Trigoni
Andrew Markham
73
0
0
31 Dec 2024
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Houqiang Li
Yanyong Zhang
170
0
0
17 Dec 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
84
0
0
25 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
45
0
0
16 Nov 2024
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu
Tai Wang
Wenwei Zhang
Jiangmiao Pang
Xihui Liu
126
29
0
26 Sep 2024
RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Jinrang Jia
Guangqi Yi
Yifeng Shi
34
0
0
18 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
30
9
0
14 Sep 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Yao Li
Yanyong Zhang
43
3
0
20 Jul 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
42
0
0
12 Jun 2024
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking
Lijun Zhou
Tao Tang
Pengkun Hao
Zihang He
Kalok Ho
...
Zhihui Hao
Haiyang Sun
Kun Zhan
Peng Jia
Xianpeng Lang
VOT
58
4
0
04 Jun 2024
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
Shaoyuan Xie
Lingdong Kong
Wenwei Zhang
Jiawei Ren
Liang Pan
Kai-xiang Chen
Ziwei Liu
AAML
55
9
0
27 May 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
48
1
0
17 May 2024
CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications
J. Blumenkamp
Steven D. Morad
Jennifer Gielis
Amanda Prorok
28
4
0
02 May 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
32
7
0
22 Apr 2024
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Hao Lu
Jiaqi Tang
Xinli Xu
Xu Cao
Yunpeng Zhang
Guoqing Wang
Dalong Du
Hao Chen
Ying Chen
35
3
0
10 Apr 2024
Better Monocular 3D Detectors with LiDAR from the Past
Yurong You
Cheng Perng Phoo
Carlos Diaz-Ruiz
Katie Z Luo
Wei-Lun Chao
Mark E. Campbell
B. Hariharan
Kilian Q. Weinberger
3DPC
33
1
0
08 Apr 2024
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
Hou-I Liu
Christine Wu
Jen-Hao Cheng
Wenhao Chai
Shian-Yun Wang
...
Jenq-Neng Hwang
Hong-Han Shuai
Wen-Huang Cheng
Hong-Han Shuai
Wen-Huang Cheng
42
2
0
07 Apr 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
62
14
0
18 Mar 2024
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Quoc-Vinh Lai-Dang
ViT
33
2
0
12 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
65
12
0
12 Mar 2024
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Rui Song
Chenwei Liang
Hu Cao
Zhiran Yan
Walter Zimmer
Markus Gross
Andreas Festag
Alois C. Knoll
34
21
0
12 Feb 2024
CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
Yifeng Bai
Zhirong Chen
Pengpeng Liang
Erkang Cheng
Erkang Cheng
ViT
33
8
0
09 Feb 2024
Stream Query Denoising for Vectorized HD Map Construction
Shuo Wang
Fan Jia
Yingfei Liu
Yucheng Zhao
Zehui Chen
Tiancai Wang
Chi Zhang
Xiangyu Zhang
Feng Zhao
36
19
0
17 Jan 2024
Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers
James Gunn
Zygmunt Lenyk
Anuj Sharma
Andrea Donati
Alexandru Buburuzan
John Redford
Romain Mueller
MDE
35
8
0
22 Dec 2023
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
31
8
0
19 Dec 2023
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
39
63
0
22 Nov 2023
Multiple View Geometry Transformers for 3D Human Pose Estimation
Ziwei Liao
Jialiang Zhu
Chunyu Wang
Han Hu
Steven L. Waslander
ViT
23
2
0
18 Nov 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
39
14
0
16 Oct 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
Jianbing Shen
VLM
33
73
0
08 Sep 2023
LATR: 3D Lane Detection from Monocular Images with Transformer
Yueru Luo
Chaoda Zheng
Xu Yan
Tang Kun
Chao Zheng
Shuguang Cui
Zhen Li
ViT
33
32
0
08 Aug 2023
HeightFormer: Explicit Height Modeling without Extra Data for Camera-only 3D Object Detection in Bird's Eye View
Yiming Wu
Rui Li
Zequn Qin
Xinhai Zhao
Xi Li
39
11
0
25 Jul 2023
EgoVM: Achieving Precise Ego-Localization using Lightweight Vectorized Maps
Yuzhe He
Shuang Liang
Xiaofei Rui
Chengying Cai
Guowei Wan
24
6
0
18 Jul 2023
BEVScope: Enhancing Self-Supervised Depth Estimation Leveraging Bird's-Eye-View in Dynamic Scenarios
Yucheng Mao
Ruowen Zhao
Tianbao Zhang
Hang Zhao
27
3
0
20 Jun 2023
BEVStereo++: Accurate Depth Estimation in Multi-view 3D Object Detection via Dynamic Temporal Stereo
Yinhao Li
Jinrong Yang
Jian‐Yuan Sun
Han Bao
Zheng Ge
Likai Xiao
24
2
0
09 Apr 2023
Geometric-aware Pretraining for Vision-centric 3D Object Detection
Linyan Huang
Huijie Wang
J. Zeng
Shengchuan Zhang
Liujuan Cao
Junchi Yan
Hongyang Li
3DPC
67
9
0
06 Apr 2023
Training Strategies for Vision Transformers for Object Detection
Apoorv Singh
26
4
0
05 Apr 2023
FedBEVT: Federated Learning Bird's Eye View Perception Transformer in Road Traffic Systems
Rui Song
Runsheng Xu
Andreas Festag
Jiaqi Ma
Alois C. Knoll
FedML
28
25
0
04 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
49
35
0
03 Apr 2023
BEVFusion4D: Learning LiDAR-Camera Fusion Under Bird's-Eye-View via Cross-Modality Guidance and Temporal Aggregation
Hongxiang Cai
Zeyuan Zhang
Zhenyu Zhou
Ziyin Li
Wenbo Ding
Jiu-Yang Zhao
3DPC
21
30
0
30 Mar 2023
3D Video Object Detection with Learnable Object-Centric Global Optimization
Jiawei He
Yuntao Chen
Naiyan Wang
Zhaoxiang Zhang
3DH
3DPC
63
9
0
27 Mar 2023
1
2
Next