Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.05625
Cited By
v1
v2
v3 (latest)
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
10 March 2022
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
3DPC
Re-assign community
ArXiv (abs)
PDF
HTML
Github (945★)
Papers citing
"PETR: Position Embedding Transformation for Multi-View 3D Object Detection"
50 / 388 papers shown
Title
Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation
Christian Witte
Jens Behley
Cyrill Stachniss
Marvin Raaijmakers
144
0
0
02 Dec 2024
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Felix Fent
Gerhard Rigoll
137
0
0
29 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
Jinqiao Wang
202
1
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
174
1
0
25 Nov 2024
A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data
Kavin Chandrasekaran
Sorin Grigorescu
Gijs Dubbelman
P. Jancura
146
0
0
20 Nov 2024
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving
Shaoqing Xu
Fang Li
Shengyin Jiang
Ziying Song
Li Liu
Zhi-xin Yang
3DGS
SSL
153
2
0
19 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
162
0
0
16 Nov 2024
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners
Niklas Hanselmann
Simon Doll
Marius Cordts
Hendrik P. A. Lensch
Andreas Geiger
125
0
0
12 Nov 2024
S
E
(
3
)
SE(3)
SE
(
3
)
Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation
Yinshuang Xu
Dian Chen
Katherine Liu
Sergey Zakharov
Rares Andrei Ambrus
Kostas Daniilidis
Vitor Campagnolo Guizilini
MDE
79
1
0
11 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
81
0
0
09 Nov 2024
OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
153
4
0
06 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
Masayoshi Tomizuka
Weidong Zhan
DiffM
96
3
0
02 Nov 2024
GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object Detection
Xiaotian Li
Baojie Fan
Jiandong Tian
Huijie Fan
3DPC
134
9
0
01 Nov 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
Nikita Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCV
EDL
3DPC
101
2
0
31 Oct 2024
S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving
Maciej K. Wozniak
Hariprasath Govindarajan
Marvin Klingner
Camille Maurice
B Ravi Kiran
S. Yogamani
3DPC
165
1
0
30 Oct 2024
Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
Gyusam Chang
Jiwon Lee
Donghyun Kim
Jinkyu Kim
Dongwook Lee
Daehyun Ji
Sujin Jang
Sangpil Kim
127
1
0
29 Oct 2024
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
M. Hosseinzadeh
Ian Reid
77
1
0
28 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
151
1
0
17 Oct 2024
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
Minkyoung Cho
Yulong Cao
Jiachen Sun
Qingzhao Zhang
Marco Pavone
Jeong Joon Park
Heng Yang
Z. Morley Mao
84
2
0
16 Oct 2024
MambaBEV: An efficient 3D detection model with Mamba2
Zihan You
Hao Wang
Qichao Zhao
Jinxiang Wang
Jinxiang Wang
Mamba
135
4
0
16 Oct 2024
UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles
Hui Ye
Rajshekhar Sunderraman
Shihao Ji
62
3
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
105
0
0
14 Oct 2024
ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Detection
Jiwei Chen
Laiyan Ding
Chi Zhang
Feifei Li
73
0
0
14 Oct 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Songming Liu
Lingxuan Wu
Bangguo Li
Hengkai Tan
Huayu Chen
Zhengyi Wang
Ke Xu
Hang Su
Jun Zhu
159
126
0
10 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
148
4
0
09 Oct 2024
QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
66
0
0
09 Oct 2024
Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
68
0
0
09 Oct 2024
Cross-Camera Data Association via GNN for Supervised Graph Clustering
Đorđe Nedeljković
49
0
0
01 Oct 2024
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu
Tai Wang
Wenwei Zhang
Jiangmiao Pang
Xihui Liu
270
52
0
26 Sep 2024
Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
Sandeep Khanna
Chiranjoy Chattopadhyay
Suman Kundu
53
3
0
20 Sep 2024
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework
Xiaoyu Li
Peidong Li
Lijun Zhao
Dedong Liu
Jinghan Gao
Xian Wu
Yitao Wu
Dixiao Cui
VOT
126
1
0
18 Sep 2024
RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Jinrang Jia
Guangqi Yi
Yifeng Shi
72
0
0
18 Sep 2024
DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving
Haisheng Su
Wei Wu
Junchi Yan
75
0
0
15 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
89
15
0
14 Sep 2024
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception
Lei He
Qiaoyi Wang
Honglin Sun
Qing Xu
Bolin Gao
Shengbo Eben Li
Jianqiang Wang
Keqiang Li
115
0
0
09 Sep 2024
Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping
Shuang Zeng
Xinyuan Chang
Xinran Liu
Zheng Pan
Xing Wei
129
3
0
09 Sep 2024
RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Zhiwei Lin
Zhe Liu
Yongtao Wang
Le Zhang
Ce Zhu
125
4
0
08 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen
DiffM
179
6
0
06 Sep 2024
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Dingyuan Zhang
Dingkang Liang
Zichang Tan
Xiaoqing Ye
Cheng Zhang
Jingdong Wang
Xiang Bai
ViT
113
2
0
01 Sep 2024
Enhancing Vectorized Map Perception with Historical Rasterized Maps
Xiaoyu Zhang
Guangwei Liu
Zihao Liu
Ningyi Xu
Yunhui Liu
Ji Zhao
90
9
0
01 Sep 2024
RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning
Sha Lu
Xuecheng Xu
Yuxuan Wu
Haojian Lu
Xieyuanli Chen
R. Xiong
Yue Wang
99
3
0
30 Aug 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
71
3
0
29 Aug 2024
AdaOcc: Adaptive-Resolution Occupancy Prediction
Chao-Yeh Chen
Ruoyu Wang
Yuliang Guo
Cheng Zhao
Xinyu Huang
Chen Feng
Liu Ren
87
0
0
24 Aug 2024
Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection
Tamás Matuszka
Peter Hajas
Dávid Szeghy
82
0
0
22 Aug 2024
HeightLane: BEV Heightmap guided 3D Lane Detection
Chaesong Park
Eunbin Seo
Jongwoo Lim
189
4
0
15 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
126
6
0
12 Aug 2024
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning
Changze Li
Ziheng Ji
Zhe Chen
Tong Qin
Ming Yang
90
7
0
04 Aug 2024
CAMAv2: A Vision-Centric Approach for Static Map Element Annotation
Shiyuan Chen
Jiaxin Zhang
Ruohong Mei
Yingfeng Cai
Haoran Yin
Tao Chen
Wei Sui
Cong Yang
90
0
0
31 Jul 2024
CardioSyntax: end-to-end SYNTAX score prediction -- dataset, benchmark and method
Alexander Ponomarchuk
Ivan Kruzhilov
Galina Zubkova
Artem Shadrin
Ruslan Utegenov
Ivan Bessonov
Pavel Blinov
125
0
0
29 Jul 2024
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha
Minseok Joo
Jihwan Park
Sanghyeok Lee
In-Ho Kim
Hyunwoo J. Kim
122
2
0
27 Jul 2024
Previous
1
2
3
4
5
6
7
8
Next