PETR: Position Embedding Transformation for Multi-View 3D Object Detection
arXiv:2203.05625 (v3, latest)

10 March 2022
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
    3DPC
ArXiv (abs) | PDF | HTML | GitHub (945★)

Papers citing "PETR: Position Embedding Transformation for Multi-View 3D Object Detection"

50 / 388 papers shown
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
98
6
0
01 Jul 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Lizhen Xu
Xiuxiu Bai
Xiaojun Jia
Jianwu Fang
Shanmin Pang
136
0
0
01 Jul 2025
MonoVQD: Monocular 3D Object Detection with Variational Query Denoising and Self-Distillation
Kiet Dang Vu
T. Tran
Duc Dung Nguyen
31
0
0
14 Jun 2025
GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction
Ke Song
Yunhe Wu
Chunchit Siu
Huiyuan Xiong
39
0
0
13 Jun 2025
3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection
Yi Zhang
Y. X. R. Wang
Yawen Cui
Lap-Pui Chau
3DPC
69
0
0
11 Jun 2025
DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos
R. Yasarla
Shizhong Han
H. Cai
Fatih Porikli
74
0
0
11 Jun 2025
ODG: Occupancy Prediction Using Dual Gaussians
Y. Shi
Yinhao Zhu
Shizhong Han
Jisoo Jeong
Amin Ansari
H. Cai
Fatih Porikli
3DPC
105
0
0
11 Jun 2025
BePo: Leveraging Birds Eye View and Sparse Points for Efficient and Accurate 3D Occupancy Prediction
Yunxiao Shi
Hong Cai
Jisoo Jeong
Yinhao Zhu
Shizhong Han
Amin Ansari
Fatih Porikli
3DPC
33
0
0
08 Jun 2025
S2GO: Streaming Sparse Gaussian Occupancy Prediction
Jinhyung D. Park
Yihan Hu
Chensheng Peng
Wenzhao Zheng
Kris Kitani
Wei Zhan
47
0
0
05 Jun 2025
RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection
Ozsel Kilinc
Cem Tarhan
201
0
0
23 May 2025
InstanceBEV: Unifying Instance and BEV Representation for Global Modeling
Feng Li
Kun Xu
Zhaoyue Wang
Yunduan Cui
Mohammad Masum Billah
Jia Liu
77
0
0
20 May 2025
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
Muleilan Pei
Jiayao Shan
Peiliang Li
Jieqi Shi
Jing Huo
Yang Gao
Shaojie Shen
163
0
0
18 May 2025
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
150
0
0
13 May 2025
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
Zhenxing Ming
J. S. Berrio
Mao Shan
Yaoqi Huang
Hongyu Lyu
Nguyen Hoang Khoi Tran
Tzu-Yun Tseng
Stewart Worrall
3DPC
111
0
0
06 May 2025
Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation
Dimitrios Dagdilelis
Panagiotis Grigoriadis
R. Galeazzi
3DPC
443
0
0
02 May 2025
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Zhimin Liao
Ping Wei
Shuaijia Chen
Haoxuan Wang
Ziyang Ren
363
1
0
28 Apr 2025
Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection
Linhua Kong
Dongxia Chang
Lian Liu
Zisen Kong
Pengyuan Li
Yao Zhao
83
0
0
23 Apr 2025
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction
Yushen He
Lei Zhao
Tianchen Deng
Zipeng Fang
Weidong Chen
82
0
0
18 Apr 2025
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Shumin Wang
Zhuoran Yang
Liwen Wang
ZhiPeng Tang
Heng Li
Lehan Pan
Sha Zhang
Jie Peng
Jianmin Ji
Y. Zhang
3DPC
110
0
0
17 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
271
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Zhan Shi
Tao Luo
Xin Zhan
Junbo Chen
3DPC
95
0
0
17 Apr 2025
A Modular Energy Aware Framework for Multicopter Modeling in Control and Planning Applications
Sebastian Gasche
Christian Kallies
Andreas Himmel
R. Findeisen
136
1
0
04 Apr 2025
ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving
Sheng Yang
Tong Zhan
Shichen Qiao
Jicheng Gong
Qing Yang
Jian Wang
Yanfeng Lu
3DPC
123
0
0
04 Apr 2025
Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting
Shu-Wei Lu
Yi-Hsuan Tsai
Yi-Ting Chen
124
2
0
02 Apr 2025
GLane3D : Detecting Lanes with Graph of 3D Keypoints
H. Öztürk
M. E. Kalfaoglu
Ozsel Kilinc
3DPC
75
1
0
31 Mar 2025
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving
Fuhao Li
Huan Jin
Bin-Bin Gao
Liaoyuan Fan
Lihui Jiang
Long Zeng
147
2
0
28 Mar 2025
Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving
Chaofan Wu
Junlong Li
Jinghao Cao
Ming Li
Yongkang Feng
Jinfeng Xu
Zihang Gao
S. Du
Yang Li
MDE
114
0
0
26 Mar 2025
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Ahyun Seo
Minsu Cho
139
0
0
26 Mar 2025
Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
Konyul Park
Yecheol Kim
Daehun Kim
Jun-Won Choi
132
0
0
25 Mar 2025
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation
Haoyu Fu
Diankun Zhang
Zongchuang Zhao
Jianfeng Cui
Dingkang Liang
Chong Zhang
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
120
6
0
25 Mar 2025
Is Discretization Fusion All You Need for Collaborative Perception?
Kang Yang
Tianci Bu
L. Li
Chunxu Li
Yanjie Wang
Deying Li
147
0
0
18 Mar 2025
DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving
Tao Wang
Cong Zhang
Xingguang Qu
Kun Li
Wen Liu
Chenyu Huang
121
1
0
15 Mar 2025
CoCMT: Communication-Efficient Cross-Modal Transformer for Collaborative Perception
Rujia Wang
Xiangbo Gao
Hao Xiang
Runsheng Xu
Zhengzhong Tu
92
3
0
13 Mar 2025
HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking
Jing Yang
Sen Yang
Xiao Tan
Hanli Wang
93
1
0
13 Mar 2025
TrackOcc: Camera-based 4D Panoptic Occupancy Tracking
Zhuoguang Chen
Kenan Li
Xiuyu Yang
Tao Jiang
Yongqian Li
Hang Zhao
127
0
0
11 Mar 2025
CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving
Ziliang Xiong
Shipeng Liu
Nathaniel Helgesen
Joakim Johnander
Per-Erik Forssén
156
0
0
10 Mar 2025
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera
Dong-Hee Paek
Seung-Hyun Kong
146
1
0
10 Mar 2025
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia
Junqi You
Zhiyuan Zhang
Junchi Yan
110
14
0
07 Mar 2025
SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View Segmentation
Zijie Zhou
Zhangshuo Qi
Luqi Cheng
Guangming Xiong
109
1
0
27 Feb 2025
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras
Hoonhee Cho
Jae-Young Kang
Y. Kim
Kuk-Jin Yoon
3DPC
165
1
0
26 Feb 2025
Glad: A Streaming Scene Generator for Autonomous Driving
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
Xinming Zhang
3DGS
VGen
124
3
0
26 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
Lefei Zhang
Philip H. S. Torr
171
4
0
24 Feb 2025
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Jianing Li
Ming Lu
Hao Wang
Chenyang Gu
Wenzhao Zheng
Li Du
Shanghang Zhang
202
0
0
28 Jan 2025
CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Yongqian Li
Yang Yang
Zhen Lei
3DPC
126
2
0
11 Jan 2025
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Yan Lu
Xinzhu Ma
Lei Yang
Tianzhu Zhang
Yating Liu
Qi Chu
Tong He
Yonghui Li
W. Ouyang
145
4
0
08 Jan 2025
SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera
Yuhang He
Sangyun Shin
Anoop Cherian
Niki Trigoni
Andrew Markham
129
0
0
31 Dec 2024
TopView: Vectorising road users in a bird's eye view from uncalibrated street-level imagery with deep learning
Mohamed R Ibrahim
158
1
0
18 Dec 2024
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Xiaomeng Chu
Jiajun Deng
Guoliang You
YiFan Duan
Houqiang Li
Yanyong Zhang
540
0
0
17 Dec 2024
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
Dongxu Wei
Zhiqi Li
Peidong Liu
215
2
0
09 Dec 2024
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Lizhen Xu
Shanmin Pang
Wenzhao Qiu
Zehao Wu
Xiuxiu Bai
K. Mei
Jianru Xue
174
2
0
03 Dec 2024