PETR: Position Embedding Transformation for Multi-View 3D Object Detection

10 March 2022
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
    3DPC
arXiv:2203.05625 (abs) | PDF | HTML | Github (945★)

Papers citing "PETR: Position Embedding Transformation for Multi-View 3D Object Detection"

50 / 388 papers shown
Towards Efficient 3D Object Detection in Bird's-Eye-View Space for Autonomous Driving: A Convolutional-Only Approach
Yuxin Li
Qiang Han
Mengying Yu
Yuxin Jiang
Chai Kiat Yeo
Yiheng Li
Zihang Huang
Nini Liu
Hsuanhan Chen
Xiaojun Wu
50
3
0
01 Dec 2023
Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion
Huadong Li
Minhao Jing
Jiajun Liang
Haoqiang Fan
Renhe Ji
106
6
0
01 Dec 2023
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications
Junyi Ma
Xieyuanli Chen
Jiawei Huang
Jingyi Xu
Zhen Luo
Jintao Xu
Weihao Gu
Rui Ai
Hesheng Wang
81
29
0
29 Nov 2023
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Yuqing Wen
Yucheng Zhao
Yingfei Liu
Fan Jia
Yanhui Wang
Chong Luo
Chi Zhang
Tiancai Wang
Xiaoyan Sun
Xiangyu Zhang
164
72
0
28 Nov 2023
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
124
70
0
22 Nov 2023
Sparse4D v3: Advancing End-to-End 3D Detection and Tracking
Xuewu Lin
Zi-Hui Pei
Tianwei Lin
Lichao Huang
Zhizhong Su
107
38
0
20 Nov 2023
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin
Zichen Yu
Changyong Shu
Jiajun Deng
Kangjie Lu
Zongdai Liu
Jiangyong Yu
Dawei Yang
Hui Li
Yan Chen
125
58
0
18 Nov 2023
Multiple View Geometry Transformers for 3D Human Pose Estimation
Ziwei Liao
Jialiang Zhu
Chunyu Wang
Han Hu
Steven L. Waslander
ViT
82
2
0
18 Nov 2023
PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving
Zhili Chen
Maosheng Ye
Shuangjie Xu
Tongyi Cao
Qifeng Chen
145
12
0
14 Nov 2023
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
Chenfeng Xu
Huan Ling
Sanja Fidler
Or Litany
112
15
0
07 Nov 2023
Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps
Katie Z Luo
Xinshuo Weng
Yan Wang
Shuang Wu
Jie Li
Kilian Q. Weinberger
Yue Wang
Marco Pavone
85
30
0
07 Nov 2023
mmFUSION: Multimodal Fusion for 3D Objects Detection
Javed Ahmad
Alessio Del Bue
3DPC
78
10
0
07 Nov 2023
M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection
Hang Zhang
54
1
0
02 Nov 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
113
12
0
24 Oct 2023
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection
Linyan Huang
Zhiqi Li
Chonghao Sima
Wenhai Wang
Jingdong Wang
Yu Qiao
Hongyang Li
88
13
0
24 Oct 2023
MSFormer: A Skeleton-multiview Fusion Method For Tooth Instance Segmentation
Yuan Li
Huan Liu
Y. Tao
Xiangyang He
Haifeng Li
Xiaohu Guo
Hai Lin
107
0
0
23 Oct 2023
2D-3D Interlaced Transformer for Point Cloud Segmentation with Scene-Level Supervision
Cheng-Kun Yang
Min-Hung Chen
Yung-Yu Chuang
Yen-Yu Lin
ViT
3DPC
95
18
0
19 Oct 2023
Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing
Hao Lu
Yunpeng Zhang
Qing Lian
Dalong Du
Ying-Cong Chen
105
6
0
17 Oct 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
163
15
0
16 Oct 2023
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
Honghui Yang
Sha Zhang
Di Huang
Xiaoyang Wu
Haoyi Zhu
...
Hengshuang Zhao
Qibo Qiu
Binbin Lin
Xiaofei He
Wanli Ouyang
SSL
129
51
0
12 Oct 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
Haoyi Zhu
Honghui Yang
Xiaoyang Wu
Di Huang
Sha Zhang
...
Hengshuang Zhao
Chunhua Shen
Yu Qiao
Tong He
Wanli Ouyang
SSL
221
47
0
12 Oct 2023
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
137
78
0
11 Oct 2023
TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
Dongming Wu
Jiahao Chang
Fan Jia
Yingfei Liu
Tiancai Wang
Jianbing Shen
LRM
90
28
0
10 Oct 2023
CoBEV: Elevating Roadside 3D Object Detection with Depth and Height Complementarity
Haowen Shi
Chengshan Pang
Jiaming Zhang
Kailun Yang
Yuhao Wu
Huajian Ni
Yining Lin
Rainer Stiefelhagen
Kaiwei Wang
MDE
91
20
0
04 Oct 2023
Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection
Yiming Xie
Huaizu Jiang
Georgia Gkioxari
Julian Straub
3DPC
79
9
0
02 Oct 2023
BEVHeight++: Toward Robust Visual Centric 3D Object Detection
Lei Yang
Dor Tsur
Ziv Goldfeld
Peng Chen
Kun Yuan
Li-e Wang
Yi Huang
Xinyu Zhang
Kaicheng Yu
115
25
0
28 Sep 2023
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation
Zeyu Wang
Dingwen Li
Chenxu Luo
Cihang Xie
Xiaodong Yang
115
24
0
26 Sep 2023
A Vision-Centric Approach for Static Map Element Annotation
Jiaxin Zhang
Shiyuan Chen
Haoran Yin
Ruohong Mei
Xuan Liu
Cong Yang
Qian Zhang
Wei Sui
3DV
80
3
0
21 Sep 2023
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision
Mingjie Pan
Jiaming Liu
Renrui Zhang
Peixiang Huang
Xiaoqi Li
Bing Wang
Hongwei Xie
Li Liu
Shanghang Zhang
148
90
0
18 Sep 2023
FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection
Chunyong Hu
Hang Zheng
Kun Li
Jianyun Xu
Weibo Mao
...
Kaixuan Liu
Yiru Zhao
Peihan Hao
Minzhe Liu
Kaicheng Yu
ViT
3DPC
98
18
0
11 Sep 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
VLM
136
87
0
08 Sep 2023
ClusterFusion: Leveraging Radar Spatial Features for Radar-Camera 3D Object Detection in Autonomous Vehicles
Irfan Tito Kurniawan
B. Trilaksono
121
6
0
07 Sep 2023
Diffusion-based 3D Object Detection with Random Boxes
Xin Zhou
Jinghua Hou
Tingting Yao
Dingkang Liang
Yanfeng Guo
Zhikang Zou
Xiaoqing Ye
Jianwei Cheng
Xiang Bai
DiffM
77
8
0
05 Sep 2023
SOGDet: Semantic-Occupancy Guided Multi-view 3D Object Detection
Qiu Zhou
Jinming Cao
Hanchao Leng
Yifang Yin
Yu Kun
Roger Zimmermann
3DPC
80
8
0
26 Aug 2023
MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models
Xiyue Zhu
Vlas Zyrianov
Zhijian Liu
Shenlong Wang
94
12
0
24 Aug 2023
Delving into Motion-Aware Matching for Monocular 3D Object Tracking
Kuangyu Huang
Ming-Hsuan Yang
Yi-Hsuan Tsai
87
10
0
22 Aug 2023
QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection
Yifan Zhang
Zhen Dong
Huanrui Yang
Ming Lu
Cheng-Ching Tseng
Yuan Du
Kurt Keutzer
Li Du
Shanghang Zhang
MQ
80
9
0
21 Aug 2023
Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Xiaohui Jiang
Shuailin Li
Yingfei Liu
Shihao Wang
Fan Jia
Tiancai Wang
Lijin Han
Xiangyu Zhang
79
51
0
18 Aug 2023
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
Haisong Liu
Yao Teng
Tao Lu
Haiguang Wang
Liming Wang
117
107
0
18 Aug 2023
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
Tao Tu
Shun-Po Chuang
Yu-Lun Liu
Cheng Sun
Kecheng Zhang
D. Roy
Cheng-Hao Kuo
Min Sun
3DPC
116
7
0
17 Aug 2023
XVTP3D: Cross-view Trajectory Prediction Using Shared 3D Queries for Autonomous Driving
Zijian Song
Huikun Bi
Ruisi Zhang
Tianlu Mao
Zhaoqi Wang
3DPC
71
3
0
17 Aug 2023
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation
Haiyang Wang
Hao Tang
Shaoshuai Shi
Aoxue Li
Zhenguo Li
Bernt Schiele
Liwei Wang
ViT
125
56
0
15 Aug 2023
UniWorld: Autonomous Driving Pre-training via World Models
Chen Min
Dawei Zhao
Liang Xiao
Yiming Nie
Bin Dai
VGen
76
23
0
14 Aug 2023
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction
Bencheng Liao
Shaoyu Chen
Yunchi Zhang
Bo Jiang
Qian Zhang
Wenyu Liu
Chang Huang
Xinggang Wang
3DV
ViT
175
123
0
10 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
133
57
0
09 Aug 2023
LATR: 3D Lane Detection from Monocular Images with Transformer
Yueru Luo
Chaoda Zheng
Xu Yan
Tang Kun
Chao Zheng
Shuguang Cui
Zhen Li
ViT
97
37
0
08 Aug 2023
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection
Yilun Chen
Zhiding Yu
Yukang Chen
Shiyi Lan
Anima Anandkumar
Jiaya Jia
J. Álvarez
3DPC
101
51
0
08 Aug 2023
FB-BEV: BEV Representation from Forward-Backward View Transformations
Zhiqi Li
Zhiding Yu
Wenhai Wang
Anima Anandkumar
Tong Lu
J. Álvarez
96
87
0
04 Aug 2023
QUEST: Query Stream for Practical Cooperative Perception
Siqi Fan
Haibao Yu
Wen-Yen Yang
Jirui Yuan
Zaiqing Nie
89
11
0
03 Aug 2023
Target-point Attention Transformer: A novel trajectory predict network for end-to-end autonomous driving
Jing Du
Yang Zhao
Hong-wei Cheng
ViT
58
1
0
03 Aug 2023