ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.01256
  4. Cited By
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

2 June 2022
Yingfei Liu
Junjie Yan
Fan Jia
Shuailin Li
Q. Gao
Tiancai Wang
Xinming Zhang
Jian Sun
    3DPC
ArXivPDFHTML

Papers citing "PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images"

50 / 265 papers shown
Title
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object
  Detection in Bird's-Eye-View
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
32
0
0
29 Aug 2024
Panoptic Perception for Autonomous Driving: A Survey
Panoptic Perception for Autonomous Driving: A Survey
Yunge Li
Lanyu Xu
40
2
0
27 Aug 2024
AdaOcc: Adaptive-Resolution Occupancy Prediction
AdaOcc: Adaptive-Resolution Occupancy Prediction
Chao-Yeh Chen
Ruoyu Wang
Yuliang Guo
Cheng Zhao
Xinyu Huang
Chen Feng
Liu Ren
50
0
0
24 Aug 2024
MaskBEV: Towards A Unified Framework for BEV Detection and Map
  Segmentation
MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Xiao Zhao
Xukun Zhang
Dingkang Yang
Mingyang Sun
Mingcheng Li
Shunli Wang
Lihua Zhang
MoE
42
1
0
17 Aug 2024
PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors
PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors
Rongxuan Wang
Xin Lu
Xiaoyang Liu
Xiaoyi Zou
Tongyi Cao
Ying Li
51
5
0
16 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for
  Multi-Modal 3D Detection
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
51
4
0
12 Aug 2024
KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for
  autonomous driving
KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving
Zhihao Lai
Chuanhao Liu
Shihui Sheng
Zhiqiang Zhang
3DPC
52
0
0
04 Aug 2024
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and
  Proximity-based Modality Ensemble
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha
Minseok Joo
Jihwan Park
Sanghyeok Lee
In-Ho Kim
Hyunwoo J. Kim
43
2
0
27 Jul 2024
PrevPredMap: Exploring Temporal Modeling with Previous Predictions for
  Online Vectorized HD Map Construction
PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction
Nan Peng
Xun Zhou
Mingming Wang
Xiaojun Yang
Songming Chen
Guisong Chen
41
3
0
24 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
46
0
0
24 Jul 2024
Learning High-resolution Vector Representation from Multi-Camera Images
  for 3D Object Detection
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
58
1
0
22 Jul 2024
Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object
  Detection
Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection
Yiran Yang
Xu Gao
Tong Wang
Xin Hao
Yifeng Shi
Xiao Tan
Xiaoqing Ye
Jingdong Wang
3DPC
37
0
0
22 Jul 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via
  Ray-Centric Strategies
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
YiFan Duan
Yao Li
Yanyong Zhang
46
3
0
20 Jul 2024
GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV
  Segmentation
GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation
Florian Chabot
Nicolas Granger
G. Lapouge
3DGS
43
3
0
19 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View
  Segmentation Masks
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
57
7
0
18 Jul 2024
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework
  for Bird's-eye-view Vehicle Semantic Segmentation
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation
Jian Sun
Yuqi Dai
Chi-Man Vong
Qing Xu
Shengbo Eben Li
Jianqiang Wang
Lei He
Keqiang Li
40
1
0
18 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D
  Perception
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
32
3
0
15 Jul 2024
LabelDistill: Label-guided Cross-modal Knowledge Distillation for
  Camera-based 3D Object Detection
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Sanmin Kim
Youngseok Kim
Sihwan Hwang
H. Jeong
Dongsuk Kum
40
4
0
14 Jul 2024
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
Zheng Jiang
Jinqing Zhang
Yanan Zhang
Qingjie Liu
Zhenghui Hu
Baohui Wang
Yunhong Wang
32
2
0
14 Jul 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative
  Perception
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang
Lu Bin
Xinyu Xiao
Zhiyu Xiang
Hangguan Shan
Eryun Liu
ViT
45
2
0
13 Jul 2024
Occupancy as Set of Points
Occupancy as Set of Points
Yiang Shi
Tianheng Cheng
Qian Zhang
Wenyu Liu
Xinggang Wang
3DPC
48
13
0
04 Jul 2024
Cyclic Refiner: Object-Aware Temporal Representation Learning for
  Multi-View 3D Detection and Tracking
Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking
Mingzhe Guo
Zhipeng Zhang
Liping Jing
Yuan He
Ke Wang
Heng Fan
42
1
0
03 Jul 2024
Hierarchical Temporal Context Learning for Camera-based Semantic Scene
  Completion
Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion
Bohan Li
Jiajun Deng
Wenyao Zhang
Zhujin Liang
Dalong Du
Xin Jin
Wenjun Zeng
42
8
0
02 Jul 2024
MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for
  Multi-View 3D Object Detection
MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection
Michelle Adeline
Junn Yong Loo
Vishnu Monn Baskaran
57
0
0
25 Jun 2024
Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with
  Latency Constraint
Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint
Xinglong Sun
Barath Lakshmanan
Maying Shen
Shiyi Lan
Jingde Chen
Jose Alvarez
VLM
44
3
0
17 Jun 2024
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in
  Vision-based Roadside 3D Object Detection
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Wenjie Wang
Yehao Lu
Guangcong Zheng
Shuigen Zhan
Xiaoqing Ye
Zichang Tan
Jingdong Wang
Gaoang Wang
Xi Li
68
9
0
13 Jun 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise
  Transformer
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
48
0
0
12 Jun 2024
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for
  Autonomous Driving
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving
Yining Shi
Jiusi Li
Kun Jiang
Ke Wang
Yunlong Wang
Mengmeng Yang
Diange Yang
3DPC
50
5
0
11 Jun 2024
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors
Han Li
Zehao Huang
Zitian Wang
Wenge Rong
Naiyan Wang
Si Liu
ViT
3DPC
45
7
0
05 Jun 2024
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking
Lijun Zhou
Tao Tang
Pengkun Hao
Zihang He
Kalok Ho
...
Zhihui Hao
Haiyang Sun
Kun Zhan
Peng Jia
Xianpeng Lang
VOT
58
4
0
04 Jun 2024
SparseDrive: End-to-End Autonomous Driving via Sparse Scene
  Representation
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
Wenchao Sun
Xuewu Lin
Yining Shi
Chuang Zhang
Haoran Wu
Sifa Zheng
48
24
0
30 May 2024
Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?
Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?
Yifan Bai
Dongming Wu
Yingfei Liu
Fan Jia
Weixin Mao
...
Yucheng Zhao
Jianbing Shen
Xing Wei
Tiancai Wang
Xiangyu Zhang
MLLM
40
9
0
28 May 2024
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
Shaoyuan Xie
Lingdong Kong
Wenwei Zhang
Jiawei Ren
Liang Pan
Kai-xiang Chen
Ziwei Liu
AAML
58
9
0
27 May 2024
MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object
  Detection Method
MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method
Pan Liao
Feng Yang
Di Wu
Liu Bo
34
1
0
24 May 2024
Context and Geometry Aware Voxel Transformer for Semantic Scene
  Completion
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Zhuopu Yu
Runmin Zhang
Jiacheng Ying
Junchen Yu
Xiaohai Hu
Lun Luo
Siyuan Cao
Hui-Liang Shen
ViT
54
12
0
22 May 2024
Multi-View Attentive Contextualization for Multi-View 3D Object
  Detection
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Xianpeng Liu
Ce Zheng
Ming Qian
Nan Xue
Cheng Chen
Zhebin Zhang
Chen Li
Tianfu Wu
41
2
0
20 May 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
48
1
0
17 May 2024
RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
Xiaosu Zhu
Hualian Sheng
Sijia Cai
Bing Deng
Shaopeng Yang
Qiao Liang
Ken Chen
Lianli Gao
Jingkuan Song
Jieping Ye
48
4
0
16 May 2024
TP3M: Transformer-based Pseudo 3D Image Matching with Reference
TP3M: Transformer-based Pseudo 3D Image Matching with Reference
Liming Han
Zhaoxiang Liu
Shiguo Lian
29
0
0
14 May 2024
A Survey on Occupancy Perception for Autonomous Driving: The Information
  Fusion Perspective
A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
Huaiyuan Xu
Junliang Chen
Shiyu Meng
Yi Wang
Lap-Pui Chau
3DPC
44
16
0
08 May 2024
CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot
  Applications
CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications
J. Blumenkamp
Steven D. Morad
Jennifer Gielis
Amanda Prorok
31
4
0
02 May 2024
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Shihao Wang
Zhiding Yu
Xiaohui Jiang
Shiyi Lan
Min Shi
Nadine Chang
Jan Kautz
Ying Li
Jose M. Alvarez
LRM
40
47
0
02 May 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining
  BEV Segmentation Networks
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
38
7
0
22 Apr 2024
TempBEV: Improving Learned BEV Encoders with Combined Image and BEV
  Space Temporal Aggregation
TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation
T. Monninger
Vandana Dokkadi
Md Zafar Anwar
Steffen Staab
37
2
0
17 Apr 2024
Homography Guided Temporal Fusion for Road Line and Marking Segmentation
Homography Guided Temporal Fusion for Road Line and Marking Segmentation
Shan Wang
Chuong H. Nguyen
Jiawei Liu
Kaihao Zhang
Wenhan Luo
Yanhao Zhang
Sundaram Muthu
F. A. Maken
Hongdong Li
44
4
0
11 Apr 2024
SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End
  Autonomous Driving
SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
Diankun Zhang
Guoan Wang
Runwen Zhu
Jianbo Zhao
Xiwu Chen
...
Haotian Yao
Chi Zhang
Xiaojun Liu
Xiaoguang Di
Bin Li
31
11
0
10 Apr 2024
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong
  Eliciting
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Hao Lu
Jiaqi Tang
Xinli Xu
Xu Cao
Yunpeng Zhang
Guoqing Wang
Dalong Du
Hao Chen
Ying Chen
35
3
0
10 Apr 2024
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues
Xiahan Chen
Mingjian Chen
Sanli Tang
Yi Niu
Jiang Zhu
31
2
0
08 Apr 2024
Better Monocular 3D Detectors with LiDAR from the Past
Better Monocular 3D Detectors with LiDAR from the Past
Yurong You
Cheng Perng Phoo
Carlos Diaz-Ruiz
Katie Z Luo
Wei-Lun Chao
Mark E. Campbell
B. Hariharan
Kilian Q. Weinberger
3DPC
33
1
0
08 Apr 2024
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
Hou-I Liu
Christine Wu
Jen-Hao Cheng
Wenhao Chai
Shian-Yun Wang
...
Jenq-Neng Hwang
Hong-Han Shuai
Wen-Huang Cheng
Hong-Han Shuai
Wen-Huang Cheng
42
2
0
07 Apr 2024
Previous
123456
Next