Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.07524
Cited By
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction
10 April 2025
Xu Zhao
Pengju Zhang
Bo Liu
Yihong Wu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction"
19 / 19 papers shown
Title
OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment
Youshaa Murhij
Dmitry A. Yudin
98
8
0
02 Apr 2024
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
Yupeng Zheng
Xiang Li
Pengfei Li
Yuhang Zheng
Bu Jin
Chengliang Zhong
Xiaoxiao Long
Hao Zhao
Qichao Zhang
76
30
0
13 Mar 2024
OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
128
10
0
08 Mar 2024
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin
Zichen Yu
Changyong Shu
Jiajun Deng
Kangjie Lu
Zongdai Liu
Jiangyong Yu
Dawei Yang
Hui Li
Yan Chen
106
58
0
18 Nov 2023
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
Yi Wei
Linqing Zhao
Wenzhao Zheng
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
3DPC
87
231
0
16 Mar 2023
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
Yiming Li
Zhiding Yu
Chris Choy
Chaowei Xiao
J. Álvarez
Sanja Fidler
Chen Feng
Anima Anandkumar
ViT
126
236
0
23 Feb 2023
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
Yuan-Ko Huang
Wenzhao Zheng
Yunpeng Zhang
Jie Zhou
Jiwen Lu
3DPC
100
303
0
15 Feb 2023
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
Yinhao Li
Zheng Ge
Guanyi Yu
Jinrong Yang
Zengran Wang
Yukang Shi
Jian‐Yuan Sun
Zeming Li
MDE
86
622
0
21 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
141
392
0
06 Jun 2022
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
132
1,308
0
31 Mar 2022
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
274
2,385
0
02 Dec 2021
MonoScene: Monocular 3D Semantic Scene Completion
Anh-Quan Cao
Raoul de Charette
3DV
99
284
0
01 Dec 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
485
7,837
0
11 Nov 2021
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D
Yiyi Liao
Jun Xie
Andreas Geiger
3DV
3DPC
84
590
0
28 Sep 2021
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
262
5,107
0
08 Oct 2020
LMSCNet: Lightweight Multiscale 3D Semantic Completion
Luis Roldão
Raoul de Charette
Anne Verroust-Blondet
72
160
0
24 Aug 2020
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D
Jonah Philion
Sanja Fidler
102
1,055
0
13 Aug 2020
Semantic Scene Completion from a Single Depth Image
Shuran Song
Feng Yu
Andy Zeng
Angel X. Chang
Manolis Savva
Thomas Funkhouser
3DV
93
1,246
0
28 Nov 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,641
0
10 Dec 2015
1