ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.05729
  4. Cited By
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient
  Self-Supervised Monocular Depth Estimation

ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation

12 December 2022
Daitao Xing
Jinglin Shen
C. Ho
Anthony Tzes
    ViT
    MDE
ArXivPDFHTML

Papers citing "ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation"

40 / 40 papers shown
Title
RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic
  Scenes
RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes
Tak-Wai Hui
MDE
96
47
0
08 Mar 2023
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision
  Transformer
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
60
181
0
06 Aug 2022
DepthFormer: Exploiting Long-Range Correlation and Local Information for
  Accurate Monocular Depth Estimation
DepthFormer: Exploiting Long-Range Correlation and Local Information for Accurate Monocular Depth Estimation
Zhenyu Li
Zehui Chen
Xianming Liu
Junjun Jiang
ViT
MDE
52
185
1
27 Mar 2022
Vision Transformer with Deformable Attention
Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
S. Song
Li Erran Li
Gao Huang
ViT
72
476
0
03 Jan 2022
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth
  Estimation
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation
Jiaxing Yan
Hong Zhao
Penghui Bu
Yusheng Jin
3DPC
MDE
33
131
0
24 Dec 2021
BoxeR: Box-Attention for 2D and 3D Transformers
BoxeR: Box-Attention for 2D and 3D Transformers
Duy-Kien Nguyen
Jihong Ju
Olaf Booji
Martin R. Oswald
Cees G. M. Snoek
ViT
46
36
0
25 Nov 2021
CamLessMonoDepth: Monocular Depth Estimation with Unknown Camera
  Parameters
CamLessMonoDepth: Monocular Depth Estimation with Unknown Camera Parameters
Sai Shyam Chanduri
Zeeshan Khan Suri
Igor Vozniak
Christian Müller
MDE
52
12
0
27 Oct 2021
X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task
  Distillation
X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task Distillation
H. Cai
J. Matai
Shubhankar Borse
Yizhe Zhang
Amin Ansari
Fatih Porikli
FedML
VLM
MDE
84
21
0
24 Oct 2021
Fine-grained Semantics-aware Representation Enhancement for
  Self-supervised Monocular Depth Estimation
Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation
Hyun-Joo Jung
Eunhyeok Park
S. Yoo
MDE
50
109
0
19 Aug 2021
Vision Transformer with Progressive Sampling
Vision Transformer with Progressive Sampling
Xiaoyu Yue
Shuyang Sun
Zhanghui Kuang
Meng Wei
Philip Torr
Wayne Zhang
Dahua Lin
ViT
73
84
0
03 Aug 2021
DPT: Deformable Patch-based Transformer for Visual Recognition
DPT: Deformable Patch-based Transformer for Visual Recognition
Zhiyang Chen
Yousong Zhu
Chaoyang Zhao
Guosheng Hu
Wei Zeng
Jinqiao Wang
Ming Tang
ViT
36
99
0
30 Jul 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with
  Transformers
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
229
4,990
0
31 May 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
391
21,281
0
25 Mar 2021
Vision Transformers for Dense Prediction
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
125
1,717
0
24 Mar 2021
Monocular Depth Estimation through Virtual-world Supervision and
  Real-world SfM Self-Supervision
Monocular Depth Estimation through Virtual-world Supervision and Real-world SfM Self-Supervision
A. Gurram
Ahmet Faruk Tuna
Fengyi Shen
O. Urfalioglu
Antonio M. López
MDE
59
42
0
22 Mar 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
481
3,699
0
24 Feb 2021
Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth
  Estimation with Both Implicit and Explicit Semantic Guidance
Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance
Rui Li
Xiantuo He
Danna Xue
Shaolin Su
Qing Mao
Yu Zhu
Jinqiu Sun
Yanning Zhang
SSL
MDE
85
30
0
11 Feb 2021
Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection
  Consistency
Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency
Seokju Lee
Sunghoon Im
Stephen Lin
In So Kweon
3DV
MDE
54
90
0
04 Feb 2021
HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation
HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation
Xiaoyang Lyu
Lu Liu
Mengmeng Wang
Xin Kong
Lina Liu
Yong Liu
Xinxin Chen
Yi Yuan
MDE
77
257
0
14 Dec 2020
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
Pei Sun
Rufeng Zhang
Yi Jiang
Tao Kong
Chenfeng Xu
...
Masayoshi Tomizuka
Lei Li
Zehuan Yuan
Changhu Wang
Ping Luo
ObjD
91
1,091
0
25 Nov 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
186
5,046
0
08 Oct 2020
SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation
  Synergized with Semantic Segmentation for Autonomous Driving
SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized with Semantic Segmentation for Autonomous Driving
V. Kumar
Marvin Klingner
S. Yogamani
Stefan Milz
Tim Fingscheidt
Patrick Mäder
MDE
59
81
0
10 Aug 2020
Learning Monocular Visual Odometry via Self-Supervised Long-Term
  Modeling
Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling
Yuliang Zou
Pan Ji
Quoc-Huy Tran
Jia-Bin Huang
Manmohan Chandraker
SSL
96
68
0
21 Jul 2020
Feature-metric Loss for Self-supervised Learning of Depth and Egomotion
Feature-metric Loss for Self-supervised Learning of Depth and Egomotion
Chang Shu
Kun Yu
Zhixiang Duan
Kuiyuan Yang
SSL
MDE
65
233
0
21 Jul 2020
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object
  Problem by Semantic Guidance
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance
Marvin Klingner
Jan-Aike Termöhlen
Jonas Mikolajczyk
Tim Fingscheidt
MDE
106
321
0
14 Jul 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
340
12,966
0
26 May 2020
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
Wang Zhao
Shaohui Liu
Yezhi Shu
Yong Liu
MDE
67
155
0
03 Apr 2020
The Edge of Depth: Explicit Constraints between Segmentation and Depth
The Edge of Depth: Explicit Constraints between Segmentation and Depth
Shengjie Zhu
Garrick Brazil
Xiaoming Liu
MDE
50
105
0
01 Apr 2020
Distilled Semantics for Comprehensive Scene Understanding from Videos
Distilled Semantics for Comprehensive Scene Understanding from Videos
Fabio Tosi
Filippo Aleotti
Pierluigi Zama Ramirez
Matteo Poggi
Samuele Salti
Luigi Di Stefano
S. Mattoccia
MDE
36
77
0
31 Mar 2020
Self-supervised Monocular Trained Depth Estimation using Self-attention
  and Discrete Disparity Volume
Self-supervised Monocular Trained Depth Estimation using Self-attention and Discrete Disparity Volume
A. Johnston
G. Carneiro
MDE
52
234
0
31 Mar 2020
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual
  Odometry
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
Nan Yang
Lukas von Stumberg
Rui Wang
Daniel Cremers
MDE
73
375
0
02 Mar 2020
Semantically-Guided Representation Learning for Self-Supervised
  Monocular Depth
Semantically-Guided Representation Learning for Self-Supervised Monocular Depth
Vitor Campagnolo Guizilini
Rui Hou
Jie Li
Rares Andrei Ambrus
Adrien Gaidon
SSL
MDE
86
229
0
27 Feb 2020
Unsupervised Scale-consistent Depth and Ego-motion Learning from
  Monocular Video
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video
Jiawang Bian
Zhichao Li
Naiyan Wang
Huangying Zhan
Chunhua Shen
Ming-Ming Cheng
Ian Reid
MDE
70
509
0
28 Aug 2019
3D Packing for Self-Supervised Monocular Depth Estimation
3D Packing for Self-Supervised Monocular Depth Estimation
Vitor Campagnolo Guizilini
Rares Andrei Ambrus
Sudeep Pillai
Allan Raventos
Adrien Gaidon
SSL
3DPC
MDE
69
645
0
06 May 2019
Depth Prediction Without the Sensors: Leveraging Structure for
  Unsupervised Learning from Monocular Videos
Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos
Vincent Casser
Soren Pirk
R. Mahjourian
A. Angelova
SSL
MDE
77
470
0
15 Nov 2018
MetaAnchor: Learning to Detect Objects with Customized Anchors
MetaAnchor: Learning to Detect Objects with Customized Anchors
Tong Yang
Xiangyu Zhang
Zeming Li
Wenqiang Zhang
Jian Sun
ObjD
75
136
0
03 Jul 2018
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera
  Motion, Optical Flow and Motion Segmentation
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation
Anurag Ranjan
Varun Jampani
Lukas Balles
Kihwan Kim
Deqing Sun
Jonas Wulff
Michael J. Black
SSL
56
591
0
24 May 2018
Unsupervised Learning of Depth and Ego-Motion from Video
Unsupervised Learning of Depth and Ego-Motion from Video
Tinghui Zhou
Matthew A. Brown
Noah Snavely
D. Lowe
MDE
114
2,571
0
25 Apr 2017
Deformable Convolutional Networks
Deformable Convolutional Networks
Jifeng Dai
Haozhi Qi
Yuwen Xiong
Yi Li
Guodong Zhang
Han Hu
Yichen Wei
196
5,320
0
17 Mar 2017
Predicting Depth, Surface Normals and Semantic Labels with a Common
  Multi-Scale Convolutional Architecture
Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture
David Eigen
Rob Fergus
VLM
MDE
181
2,678
0
18 Nov 2014
1