Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.12091
Cited By
Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction
22 March 2021
Guanglei Yang
Hao Tang
M. Ding
N. Sebe
Elisa Ricci
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction"
38 / 38 papers shown
Title
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
Bojin Wu
Jing Chen
MDE
46
0
0
05 May 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
59
0
0
21 Apr 2025
Revisiting Gradient-based Uncertainty for Monocular Depth Estimation
Julia Hornauer
Amir El-Ghoussani
Vasileios Belagiannis
UQCV
55
0
0
09 Feb 2025
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
91
0
0
01 Dec 2024
DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain
Kun Wang
Zhiqiang Yan
Junkai Fan
Wanlu Zhu
X. Li
Jun Li
Jian Yang
MDE
31
5
0
19 Oct 2024
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
Nischal Khanal
Shivanand Venkanna Sheshappanavar
MDE
42
0
0
10 Sep 2024
Towards Scale-Aware Full Surround Monodepth with Transformers
Yuchen Yang
Xinyi Wang
Dong Li
Lu Tian
Ashish Sirasao
Xun Yang
MDE
ViT
29
2
0
15 Jul 2024
SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization
Ashish Tiwari
S. Raman
MDE
21
1
0
12 Jul 2024
DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation
Mengtan Zhang
Yi Feng
Qijun Chen
Rui Fan
MDE
43
5
0
27 May 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
54
115
0
22 Mar 2024
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
Haiping Wang
Yuan-Bin Liu
Bing Wang
Yujing Sun
Zhenchao Dong
Wenping Wang
Bisheng Yang
DiffM
32
10
0
05 Oct 2023
ADU-Depth: Attention-based Distillation with Uncertainty Modeling for Depth Estimation
Zizhang Wu
Zhuozheng Li
Zhi-Gang Fan
Yunzhe Wu
Xiaoquan Wang
Rui Tang
Jian Pu
23
1
0
26 Sep 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Lingdong Kong
Yaru Niu
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
...
Zhenyu Li
Runze Chen
Haiyong Luo
Fang Zhao
Jing Yu
28
13
0
27 Jul 2023
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffM
VLM
47
130
0
30 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
H. Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
43
19
0
14 Mar 2023
A Simple Baseline for Supervised Surround-view Depth Estimation
Xianda Guo
Wenjie Yuan
Yunpeng Zhang
Tian Yang
Chenming Zhang
Zhengbiao Zhu
Long Chen
MDE
36
3
0
14 Mar 2023
DwinFormer: Dual Window Transformers for End-to-End Monocular Depth Estimation
Md Awsafur Rahman
S. Fattah
ViT
MDE
30
4
0
06 Mar 2023
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
28
22
0
19 Nov 2022
Rethinking Skip Connections in Encoder-decoder Networks for Monocular Depth Estimation
Zhitong Lai
Haichao Sun
Rui Tian
Nannan Ding
Zhiguo Wu
Yanjie Wang
MDE
24
3
0
29 Aug 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
31
174
0
06 Aug 2022
ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
10
10
0
20 Jul 2022
Interaction Transformer for Human Reaction Generation
Baptiste Chopin
Hao Tang
N. Otberdout
Mohamed Daoudi
N. Sebe
ViT
25
27
0
04 Jul 2022
Self-Supervised Pre-training of Vision Transformers for Dense Prediction Tasks
Jaonary Rabarisoa
Velentin Belissen
Florian Chabot
Q. C. Pham
VLM
ViT
SSL
MDE
15
2
0
30 May 2022
Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation
Ji-Hoon Bae
Sungho Moon
Sunghoon Im
MDE
20
84
0
23 May 2022
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation
Chang Shu
Zi-Chun Chen
Lei Chen
Kuan Ma
Minghui Wang
Haibing Ren
ViT
19
14
0
29 Apr 2022
Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work
Khawar Islam
ViT
28
45
0
03 Mar 2022
Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics
Arnav Varma
Hemang Chawla
Bahram Zonooz
Elahe Arani
ViT
MDE
31
49
0
07 Feb 2022
Shape from Polarization for Complex Scenes in the Wild
Chenyang Lei
Chenyang Qi
Jiaxin Xie
Na Fan
V. Koltun
Qifeng Chen
32
60
0
21 Dec 2021
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
10
20
0
14 Dec 2021
Object-aware Monocular Depth Prediction with Instance Convolutions
Enis Simsar
Evin Pınar Örnek
Fabian Manhardt
Helisa Dhamo
Nassir Navab
F. Tombari
3DH
MDE
23
1
0
02 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
25
6
0
26 Nov 2021
MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
Wenhao Li
Hong Liu
H. Tang
Pichao Wang
Luc Van Gool
ViT
27
246
0
24 Nov 2021
Global and Local Alignment Networks for Unpaired Image-to-Image Translation
Guanglei Yang
H. Tang
Humphrey Shi
M. Ding
N. Sebe
Radu Timofte
Luc Van Gool
Elisa Ricci
13
1
0
19 Nov 2021
Depth360: Self-supervised Learning for Monocular Depth Estimation using Learnable Camera Distortion Model
Noriaki Hirose
Kosuke Tahara
MDE
24
4
0
20 Oct 2021
Self-supervised Depth Estimation Leveraging Global Perception and Geometric Smoothness Using On-board Videos
Shaocheng Jia
Xin Pei
W. Yao
S. Wong
3DPC
MDE
35
19
0
07 Jun 2021
Probabilistic Graph Attention Network with Conditional Kernels for Pixel-Wise Prediction
Dan Xu
Xavier Alameda-Pineda
Wanli Ouyang
Elisa Ricci
Xiaogang Wang
N. Sebe
28
23
0
08 Jan 2021
Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu
Mingming Gong
Chaohui Wang
Kayhan Batmanghelich
Dacheng Tao
MDE
185
1,707
0
06 Jun 2018
The Application of Two-level Attention Models in Deep Convolutional Neural Network for Fine-grained Image Classification
Tianjun Xiao
Yichong Xu
Kuiyuan Yang
Jiaxing Zhang
Yuxin Peng
Zheng-Wei Zhang
156
789
0
24 Nov 2014
1