Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.00782
Cited By
Polarized Self-Attention: Towards High-quality Pixel-wise Regression
2 July 2021
Huajun Liu
Fuqiang Liu
Xinyi Fan
Dong Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Polarized Self-Attention: Towards High-quality Pixel-wise Regression"
39 / 39 papers shown
Title
From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction
V. Golovkin
Nikolay Nemtsev
Vasyl Shandyba
Oleg Udin
Nikita Kasatkin
Pavel Kononov
Anton Afanasiev
Sergey Ulasen
Andrei Boiarov
31
0
0
08 Apr 2025
Improving Accuracy and Generalization for Efficient Visual Tracking
Ram J. Zaveri
Shivang Patel
Yu Gu
Gianfranco Doretto
VLM
91
0
0
28 Nov 2024
UNSCT-HRNet: Modeling Anatomical Uncertainty for Landmark Detection in Total Hip Arthroplasty
Jiaxin Wan
Lin Liu
Haoran Wang
Liangwei Li
Wei Li
...
Xiangbo Shu
Qingbin Liu
Jing Zhang
Xiaohui Du
Ruqian Hao
26
0
0
13 Nov 2024
A Sinkhorn Regularized Adversarial Network for Image Guided DEM Super-resolution using Frequency Selective Hybrid Graph Transformer
Subhajit Paul
Ashutosh Gupta
25
0
0
21 Sep 2024
RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation
Tao Jiang
Xinchen Xie
Yining Li
3DH
46
2
0
11 Jul 2024
Hi5: 2D Hand Pose Estimation with Zero Human Annotation
Masum Hasan
Cengiz Ozel
Nina Long
Alexander Martin
Samuel Potter
Tariq Adnan
Sangwu Lee
Amir Zadeh
Ehsan Hoque
DiffM
3DH
37
0
0
05 Jun 2024
SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer
Tong Zhang
Wenxue Cui
Shao-Bin Liu
Feng Jiang
33
1
0
23 Apr 2024
Human Orientation Estimation under Partial Observation
Jieting Zhao
Hanjing Ye
Yu Zhan
Hong Zhang
41
0
0
22 Apr 2024
Text in the Dark: Extremely Low-Light Text Image Enhancement
Che-Tsung Lin
Chun Chet Ng
Zhi Qin Tan
Wan Jun Nah
Xinyu Wang
Jie-Long Kew
Po-Hao Hsu
Shang-Hong Lai
Chee Seng Chan
Christopher Zach
32
0
0
22 Apr 2024
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector
Junbo Li
Keyan Chen
Gengju Tian
Lu Li
Z. Shi
52
1
0
05 Apr 2024
Deep Common Feature Mining for Efficient Video Semantic Segmentation
Yaoyan Zheng
Hongyu Yang
Di Huang
54
0
0
05 Mar 2024
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation
Jiahui Zhong
Wenhong Tian
Yuanlun Xie
Zhijia Liu
Jie Ou
Taoran Tian
Lei Zhang
23
6
0
15 Jan 2024
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
MDE
VLM
19
15
0
22 Dec 2023
CLiSA: A Hierarchical Hybrid Transformer Model using Orthogonal Cross Attention for Satellite Image Cloud Segmentation
Subhajit Paul
Ashutosh Gupta
27
2
0
29 Nov 2023
SegmATRon: Embodied Adaptive Semantic Segmentation for Indoor Environment
T. Zemskova
Margarita Kichik
Dmitry A. Yudin
A. Staroverov
Aleksandr I. Panov
29
1
0
18 Oct 2023
ILNet: Low-level Matters for Salient Infrared Small Target Detection
Haoqing Li
Jinfu Yang
Runshi Wang
Yifei Xu
19
6
0
24 Sep 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
38
11
0
16 Sep 2023
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
Miroslav Purkrábek
Jivrí Matas
22
2
0
13 Jul 2023
Efficient Multi-Scale Attention Module with Cross-Spatial Learning
Daliang Ouyang
Su He
Jian Zhan
M.L. Luo
Huaiyong Guo
Guo-Liang Zhang
Zhijie Huang
35
504
0
23 May 2023
Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for Speech-to-Image Generation
Zhenxing Zhang
Lambert Schomaker
19
2
0
17 May 2023
Diabetic Foot Ulcer Grand Challenge 2022 Summary
Connah Kendrick
B. Cassidy
N. Reeves
Joseph M Pappachan
C. O'Shea
Vishnu Chandrabalan
Moi Hoon Yap
17
4
0
24 Apr 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
154
3,070
0
14 Apr 2023
PP-MobileSeg: Explore the Fast and Accurate Semantic Segmentation Model on Mobile Devices
Shiyu Tang
Ting Sun
Juncai Peng
Guowei Chen
Yuying Hao
Manhui Lin
Z. Xiao
Jiangbin You
Yi Liu
ViT
24
14
0
11 Apr 2023
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose
Tao Jiang
Peng Lu
Li Zhang
Ning Ma
Rui Han
Chengqi Lyu
Yining Li
Kai-xiang Chen
3DH
47
158
0
13 Mar 2023
Mask Matching Transformer for Few-Shot Segmentation
Siyu Jiao
Gengwei Zhang
Shant Navasardyan
Ling-Hao Chen
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
42
28
0
05 Dec 2022
Progressively Dual Prior Guided Few-shot Semantic Segmentation
Qinglong Cao
Yuntian Chen
Xiwen Yao
Junwei Han
19
0
0
20 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
47
660
0
10 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
31
330
0
10 Nov 2022
One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement
Zihao Yin
Ping Gong
Chun-yu Wang
Yizhou Yu
Yizhou Wang
29
12
0
31 Jul 2022
CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images
Weiwei Cui
Yaqi Wang
Qianni Zhang
Huiyu Zhou
Dansheg Song
Xingyong Zuo
Gangyong Jia
Liaoyuan Zeng
3DPC
32
14
0
17 Jun 2022
SHOP: A Deep Learning Based Pipeline for near Real-Time Detection of Small Handheld Objects Present in Blurry Video
Abhinav Ganguly
Amar C Gandhi
Sylvia E
Jeffrey D Chang
Ian M Hudson
ObjD
26
0
0
29 Mar 2022
Occluded Human Mesh Recovery
Rawal Khirodkar
Shashank Tripathi
Kris Kitani
3DH
35
69
0
24 Mar 2022
Beyond Fixation: Dynamic Window Visual Transformer
Pengzhen Ren
Changlin Li
Guangrun Wang
Yun Xiao
Qing Du
Xiaodan Liang
Qing Du Xiaodan Liang Xiaojun Chang
ViT
36
32
0
24 Mar 2022
HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging
Xiaowan Hu
Yuanhao Cai
Jing Lin
Haoqian Wang
X. Yuan
Yulun Zhang
Radu Timofte
Luc Van Gool
37
134
0
04 Mar 2022
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain
Anukriti Singh
Nikita Orlov
Zilong Huang
Jiachen Li
Steven Walton
Humphrey Shi
ViT
29
93
0
23 Dec 2021
Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation
Zhengxiong Luo
Zhicheng Wang
Yan Huang
Tieniu Tan
Erjin Zhou
124
141
0
30 Dec 2020
Deep Learning-Based Human Pose Estimation: A Survey
Ce Zheng
Wenhan Wu
Chong Chen
Taojiannan Yang
Sijie Zhu
Ju Shen
N. Kehtarnavaz
M. Shah
3DH
105
566
0
24 Dec 2020
A Survey on Deep Learning in Medical Image Analysis
G. Litjens
Thijs Kooi
B. Bejnordi
A. Setio
F. Ciompi
Mohsen Ghafoorian
Jeroen van der Laak
Bram van Ginneken
C. I. Sánchez
OOD
343
10,633
0
19 Feb 2017
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
Zifeng Wu
Chunhua Shen
Anton Van Den Hengel
SSeg
260
1,494
0
30 Nov 2016
1