2407.07441
Cited By
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation
10 July 2024
Guoan Xu
Wenjing Jia
Tao Wu
Ligeng Chen
Guangwei Gao
ViT
Papers citing
"HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation"
25 / 25 papers shown
Title
FacT: Factor-Tuning for Lightweight Adaptation on Vision Transformer
Shibo Jie
Zhi-Hong Deng
49
131
0
06 Dec 2022
HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation
Moein Heidari
Amirhossein Kazerouni
Milad Soltany Kadarvish
Reza Azad
Ehsan Khodapanah Aghdam
Julien Cohen-Adad
Dorit Merhof
MedIm
ViT
82
190
0
18 Jul 2022
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
193
79
0
11 Jul 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
Wenqiang Zhang
Zilong Huang
Guozhong Luo
Tao Chen
Xinggang Wang
Wenyu Liu
Gang Yu
Chunhua Shen
ViT
82
206
0
12 Apr 2022
Deep Hierarchical Semantic Segmentation
Liulei Li
Tianfei Zhou
Wenguan Wang
Jianwu Li
Yi Yang
SSeg
529
134
0
27 Mar 2022
MPViT: Multi-Path Vision Transformer for Dense Prediction
Youngwan Lee
Jonghee Kim
Jeffrey Willette
Sung Ju Hwang
ViT
66
250
0
21 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chao-Yuan Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
144
689
0
02 Dec 2021
FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation
Guangwei Gao
Guoan Xu
Juncheng Li
Yi Yu
Huimin Lu
Jian Yang
62
77
0
02 Sep 2021
PVT v2: Improved Baselines with Pyramid Vision Transformer
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
AI4TS
89
1,661
0
25 Jun 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
252
5,017
0
31 May 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
136
1,907
0
29 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
421
21,347
0
25 Mar 2021
UNETR: Transformers for 3D Medical Image Segmentation
Ali Hatamizadeh
Yucheng Tang
Vishwesh Nath
Dong Yang
Andriy Myronenko
Bennett Landman
H. Roth
Daguang Xu
ViT
MedIm
164
1,603
0
18 Mar 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
188
2,897
0
31 Dec 2020
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
361
6,757
0
23 Dec 2020
LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation
Yu Wang
Quan Zhou
Jia-Wei Liu
Jian Xiong
Guangwei Gao
Xiaofu Wu
Longin Jan Latecki
SSeg
67
314
0
07 May 2019
DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation
Hanchao Li
Pengfei Xiong
Haoqiang Fan
Jian Sun
SSeg
73
566
0
03 Apr 2019
CCNet: Criss-Cross Attention for Semantic Segmentation
Zilong Huang
Xinggang Wang
Yunchao Wei
Lichao Huang
Humphrey Shi
Wenyu Liu
Chang Huang
VOS
204
2,544
0
28 Nov 2018
CBAM: Convolutional Block Attention Module
Sanghyun Woo
Jongchan Park
Joon-Young Lee
In So Kweon
208
16,528
0
17 Jul 2018
Non-local Neural Networks
Xiaolong Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
277
8,902
0
21 Nov 2017
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOS
SSeg
638
12,003
0
04 Dec 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
312
2,080
0
07 Jun 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
1.1K
11,606
0
06 Apr 2016
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
843
27,303
0
02 Dec 2015
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Vijay Badrinarayanan
Alex Kendall
R. Cipolla
SSeg
1.0K
15,795
0
02 Nov 2015