Per-Pixel Classification is Not All You Need for Semantic Segmentation

13 July 2021
Bowen Cheng
Alex Schwing
Alexander Kirillov
    VLM
    ViT

Papers citing "Per-Pixel Classification is Not All You Need for Semantic Segmentation"

29 / 329 papers shown
Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects
Atsuhiro Noguchi
Umar Iqbal
Jonathan Tremblay
Tatsuya Harada
Orazio Gallo
49
47
0
21 Dec 2021
Lite Vision Transformer with Enhanced Self-Attention
Chenglin Yang
Yilin Wang
Jianming Zhang
He Zhang
Zijun Wei
Zhe Lin
Alan Yuille
ViT
21
114
0
20 Dec 2021
Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
Yi Zhou
Hui Zhang
Hana Lee
Shuyang Sun
Pingjun Li
Yangguang Zhu
ByungIn Yoo
Xiaojuan Qi
Jae-Joon Han
VOS
43
26
0
16 Dec 2021
Decoupling Zero-Shot Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Dengxin Dai
VLM
56
190
0
15 Dec 2021
5th Place Solution for VSPW 2021 Challenge
Jiafan Zhuang
Y. Zhang
Xinyu Hu
Jianing Li
Zilei Wang
15
0
0
13 Dec 2021
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
Rui Dai
Srijan Das
Kumara Kahatapitiya
Michael S. Ryoo
Francois Bremond
ViT
42
73
0
07 Dec 2021
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Haobo Yuan
Xiangtai Li
Yibo Yang
Guangliang Cheng
Jing Zhang
Yunhai Tong
Lefei Zhang
Dacheng Tao
MDE
49
42
0
05 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
144
2,281
0
02 Dec 2021
Adaptive Token Sampling For Efficient Vision Transformers
Mohsen Fayyaz
Soroush Abbasi Koohpayegani
F. Jafari
Sunando Sengupta
Hamid Reza Vaezi Joze
Eric Sommerlade
Hamed Pirsiavash
Juergen Gall
ViT
16
148
0
30 Nov 2021
Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
Yucheng Tang
Dong Yang
Wenqi Li
H. Roth
Bennett Landman
Daguang Xu
V. Nath
Ali Hatamizadeh
ViT
MedIm
42
520
0
29 Nov 2021
High Quality Segmentation for Ultra High-resolution Images
Tiancheng Shen
Yuechen Zhang
Lu Qi
Jason Kuen
Xingyu Xie
Jianlong Wu
Zhe Lin
Jiaya Jia
81
43
0
29 Nov 2021
Pruning Self-attentions into Convolutional Layers in Single Path
Haoyu He
Jianfei Cai
Jing Liu
Zizheng Pan
Jing Zhang
Dacheng Tao
Bohan Zhuang
ViT
34
40
0
23 Nov 2021
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
Bichen Wu
Chaojian Li
Hang Zhang
Xiaoliang Dai
Peizhao Zhang
Matthew Yu
Jialiang Wang
Yingyan Lin
Peter Vajda
ViT
33
23
0
19 Nov 2021
Swin Transformer V2: Scaling Up Capacity and Resolution
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
...
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
ViT
94
1,761
0
18 Nov 2021
Multimodal Virtual Point 3D Detection
Tianwei Yin
Xingyi Zhou
Philipp Krahenbuhl
3DPC
160
245
0
12 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
79
332
0
11 Nov 2021
Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Jiaqi Gu
Hyoukjun Kwon
Dilin Wang
Wei Ye
Meng Li
Yu-Hsin Chen
Liangzhen Lai
Vikas Chandra
David Z. Pan
ViT
29
184
0
01 Nov 2021
Weak Novel Categories without Tears: A Survey on Weak-Shot Learning
Li Niu
VLM
OffRL
21
2
0
06 Oct 2021
Multi-Task Self-Training for Learning General Representations
Golnaz Ghiasi
Barret Zoph
E. D. Cubuk
Quoc V. Le
Nayeon Lee
SSL
24
100
0
25 Aug 2021
Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
46
69
0
20 Aug 2021
FaPN: Feature-aligned Pyramid Network for Dense Image Prediction
Shihua Huang
Zhichao Lu
Ran Cheng
Cheng He
15
203
0
16 Aug 2021
Open-World Entity Segmentation
Lu Qi
Jason Kuen
Yi Wang
Jiuxiang Gu
Hengshuang Zhao
Zhe Lin
Philip Torr
Jiaya Jia
OCL
SSeg
VLM
42
80
0
29 Jul 2021
A Unified Efficient Pyramid Transformer for Semantic Segmentation
Fangrui Zhu
Yi Zhu
Li Zhang
Chongruo Wu
Yanwei Fu
Mu Li
ViT
29
29
0
29 Jul 2021
Global Filter Networks for Image Classification
Yongming Rao
Wenliang Zhao
Zheng Zhu
Jiwen Lu
Jie Zhou
ViT
28
452
0
01 Jul 2021
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
32
358
0
28 Jun 2021
DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
Yongming Rao
Wenliang Zhao
Benlin Liu
Jiwen Lu
Jie Zhou
Cho-Jui Hsieh
ViT
34
670
0
03 Jun 2021
GANav: Efficient Terrain Segmentation for Robot Navigation in Unstructured Outdoor Environments
Tianrui Guan
D. Kothandaraman
Rohan Chandra
A. Sathyamoorthy
K. Weerakoon
Tianyi Zhou
36
106
0
07 Mar 2021
Conditional Convolutions for Instance Segmentation
Zhi Tian
Chunhua Shen
Hao Chen
ISeg
196
599
0
12 Mar 2020
A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images
Irem Ülkü
Erdem Akagündüz
SSeg
42
189
0
21 Dec 2019