Per-Pixel Classification is Not All You Need for Semantic Segmentation


13 July 2021
Bowen Cheng
Alex Schwing
Alexander Kirillov
    VLM
    ViT

Papers citing "Per-Pixel Classification is Not All You Need for Semantic Segmentation"

50 / 329 papers shown
Shape-Guided Diffusion with Inside-Outside Attention
Dong Huk Park
Grace Luo
C. Toste
S. Azadi
Xihui Liu
M. Karalashvili
Anna Rohrbach
Trevor Darrell
DiffM
40
44
0
01 Dec 2022
NOPE-SAC: Neural One-Plane RANSAC for Sparse-View Planar 3D Reconstruction
Bin Tan
Nan Xue
Tianfu Wu
Guisong Xia
35
15
0
30 Nov 2022
Superpoint Transformer for 3D Scene Instance Segmentation
Jiahao Sun
Chunmei Qing
Junpeng Tan
Xiangmin Xu
3DPC
42
105
0
28 Nov 2022
Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries
Yuanwen Yue
Theodora Kontogianni
Konrad Schindler
Francis Engelmann
3DV
27
36
0
28 Nov 2022
FsaNet: Frequency Self-attention for Semantic Segmentation
Fengyu Zhang
Ashkan Panahi
Guangjun Gao
AI4TS
34
28
0
28 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
32
146
0
27 Nov 2022
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
40
11
0
27 Nov 2022
Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation
Daoan Zhang
Chenming Li
Haoquan Li
Wen-Fong Huang
Lingyun Huang
Jianguo Zhang
33
20
0
26 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
28
23
0
21 Nov 2022
Visual Programming: Compositional visual reasoning without training
Tanmay Gupta
Aniruddha Kembhavi
ReLM
VLM
LRM
94
406
0
18 Nov 2022
Delving into Transformer for Incremental Semantic Segmentation
Zekai Xu
Mingying Zhang
Jiayue Hou
Xing Gong
Chuan Wen
Chengjie Wang
Junge Zhang
CLL
32
1
0
18 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
30
107
0
17 Nov 2022
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
Jiaheng Liu
Tong He
Honghui Yang
Rui Su
Jiayi Tian
Junran Wu
Hongcheng Guo
Ke Xu
Wanli Ouyang
ISeg
31
14
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
24
18
0
16 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
31
9
0
16 Nov 2022
Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Zekang Zhang
Guangyu Gao
Zhiyuan Fang
Jianbo Jiao
Yunchao Wei
CLL
31
31
0
13 Nov 2022
Enhancing Few-shot Image Classification with Cosine Transformer
Quang-Huy Nguyen
Cuong Q. Nguyen
Dung D. Le
Hieu H. Pham
ViT
31
12
0
13 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
31
330
0
10 Nov 2022
Efficient Unsupervised Video Object Segmentation Network Based on Motion Guidance
Chao Hu
Liqiang Zhu
VOS
24
2
0
10 Nov 2022
Dynamic loss balancing and sequential enhancement for road-safety assessment and traffic scene classification
Marin Kavcan
Marko Sevrovic
Siniša Šegvić
29
1
0
08 Nov 2022
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Luke Boegner
Garrett M. Vanhoy
Phillip Vallance
Manbir Gulati
Dresden Feitzinger
B. Comar
Rob Miller
AI4TS
18
6
0
04 Nov 2022
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
Cheng Ma
Yang Liu
Jiankang Deng
Lingxi Xie
Weiming Dong
Changsheng Xu
VLM
VPVLM
45
44
0
04 Nov 2022
Pointly-Supervised Panoptic Segmentation
Junsong Fan
Zhaoxiang Zhang
Tieniu Tan
37
23
0
25 Oct 2022
Token-Label Alignment for Vision Transformers
Han Xiao
Wenzhao Zheng
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
26
4
0
12 Oct 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen
MLLM
43
102
0
12 Oct 2022
SaiT: Sparse Vision Transformers through Adaptive Token Pruning
Ling Li
D. Thorsley
Joseph Hassoun
ViT
27
17
0
11 Oct 2022
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu
Yixing Lao
Li Jiang
Xihui Liu
Hengshuang Zhao
3DPC
ViT
32
369
0
11 Oct 2022
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Qian Zhang
Wenyu Liu
ISeg
40
42
0
11 Oct 2022
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Feng Liang
Bichen Wu
Xiaoliang Dai
Kunpeng Li
Yinan Zhao
Hang Zhang
Peizhao Zhang
Peter Vajda
Diana Marculescu
CLIP
VLM
44
434
0
09 Oct 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
Chen Liang
Wenguan Wang
Jiaxu Miao
Yi Yang
VLM
46
117
0
05 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
44
60
0
04 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying
Wenguan Wang
James Liang
Dongfang Liu
ISeg
43
48
0
03 Oct 2022
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Xiwen Liang
Yangxin Wu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
30
31
0
19 Sep 2022
PointScatter: Point Set Representation for Tubular Structure Extraction
Dong Wang
Zhao Zhang
Zi-Long Zhao
Yuhang Liu
Yihong Chen
Liwei Wang
3DPC
47
10
0
13 Sep 2022
Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges
Sanjay Haresh
Xiaohao Sun
Hanxiao Jiang
Angel X. Chang
Manolis Savva
48
11
0
12 Sep 2022
Detecting Network-based Internet Censorship via Latent Feature Representation Learning
Shawn P. Duncan
Hui Chen
38
1
0
12 Sep 2022
SUNet: Scale-aware Unified Network for Panoptic Segmentation
Wei Yan
Yeqiang Qian
Chunxiang Wang
Ming Yang
SSeg
21
0
0
07 Sep 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
33
9
0
31 Aug 2022
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Xizhe Xue
Dongdong Yu
Lingqiao Liu
Yu Liu
Satoshi Tsutsui
Ying Li
Zehuan Yuan
Ping Song
Mike Zheng Shou
ISeg
30
4
0
18 Aug 2022
L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training
Jonghyun Bae
W. Baek
Tae Jun Ham
Jae W. Lee
28
1
0
18 Aug 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding
Zixiang Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Si Liu
25
12
0
11 Aug 2022
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
29
11
0
08 Aug 2022
Visual Recognition by Request
Chufeng Tang
Lingxi Xie
Xiaopeng Zhang
Xiaolin Hu
Qi Tian
VLM
16
15
0
28 Jul 2022
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Haowen Shi
Simon Reiß
Kunyu Peng
Chaoxiang Ma
Haodong Fu
Philip H. S. Torr
Kaiwei Wang
Rainer Stiefelhagen
ViT
MDE
41
36
0
25 Jul 2022
Panoptic Scene Graph Generation
Jingkang Yang
Yi Zhe Ang
Zujin Guo
Kaiyang Zhou
Wayne Zhang
Ziwei Liu
54
106
0
22 Jul 2022
Temporal Saliency Query Network for Efficient Video Recognition
Boyang Xia
Zhihao Wang
Wenhao Wu
Haoran Wang
Jungong Han
51
15
0
21 Jul 2022
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds
Weiwei Sun
Daniel Rebain
Renjie Liao
V. Tankovich
S. Yazdani
K. M. Yi
Andrea Tagliasacchi
3DPC
20
13
0
20 Jul 2022
Zero-Shot Temporal Action Detection via Vision-Language Prompting
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
VLM
33
65
0
17 Jul 2022
Online Video Instance Segmentation via Robust Context Fusion
Xiang Li
Jinglu Wang
Xiaohao Xu
Bhiksha Raj
Yan Lu
45
5
0
12 Jul 2022
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
38
54
0
12 Jul 2022