ResearchTrend.AI
Per-Pixel Classification is Not All You Need for Semantic Segmentation

13 July 2021
Bowen Cheng
Alex Schwing
Alexander Kirillov
    VLM
    ViT

Papers citing "Per-Pixel Classification is Not All You Need for Semantic Segmentation"

50 / 329 papers shown
Title
CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation
Yuejiao Su
Yi Wang
Lap-Pui Chau
70
1
0
08 Jul 2024
CPM: Class-conditional Prompting Machine for Audio-visual Segmentation
Yuanhong Chen
Chong Wang
Yuyuan Liu
Hu Wang
Gustavo Carneiro
50
2
0
07 Jul 2024
SAM Fewshot Finetuning for Anatomical Segmentation in Medical Images
Weiyi Xie
Nathalie Willems
Shubham Patil
Yang Li
Mayank Kumar
54
13
0
05 Jul 2024
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
Minghao Zhou
Hong Wang
Yefeng Zheng
Deyu Meng
33
1
0
02 Jul 2024
CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes
Danial Qashqai
Emad Mousavian
S. B. Shokouhi
S. Mirzakuchaki
56
0
0
01 Jul 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming-Hsuan Yang
Shuicheng Yan
Chen Change Loy
VLM
39
10
0
27 Jun 2024
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation
Ahmad Mohammadshirazi
Ali Nosrati Firoozsalari
Mengxi Zhou
Dheeraj Kulshrestha
R. Ramnath
38
0
0
25 Jun 2024
GMT: Guided Mask Transformer for Leaf Instance Segmentation
Feng Chen
Sotirios A. Tsaftaris
M. Giuffrida
30
1
0
24 Jun 2024
LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery
Xiaowen Ma
Rongrong Lian
Zhenkai Wu
Hongbo Guo
Mengting Ma
Sensen Wu
Zhenhong Du
Siyang Song
Wei Zhang
47
4
0
24 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
48
0
0
17 Jun 2024
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation
Bingfeng Zhang
Siyue Yu
Yunchao Wei
Yao Zhao
Jimin Xiao
VLM
44
8
0
17 Jun 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
42
0
0
14 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
80
12
0
09 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Lu Yang
Pu Cao
Liulei Li
Huadong Ma
51
1
0
06 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Fahad Shahbaz Khan
VLM
ISeg
83
6
0
04 Jun 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
63
2
0
30 May 2024
GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Xin Tan
Wenbin Wu
Zhiwei Zhang
Chaojie Fan
Yong Peng
Zhizhong Zhang
Yuan Xie
Lizhuang Ma
72
11
0
17 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
71
0
0
15 May 2024
Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement
Jinyoung Jun
Jae-Han Lee
Chang-Su Kim
40
2
0
30 Apr 2024
A Partial Replication of MaskFormer in TensorFlow on TPUs for the TensorFlow Model Garden
Vishal Purohit
Wenxin Jiang
Akshath R. Ravikiran
James C. Davis
42
1
0
29 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
56
0
0
23 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Pandeng Li
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
34
3
0
08 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
37
2
0
02 Apr 2024
Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation
Qi Bi
Shaodi You
Theo Gevers
55
1
0
29 Mar 2024
Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation
Chenhui Zhao
Liyue Shen
VLM
47
3
0
08 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
40
5
0
06 Mar 2024
End-to-End Human Instance Matting
Qinglin Liu
Shengping Zhang
Quanling Meng
Bineng Zhong
Peiqiang Liu
H. Yao
3DH
44
5
0
03 Mar 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
39
2
0
23 Feb 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
86
57
0
20 Feb 2024
ISCUTE: Instance Segmentation of Cables Using Text Embedding
Shir Kozlovsky
O. Joglekar
Dotan Di Castro
32
2
0
19 Feb 2024
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
Xu Hu
Yuxi Wang
Lue Fan
Junsong Fan
Junran Peng
Zhen Lei
Qing Li
Zhaoxiang Zhang
3DGS
47
8
0
31 Jan 2024
PACE: A Pragmatic Agent for Enhancing Communication Efficiency Using Large Language Models
Jiaxuan Li
Minxi Yang
Dahua Gao
Wenlong Xu
Guangming Shi
44
0
0
30 Jan 2024
Learning to Manipulate Artistic Images
Wei Guo
Yuqi Zhang
De Ma
Qian Zheng
41
0
0
25 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
39
14
0
25 Jan 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
46
11
0
18 Jan 2024
PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields
Zheng Chen
Qingan Yan
Huangying Zhan
Changjiang Cai
Xiangyu Xu
Yuzhong Huang
Weihan Wang
Ziyue Feng
Lantao Liu
Yi Tian Xu
3DV
64
3
0
30 Dec 2023
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving
Tianyu Li
Peijin Jia
Bangjun Wang
Li Chen
Kun Jiang
Junchi Yan
Hongyang Li
38
36
0
26 Dec 2023
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation
Sangyun Shin
Kaichen Zhou
M. Vankadari
Andrew Markham
Niki Trigoni
3DPC
38
8
0
18 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
44
18
0
13 Dec 2023
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie Yang
Yun Gu
51
1
0
13 Dec 2023
PEEKABOO: Interactive Video Generation via Masked-Diffusion
Yash Jain
Anshul Nasery
Vibhav Vineet
Harkirat Singh Behl
VGen
41
31
0
12 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
42
3
0
12 Dec 2023
Adaptive Human Trajectory Prediction via Latent Corridors
Neerja Thakkar
K. Mangalam
Andrea V. Bajcsy
Jitendra Malik
30
4
0
11 Dec 2023
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting
Yankai Jiang
Zhongzhen Huang
Rongzhao Zhang
Xiaofan Zhang
Shaoting Zhang
VLM
47
10
0
07 Dec 2023
GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models
Haicheng Liao
Huanming Shen
Zhenning Li
Chengyue Wang
Guofa Li
Yiming Bie
Chengzhong Xu
42
50
0
06 Dec 2023
Predicting Scores of Various Aesthetic Attribute Sets by Learning from Overall Score Labels
Heng Huang
Xin Jin
Yaqi Liu
Hao Lou
Chaoen Xiao
Shuai Cui
Xinning Li
Dongqing Zou
30
1
0
06 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
38
3
0
05 Dec 2023
Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
Yuan Wang
Naisong Luo
Tianzhu Zhang
43
11
0
29 Nov 2023
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
Thanh-Dat Truong
Utsav Prabhu
Bhiksha Raj
Jackson Cothren
Khoa Luu
CLL
48
3
0
27 Nov 2023
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Hao Li
Dingwen Zhang
Yalun Dai
Nian Liu
Lechao Cheng
Jingfeng Li
Jingdong Wang
Junwei Han
44
14
0
20 Nov 2023