ResearchTrend.AI

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,365 papers shown
Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization
Changtao Miao
Qi Chu
Tao Gong
Zhentao Tan
Zhenchao Jin
Wanyi Zhuang
Man Luo
Honggang Hu
Nenghai Yu
CVBM
54
1
0
05 Aug 2024
Unsupervised Domain Adaption Harnessing Vision-Language Pre-training
Wenlve Zhou
Zhiheng Zhou
VLM
38
33
0
05 Aug 2024
AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Zili Wang
Qi Yang
Linsu Shi
Jiazhong Yu
M. Tanveer
Fei Li
Shiming Xiang
VOS
32
1
0
03 Aug 2024
WAS: Dataset and Methods for Artistic Text Segmentation
Xudong Xie
Yuzhe Li
Yang Liu
Zhifei Zhang
Zhaowen Wang
Wei Xiong
Xiang Bai
DiffM
52
2
0
31 Jul 2024
Open-Vocabulary Audio-Visual Semantic Segmentation
Zhenghao Zhang
Junchao Liao
Dantong Niu
Yanyu Qi
Menghao Li
Ji Shi
Bowei Xing
Xianghua Ying
VOS
VLM
40
7
0
31 Jul 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
49
3
0
31 Jul 2024
RoadFormer+: Delivering RGB-X Scene Parsing through Scale-Aware Information Decoupling and Advanced Heterogeneous Feature Fusion
Jianxin Huang
Jiahang Li
Ning Jia
Yuxiang Sun
Chengju Liu
Qijun Chen
Rui Fan
ViT
54
8
0
31 Jul 2024
CAMAv2: A Vision-Centric Approach for Static Map Element Annotation
Shiyuan Chen
Jiaxin Zhang
Ruohong Mei
Yingfeng Cai
Haoran Yin
Tao Chen
Wei Sui
Cong Yang
33
0
0
31 Jul 2024
NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
Hongjia Zhai
Gan Huang
Qirui Hu
Guanglin Li
Hujun Bao
Guofeng Zhang
3DGS
45
13
0
30 Jul 2024
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets
Muhammad Abdullah Jamal
Omid Mohareri
44
1
0
29 Jul 2024
MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability
Buyu Liu
Kai Wang
Yansong Liu
Jun Bao
Tingting Han
Jun Yu
DiffM
37
3
0
28 Jul 2024
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding
Zhen Chen
Zongmin Zhang
Wenwu Guo
Xingjian Luo
Long Bai
Jinlin Wu
Hongliang Ren
Hongbin Liu
41
5
0
28 Jul 2024
Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach
Penghui Wen
Kun Hu
Dong Yuan
Zhiyuan Ning
ChangYang Li
Zhiyong Wang
31
0
0
27 Jul 2024
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Zhijian Liu
Zhuoyang Zhang
Samir Khaki
Shang Yang
Haotian Tang
Chenfeng Xu
Kurt Keutzer
Song Han
SSeg
51
1
0
26 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
53
1
0
26 Jul 2024
Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation
Jingjun Yi
Qi Bi
Hao Zheng
Haolan Zhan
Wei Ji
Yawen Huang
Yuexiang Li
Yefeng Zheng
36
8
0
26 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
33
5
0
26 Jul 2024
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
Shuting He
Henghui Ding
61
10
0
25 Jul 2024
DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability
Florent Brondolo
Samuel Beaussant
AI4CE
26
0
0
25 Jul 2024
CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Xiao Liu
Peng Gao
Tao Yu
Fei Wang
Ruyue Yuan
MedIm
ViT
39
14
0
25 Jul 2024
TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework
Guanfeng Tang
Zhiyuan Wu
Jiahang Li
Ping Zhong
Xieyuanli Chen
Huiming Liu
Rui Fan
50
0
0
25 Jul 2024
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
30
3
0
24 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
41
1
0
23 Jul 2024
DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene
Xi Shi
Lingli Chen
Peng Wei
Xi Wu
Tian Jiang
Yonggang Luo
Lecheng Xie
3DGS
37
4
0
23 Jul 2024
Strike a Balance in Continual Panoptic Segmentation
Jinpeng Chen
Runmin Cong
Yuxuan Luo
H. Ip
Sam Kwong
48
4
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
68
1
0
23 Jul 2024
Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond
Silvio Galesso
Philipp Schroppel
Hssan Driss
Thomas Brox
31
2
0
22 Jul 2024
RoadPainter: Points Are Ideal Navigators for Topology transformER
Zhongxing Ma
Shuang Liang
Yongkun Wen
Weixin Lu
Guowei Wan
ViT
3DPC
33
6
0
22 Jul 2024
Advancing Chart Question Answering with Robust Chart Component Recognition
Hanwen Zheng
Sijia Wang
Chris Thomas
Lifu Huang
43
1
0
19 Jul 2024
Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation
Zhengyuan Xie
Haiquan Lu
Jia-Wen Xiao
Enguang Wang
Le Zhang
Xialei Liu
CLL
33
2
0
19 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
62
4
0
18 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
61
7
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
48
2
0
18 Jul 2024
LIDIA: Precise Liver Tumor Diagnosis on Multi-Phase Contrast-Enhanced CT via Iterative Fusion and Asymmetric Contrastive Learning
Wei Huang
Wei Liu
Xiaoming Zhang
Xiaoli Yin
Xu Han
...
Yu Shi
Le Lu
Ling Zhang
Lei Zhang
Ke Yan
29
0
0
18 Jul 2024
Tree semantic segmentation from aerial image time series
Venkatesh Ramesh
Arthur Ouaknine
David Rolnick
34
0
0
18 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
59
0
0
18 Jul 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
77
2
0
18 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
40
0
0
17 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
81
0
0
17 Jul 2024
Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation
Juncheng Ma
Peiwen Sun
Yaoting Wang
Di Hu
VOS
52
7
0
16 Jul 2024
OAM-TCD: A globally diverse dataset of high-resolution tree cover maps
Josh Veitch-Michaelis
Andrew Cottam
Daniella Schweizer
Eben N. Broadbent
David Dao
Ce Zhang
Angélica María Almeyda Zambrano
Simeon Max
35
1
0
16 Jul 2024
SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation
Lei Yao
Yi Wang
Moyun Liu
Lap-Pui Chau
39
0
0
16 Jul 2024
Cross-Phase Mutual Learning Framework for Pulmonary Embolism Identification on Non-Contrast CT Scans
Bizhe Bai
Yan-Jie Zhou
Yujian Hu
Tony C. W. Mok
Yi-lang Xiang
Le Lu
Hongkun Zhang
Minfeng Xu
40
0
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
35
3
0
16 Jul 2024
SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge
Hao Ding
Tuxun Lu
Yuqian Zhang
Ruixing Liang
Hongchao Shu
...
Bo Wang
Marcos Fernández-Rodríguez
Estevao Lima
João L. Vilaça
Mathias Unberath
63
4
0
16 Jul 2024
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
Zijian Zhou
Zheng Zhu
Holger Caesar
Miaojing Shi
VLM
33
2
0
15 Jul 2024
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
Yaoting Wang
Peiwen Sun
Dongzhan Zhou
Guangyao Li
Honggang Zhang
Di Hu
VOS
49
5
0
15 Jul 2024
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang
Peiwen Sun
Yuanchao Li
Honggang Zhang
Di Hu
46
5
0
15 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
52
0
0
15 Jul 2024
PolyRoom: Room-aware Transformer for Floorplan Reconstruction
Yuzhou Liu
Lingjie Zhu
Xiaodong Ma
Hanqiao Ye
Xiang Gao
Xianwei Zheng
Shuhan Shen
28
1
0
15 Jul 2024