ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXivPDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,365 papers shown
Title
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
Jia Ning
Chen Li
Zheng-Wei Zhang
Zigang Geng
Qi Dai
Kun He
Han Hu
56
44
0
05 Jan 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance
  Segmentation
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
42
17
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part
  Segmentation
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
42
21
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open
  Vocabulary Instance Segmentation
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
89
31
0
02 Jan 2023
Deep Learning Technique for Human Parsing: A Survey and Outlook
Deep Learning Technique for Human Parsing: A Survey and Outlook
Lu Yang
Wenhe Jia
Shane Li
Q. Song
ViT
48
17
0
01 Jan 2023
PanDepth: Joint Panoptic Segmentation and Depth Completion
PanDepth: Joint Panoptic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
3DPC
VLM
30
1
0
29 Dec 2022
Representation Separation for Semantic Segmentation with Vision
  Transformers
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
28
5
0
28 Dec 2022
Reversible Column Networks
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
31
53
0
22 Dec 2022
Generalized Decoding for Pixel, Image, and Language
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM
MLLM
ObjD
21
241
0
21 Dec 2022
Weakly supervised training of universal visual concepts for multi-domain
  semantic segmentation
Weakly supervised training of universal visual concepts for multi-domain semantic segmentation
Petra Bevandić
Marin Orsic
Ivan Grubišić
Josip Saric
Sinisa Segvic
31
5
0
20 Dec 2022
Planning-oriented Autonomous Driving
Planning-oriented Autonomous Driving
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
52
591
0
20 Dec 2022
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Buló
Norman Muller
Matthias Nießner
Angela Dai
Peter Kontschieder
42
128
0
19 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
35
159
0
15 Dec 2022
QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware
  Part-Level Query
QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query
Yabo Xiao
Kai Su
Xiaojuan Wang
Dongdong Yu
Lei Jin
Mingshu He
Zehuan Yuan
3DH
20
17
0
15 Dec 2022
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with
  Class-Aware Cross-Domain Transformers
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with Class-Aware Cross-Domain Transformers
R. Gong
Qin Wang
Dengxin Dai
Luc Van Gool
ViT
27
4
0
14 Dec 2022
Look Before You Match: Instance Understanding Matters in Video Object
  Segmentation
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
36
39
0
13 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group
  Propagation
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
Xueliang Wang
ViT
38
21
0
13 Dec 2022
OAMixer: Object-aware Mixing Layer for Vision Transformers
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
39
4
0
13 Dec 2022
Test-time Adaptation vs. Training-time Generalization: A Case Study in
  Human Instance Segmentation using Keypoints Estimation
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation
K. Azarian
Debasmit Das
Hyojin Park
Fatih Porikli
3DH
OOD
16
3
0
12 Dec 2022
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
Bo Yin
Xuying Zhang
Qibin Hou
Bo Sun
Deng-Ping Fan
Luc Van Gool
28
51
0
10 Dec 2022
RCDT: Relational Remote Sensing Change Detection with Transformer
RCDT: Relational Remote Sensing Change Detection with Transformer
Kaixuan Lu
Xiao Huang
ViT
22
8
0
09 Dec 2022
Towards Accurate Ground Plane Normal Estimation from Ego-Motion
Towards Accurate Ground Plane Normal Estimation from Ego-Motion
Jiaxin Zhang
Wei Sui
Qian Zhang
Tao Chen
Cong Yang
33
5
0
08 Dec 2022
Latent Graph Representations for Critical View of Safety Assessment
Latent Graph Representations for Critical View of Safety Assessment
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
23
19
0
08 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
34
27
0
07 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
38
168
0
07 Dec 2022
Framework-agnostic Semantically-aware Global Reasoning for Segmentation
Framework-agnostic Semantically-aware Global Reasoning for Segmentation
Mir Rayat Imtiaz Hossain
Leonid Sigal
James J. Little
ViT
27
0
0
06 Dec 2022
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for
  Semantic Segmentation
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation
Lihua Fu
Haoyue Tian
Xiang Zhai
Pan Gao
Xiaojiang Peng
ViT
27
9
0
06 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
14
66
0
06 Dec 2022
Semantic-aware Message Broadcasting for Efficient Unsupervised Domain
  Adaptation
Semantic-aware Message Broadcasting for Efficient Unsupervised Domain Adaptation
Xin Li
Cuiling Lan
Guoqiang Wei
Zhibo Chen
33
4
0
06 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual
  Learning
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM
MLLM
66
244
0
05 Dec 2022
Mask Matching Transformer for Few-Shot Segmentation
Mask Matching Transformer for Few-Shot Segmentation
Siyu Jiao
Gengwei Zhang
Shant Navasardyan
Ling-Hao Chen
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
37
28
0
05 Dec 2022
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Wentong Li
Wenyu Liu
Jianke Zhu
Miaomiao Cui
Risheng Yu
Xia Hua
Lei Zhang
ISeg
29
30
0
03 Dec 2022
3D Segmentation of Humans in Point Clouds with Synthetic Data
3D Segmentation of Humans in Point Clouds with Synthetic Data
Ayca Takmaz
Jonas Schult
Irem Kaftan
Mertcan Akccay
Bastian Leibe
R. Sumner
Francis Engelmann
Siyu Tang
3DH
27
23
0
01 Dec 2022
Superpoint Transformer for 3D Scene Instance Segmentation
Superpoint Transformer for 3D Scene Instance Segmentation
Jiahao Sun
Chunmei Qing
Junpeng Tan
Xiangmin Xu
3DPC
42
104
0
28 Nov 2022
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image
  Understanding
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding
Favyen Bastani
Piper Wolters
Ritwik Gupta
Joe Ferdinando
Aniruddha Kembhavi
35
99
0
28 Nov 2022
Multi-Modal Few-Shot Temporal Action Detection
Multi-Modal Few-Shot Temporal Action Detection
Sauradip Nag
Mengmeng Xu
Xiatian Zhu
Juan-Manuel Perez-Rua
Guohao Li
Yi-Zhe Song
Tao Xiang
VLM
30
6
0
27 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary
  Semantic Segmentation
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
32
144
0
27 Nov 2022
Prototype as Query for Few Shot Semantic Segmentation
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
32
11
0
27 Nov 2022
From Forks to Forceps: A New Framework for Instance Segmentation of
  Surgical Instruments
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments
Britty Baby
Daksh Thapar
Mustafa Chasmai
Tamajit Banerjee
Kunal Dargan
A. Suri
Subhashis Banerjee
Chetan Arora
18
26
0
26 Nov 2022
Rethinking Alignment and Uniformity in Unsupervised Image Semantic
  Segmentation
Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation
Daoan Zhang
Chenming Li
Haoquan Li
Wen-Fong Huang
Lingyun Huang
Jianguo Zhang
33
20
0
26 Nov 2022
RbA: Segmenting Unknown Regions Rejected by All
RbA: Segmenting Unknown Regions Rejected by All
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
19
46
0
25 Nov 2022
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
Fabio Cermelli
Matthieu Cord
Arthur Douillard
CLL
VLM
32
20
0
25 Nov 2022
Aggregated Text Transformer for Scene Text Detection
Aggregated Text Transformer for Scene Text Detection
Zhao Zhou
Xiangcheng Du
Yingbin Zheng
Cheng Jin
ViT
38
1
0
25 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
24
23
0
21 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
43
0
0
21 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards
  Linear-Angular Attention at Vision Transformer Inference
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
37
32
0
18 Nov 2022
Delving into Transformer for Incremental Semantic Segmentation
Delving into Transformer for Incremental Semantic Segmentation
Zekai Xu
Mingying Zhang
Jiayue Hou
Xing Gong
Chuan Wen
Chengjie Wang
Junge Zhang
CLL
24
1
0
18 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and
  Vision-Language Tasks
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Hongsheng Li
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
26
55
0
17 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual
  Information
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
42
41
0
17 Nov 2022
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained
  Object Detectors
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Yuang Zhang
Tiancai Wang
Xiangyu Zhang
VOT
33
129
0
17 Nov 2022
Previous
123...2425262728
Next