ResearchTrend.AI
Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,408 papers shown
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
83
5
0
28 Dec 2022
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
96
59
0
22 Dec 2022
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM MLLM ObjD
124
259
0
21 Dec 2022
Weakly supervised training of universal visual concepts for multi-domain semantic segmentation
Petra Bevandić
Marin Orsic
Ivan Grubišić
Josip Saric
Sinisa Segvic
86
5
0
20 Dec 2022
Planning-oriented Autonomous Driving
Yi Hu
Jiazhi Yang
Li Chen
Keyu Li
Chonghao Sima
...
Xiaosong Jia
Qiang Liu
Jifeng Dai
Yu Qiao
Hongyang Li
96
664
0
20 Dec 2022
Panoptic Lifting for 3D Scene Understanding with Neural Fields
Yawar Siddiqui
Lorenzo Porzi
Samuel Rota Buló
Norman Muller
Matthias Nießner
Angela Dai
Peter Kontschieder
128
141
0
19 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
117
170
0
15 Dec 2022
QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query
Yabo Xiao
Kai Su
Xiaojuan Wang
Dongdong Yu
Lei Jin
Mingshu He
Zehuan Yuan
3DH
76
20
0
15 Dec 2022
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with Class-Aware Cross-Domain Transformers
R. Gong
Qin Wang
Dengxin Dai
Luc Van Gool
ViT
91
4
0
14 Dec 2022
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
100
42
0
13 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
Xinyu Wang
ViT
109
22
0
13 Dec 2022
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
119
4
0
13 Dec 2022
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation
K. Azarian
Debasmit Das
Hyojin Park
Fatih Porikli
3DH OOD
87
3
0
12 Dec 2022
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
Bo Yin
Xuying Zhang
Qibin Hou
Bo Sun
Deng-Ping Fan
Luc Van Gool
104
59
0
10 Dec 2022
RCDT: Relational Remote Sensing Change Detection with Transformer
Kaixuan Lu
Xiao Huang
ViT
40
9
0
09 Dec 2022
Towards Accurate Ground Plane Normal Estimation from Ego-Motion
Jiaxin Zhang
Wei Sui
Qian Zhang
Tao Chen
Cong Yang
60
5
0
08 Dec 2022
Latent Graph Representations for Critical View of Safety Assessment
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
138
24
0
08 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
123
29
0
07 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
97
176
0
07 Dec 2022
Framework-agnostic Semantically-aware Global Reasoning for Segmentation
Mir Rayat Imtiaz Hossain
Leonid Sigal
James J. Little
ViT
50
0
0
06 Dec 2022
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation
Lihua Fu
Haoyue Tian
Xiang Zhai
Pan Gao
Xiaojiang Peng
ViT
53
9
0
06 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
78
72
0
06 Dec 2022
Semantic-aware Message Broadcasting for Efficient Unsupervised Domain Adaptation
Xin Li
Cuiling Lan
Guoqiang Wei
Zhibo Chen
96
4
0
06 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM MLLM
163
262
0
05 Dec 2022
Mask Matching Transformer for Few-Shot Segmentation
Siyu Jiao
Gengwei Zhang
Shant Navasardyan
Ling-Hao Chen
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
80
29
0
05 Dec 2022
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Wentong Li
Wenyu Liu
Jianke Zhu
Miaomiao Cui
Risheng Yu
Xia Hua
Lei Zhang
ISeg
111
34
0
03 Dec 2022
3D Segmentation of Humans in Point Clouds with Synthetic Data
Ayca Takmaz
Jonas Schult
Irem Kaftan
Mertcan Akçay
Bastian Leibe
R. Sumner
Francis Engelmann
Siyu Tang
3DH
115
25
0
01 Dec 2022
Superpoint Transformer for 3D Scene Instance Segmentation
Jiahao Sun
Chunmei Qing
Junpeng Tan
Xiangmin Xu
3DPC
117
110
0
28 Nov 2022
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding
Favyen Bastani
Piper Wolters
Ritwik Gupta
Joe Ferdinando
Aniruddha Kembhavi
107
109
0
28 Nov 2022
Multi-Modal Few-Shot Temporal Action Detection
Sauradip Nag
Mengmeng Xu
Xiatian Zhu
Juan-Manuel Perez-Rua
Guohao Li
Yi-Zhe Song
Tao Xiang
VLM
66
6
0
27 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
122
154
0
27 Nov 2022
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
95
12
0
27 Nov 2022
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments
Britty Baby
Daksh Thapar
Mustafa Chasmai
Tamajit Banerjee
Kunal Dargan
A. Suri
Subhashis Banerjee
Chetan Arora
89
27
0
26 Nov 2022
Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation
Daoan Zhang
Chenming Li
Haoquan Li
Wen-Fong Huang
Lingyun Huang
Jianguo Zhang
89
20
0
26 Nov 2022
RbA: Segmenting Unknown Regions Rejected by All
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
99
47
0
25 Nov 2022
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
Fabio Cermelli
Matthieu Cord
Arthur Douillard
CLL VLM
97
22
0
25 Nov 2022
Aggregated Text Transformer for Scene Text Detection
Zhao Zhou
Xiangcheng Du
Yingbin Zheng
Cheng Jin
ViT
73
1
0
25 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
84
23
0
21 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming-Yuan Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
84
0
0
21 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
159
34
0
18 Nov 2022
Delving into Transformer for Incremental Semantic Segmentation
Zekai Xu
Mingying Zhang
Jiayue Hou
Xing Gong
Chuan Wen
Chengjie Wang
Junge Zhang
CLL
66
1
0
18 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Hongsheng Li
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
87
58
0
17 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
97
42
0
17 Nov 2022
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Yuang Zhang
Tiancai Wang
Xiangyu Zhang
VOT
86
139
0
17 Nov 2022
TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation
Zhongying Deng
Yanqi Chen
Lihao Liu
Shujun Wang
Rihuan Ke
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
92
3
0
17 Nov 2022
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
Jiaheng Liu
Tong He
Honghui Yang
Rui Su
Jiayi Tian
Junran Wu
Hongcheng Guo
Ke Xu
Wanli Ouyang
ISeg
72
15
0
17 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
89
9
0
16 Nov 2022
A Generalized Framework for Video Instance Segmentation
Miran Heo
Sukjun Hwang
Jeongseok Hyun
Hanju Kim
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VLM
100
43
0
16 Nov 2022
A Unified Mutual Supervision Framework for Referring Expression Segmentation and Generation
Shijia Huang
Feng Li
Hao Zhang
Siyi Liu
Lei Zhang
Liwei Wang
68
5
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM CLIP
251
730
0
14 Nov 2022