2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,408 papers shown
Title
Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Zekang Zhang
Guangyu Gao
Zhiyuan Fang
Jianbo Jiao
Yunchao Wei
CLL
67
31
0
13 Nov 2022
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection
Silvio Galesso
Max Argus
Thomas Brox
UQCV
96
11
0
12 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
180
699
0
10 Nov 2022
High-Quality Entity Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Tiancheng Shen
Jiuxiang Gu
Jiaya Jia
Zhe Lin
Ming-Hsuan Yang
ISeg
98
55
0
10 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
78
349
0
10 Nov 2022
Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Dominik Filipiak
Andrzej Zapala
Piotr Tempczyk
A. Fensel
Marek Cygan
ISeg
104
11
0
07 Nov 2022
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Luke Boegner
Garrett M. Vanhoy
Phillip Vallance
Manbir Gulati
Dresden Feitzinger
B. Comar
Rob Miller
AI4TS
51
6
0
04 Nov 2022
Could Giant Pretrained Image Models Extract Universal Representations?
Yutong Lin
Ze Liu
Zheng Zhang
Han Hu
Nanning Zheng
Stephen Lin
Yue Cao
VLM
106
9
0
03 Nov 2022
Layout Aware Inpainting for Automated Furniture Removal in Indoor Scenes
Prakhar Kulshreshtha
K. Lianos
Brian Pugh
Salma Jiddi
65
6
0
27 Oct 2022
Class Based Thresholding in Early Exit Semantic Segmentation Networks
Alperen Görmez
Erdem Koyuncu
79
5
0
27 Oct 2022
End-to-end Transformer for Compressed Video Quality Enhancement
Li Yu
Wenshuai Chang
Shiyu Wu
Moncef Gabbouj
ViT
69
9
0
25 Oct 2022
BARS: A Benchmark for Airport Runway Segmentation
Wenhui Chen
Zhijiang Zhang
Liang Yu
Yichun Tai
166
11
0
24 Oct 2022
DANLI: Deliberative Agent for Following Natural Language Instructions
Yichi Zhang
Jianing Yang
Jiayi Pan
Shane Storks
N. Devraj
Ziqiao Ma
Keunwoo Peter Yu
Yuwei Bao
J. Chai
LM&Ro
143
16
0
22 Oct 2022
Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
Laurynas Karazija
Subhabrata Choudhury
Iro Laina
Christian Rupprecht
Andrea Vedaldi
OCL
162
21
0
21 Oct 2022
A Tri-Layer Plugin to Improve Occluded Detection
Guanqi Zhan
Weidi Xie
Andrew Zisserman
75
20
0
18 Oct 2022
1st Place Solutions for the UVO Challenge 2022
Jiajun Zhang
Boyu Chen
Zhilong Ji
Jinfeng Bai
Zonghai Hu
86
1
0
18 Oct 2022
Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation
Yuanwei Liu
Nian Liu
Xiwen Yao
Junwei Han
65
63
0
13 Oct 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting-Li Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen
MLLM
121
104
0
12 Oct 2022
SegViT: Semantic Segmentation with Plain Vision Transformers
Bowen Zhang
Zhi Tian
Quan Tang
Xiangxiang Chu
Xiaolin K. Wei
Chunhua Shen
Yifan Liu
ViT
98
146
0
12 Oct 2022
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu
Yixing Lao
Li Jiang
Xihui Liu
Hengshuang Zhao
3DPC
ViT
168
407
0
11 Oct 2022
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Qian Zhang
Wenyu Liu
ISeg
86
48
0
11 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
175
177
0
10 Oct 2022
Humans need not label more humans: Occlusion Copy & Paste for Occluded Human Instance Segmentation
Evan Ling
De-Kai Huang
Minhoe Hur
116
5
0
07 Oct 2022
Leveraging Structure from Motion to Localize Inaccessible Bus Stops
Indu Panigrahi
Tom Bu
Christoph Mertz
23
0
0
07 Oct 2022
Mask3D: Mask Transformer for 3D Semantic Instance Segmentation
Jonas Schult
Francis Engelmann
Alexander Hermans
Or Litany
Siyu Tang
Bastian Leibe
ISeg
125
182
0
06 Oct 2022
SoccerNet 2022 Challenges Results
Silvio Giancola
A. Cioppa
Adrien Deliège
Floriane Magera
Vladimir Somers
...
Yingying Li
Yue He
Yujie Zhong
Zhenhua Guo
Zhiheng Li
91
31
0
05 Oct 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
Chen Liang
Wenguan Wang
Jiaxu Miao
Yi Yang
VLM
106
122
0
05 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
120
66
0
04 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng Zhang
Chao Zhang
Hanhua Hu
117
31
0
03 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying
Wenguan Wang
James Liang
Dongfang Liu
ISeg
95
49
0
03 Oct 2022
Learning Hierarchical Image Segmentation For Recognition and By Recognition
Tsung-Wei Ke
Sangwoo Mo
Stella X. Yu
VLM
140
11
0
01 Oct 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
114
73
0
29 Sep 2022
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents
Yao-Hung Hubert Tsai
Hanlin Goh
Ali Farhadi
Jian Zhang
59
1
0
27 Sep 2022
SAPA: Similarity-Aware Point Affiliation for Feature Upsampling
Hao Lu
Wenze Liu
Zixuan Ye
Hongtao Fu
Yuliang Liu
Zhiguo Cao
3DPC
106
37
0
26 Sep 2022
SOCRATES: A Stereo Camera Trap for Monitoring of Biodiversity
T. Haucke
H. Kühl
Volker Steinhage
90
11
0
19 Sep 2022
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
Meng-Hao Guo
Chenggang Lu
Qibin Hou
Zheng Liu
Ming-Ming Cheng
Shiyong Hu
SSeg
ViT
VLM
104
669
0
18 Sep 2022
SIGNet: Intrinsic Image Decomposition by a Semantic and Invariant Gradient Driven Network for Indoor Scenes
P. Das
Sezer Karaoglu
A. Gijsenij
Theo Gevers
80
4
0
30 Aug 2022
VMFormer: End-to-End Video Matting with Transformer
Jiacheng Li
Vidit Goel
Marianna Ohanyan
Shant Navasardyan
Yunchao Wei
Humphrey Shi
ViT
87
19
0
26 Aug 2022
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
136
48
0
24 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLM
VLM
ViT
157
646
0
22 Aug 2022
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Xizhe Xue
Dongdong Yu
Lingqiao Liu
Yu Liu
Satoshi Tsutsui
Ying Li
Zehuan Yuan
Ping Song
Mike Zheng Shou
ISeg
73
4
0
18 Aug 2022
Open-Vocabulary Universal Image Segmentation with MaskCLIP
Zheng Ding
Jieke Wang
Zhuowen Tu
CLIP
ISeg
VLM
106
90
0
18 Aug 2022
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
Bolin Lai
Miao Liu
Fiona Ryan
James M. Rehg
ViT
97
37
0
08 Aug 2022
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
77
12
0
08 Aug 2022
Occupancy Planes for Single-view RGB-D Human Reconstruction
Xiaoming Zhao
Yuan-Ting Hu
Zhongzheng Ren
Alex Schwing
3DH
60
9
0
04 Aug 2022
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
De-An Huang
Zhiding Yu
Anima Anandkumar
VLM
104
82
0
03 Aug 2022
Connection Reduction of DenseNet for Image Recognition
Ruikang Ju
Jen-Shiun Chiang
Chih-Chia Chen
Yu-Shian Lin
46
1
0
02 Aug 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
ViT
117
256
0
28 Jul 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
145
135
0
26 Jul 2022
DETRs with Hybrid Matching
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
69
199
0
26 Jul 2022