Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01527
Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,365 papers shown
Title
TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation
Zhongying Deng
Yanqi Chen
Lihao Liu
Shujun Wang
Rihuan Ke
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
11
3
0
17 Nov 2022
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
Jiaheng Liu
Tong He
Honghui Yang
Rui Su
Jiayi Tian
Junran Wu
Hongcheng Guo
Ke Xu
Wanli Ouyang
ISeg
26
14
0
17 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
29
9
0
16 Nov 2022
A Generalized Framework for Video Instance Segmentation
Miran Heo
Sukjun Hwang
Jeongseok Hyun
Hanju Kim
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VLM
25
41
0
16 Nov 2022
A Unified Mutual Supervision Framework for Referring Expression Segmentation and Generation
Shijia Huang
Feng Li
Hao Zhang
Siyi Liu
Lei Zhang
Liwei Wang
30
5
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
87
679
0
14 Nov 2022
Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Zekang Zhang
Guangyu Gao
Zhiyuan Fang
Jianbo Jiao
Yunchao Wei
CLL
26
31
0
13 Nov 2022
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection
Silvio Galesso
Max Argus
Thomas Brox
UQCV
33
11
0
12 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
38
660
0
10 Nov 2022
High-Quality Entity Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Tiancheng Shen
Jiuxiang Gu
Jiaya Jia
Zhe-nan Lin
Ming-Hsuan Yang
ISeg
29
51
0
10 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
31
327
0
10 Nov 2022
Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Dominik Filipiak
Andrzej Zapala
Piotr Tempczyk
A. Fensel
Marek Cygan
ISeg
21
10
0
07 Nov 2022
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Luke Boegner
Garrett M. Vanhoy
Phillip Vallance
Manbir Gulati
Dresden Feitzinger
B. Comar
Rob Miller
AI4TS
18
6
0
04 Nov 2022
Could Giant Pretrained Image Models Extract Universal Representations?
Yutong Lin
Ze Liu
Zheng-Wei Zhang
Han Hu
Nanning Zheng
Stephen Lin
Yue Cao
VLM
54
9
0
03 Nov 2022
Layout Aware Inpainting for Automated Furniture Removal in Indoor Scenes
Prakhar Kulshreshtha
K. Lianos
Brian Pugh
Salma Jiddi
40
6
0
27 Oct 2022
Class Based Thresholding in Early Exit Semantic Segmentation Networks
Alperen Görmez
Erdem Koyuncu
23
5
0
27 Oct 2022
End-to-end Transformer for Compressed Video Quality Enhancement
Li Yu
Wenshuai Chang
Shiyu Wu
Moncef Gabbouj
ViT
26
8
0
25 Oct 2022
BARS: A Benchmark for Airport Runway Segmentation
Wenhui Chen
Zhijiang Zhang
Liang Yu
Yichun Tai
19
11
0
24 Oct 2022
DANLI: Deliberative Agent for Following Natural Language Instructions
Yichi Zhang
Jianing Yang
Jiayi Pan
Shane Storks
N. Devraj
Ziqiao Ma
Keunwoo Peter Yu
Yuwei Bao
J. Chai
LM&Ro
58
16
0
22 Oct 2022
Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
Laurynas Karazija
Subhabrata Choudhury
Iro Laina
Christian Rupprecht
Andrea Vedaldi
OCL
108
21
0
21 Oct 2022
A Tri-Layer Plugin to Improve Occluded Detection
Guanqi Zhan
Weidi Xie
Andrew Zisserman
24
20
0
18 Oct 2022
1st Place Solutions for the UVO Challenge 2022
Jiajun Zhang
Boyu Chen
Zhilong Ji
Jinfeng Bai
Zonghai Hu
36
1
0
18 Oct 2022
Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation
Yuanwei Liu
Nian Liu
Xiwen Yao
Junwei Han
31
61
0
13 Oct 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting-Li Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen
MLLM
43
102
0
12 Oct 2022
SegViT: Semantic Segmentation with Plain Vision Transformers
Bowen Zhang
Zhi Tian
Quan Tang
Xiangxiang Chu
Xiaolin K. Wei
Chunhua Shen
Yifan Liu
ViT
21
134
0
12 Oct 2022
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu
Yixing Lao
Li Jiang
Xihui Liu
Hengshuang Zhao
3DPC
ViT
32
367
0
11 Oct 2022
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Qian Zhang
Wenyu Liu
ISeg
35
42
0
11 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
29
167
0
10 Oct 2022
Humans need not label more humans: Occlusion Copy & Paste for Occluded Human Instance Segmentation
Evan Ling
De-Kai Huang
Minhoe Hur
27
5
0
07 Oct 2022
Leveraging Structure from Motion to Localize Inaccessible Bus Stops
Indu Panigrahi
Tom Bu
Christoph Mertz
14
0
0
07 Oct 2022
Mask3D: Mask Transformer for 3D Semantic Instance Segmentation
Jonas Schult
Francis Engelmann
Alexander Hermans
Or Litany
Siyu Tang
Bastian Leibe
ISeg
50
165
0
06 Oct 2022
SoccerNet 2022 Challenges Results
Silvio Giancola
A. Cioppa
A. Deliège
Floriane Magera
Vladimir Somers
...
Yingying Li
Yue He
Yujie Zhong
Zhenhua Guo
Zhiheng Li
30
30
0
05 Oct 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
Chen Liang
Wenguan Wang
Jiaxu Miao
Yi Yang
VLM
39
117
0
05 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
39
59
0
04 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
35
25
0
03 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying
Wenguan Wang
James Liang
Dongfang Liu
ISeg
43
48
0
03 Oct 2022
Learning Hierarchical Image Segmentation For Recognition and By Recognition
Tsung-Wei Ke
Sangwoo Mo
Stella X. Yu
VLM
37
9
0
01 Oct 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
33
68
0
29 Sep 2022
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents
Yao-Hung Hubert Tsai
Hanlin Goh
Ali Farhadi
Jian Zhang
27
1
0
27 Sep 2022
SAPA: Similarity-Aware Point Affiliation for Feature Upsampling
Hao Lu
Wenze Liu
Zixuan Ye
Hongtao Fu
Yuliang Liu
Zhiguo Cao
3DPC
30
31
0
26 Sep 2022
SOCRATES: A Stereo Camera Trap for Monitoring of Biodiversity
T. Haucke
H. Kühl
Volker Steinhage
39
11
0
19 Sep 2022
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
Meng-Hao Guo
Chenggang Lu
Qibin Hou
Zheng Liu
Ming-Ming Cheng
Shiyong Hu
SSeg
ViT
VLM
26
608
0
18 Sep 2022
SIGNet: Intrinsic Image Decomposition by a Semantic and Invariant Gradient Driven Network for Indoor Scenes
P. Das
Sezer Karaoglu
A. Gijsenij
Theo Gevers
27
4
0
30 Aug 2022
VMFormer: End-to-End Video Matting with Transformer
Jiacheng Li
Vidit Goel
Marianna Ohanyan
Shant Navasardyan
Yunchao Wei
Humphrey Shi
ViT
33
18
0
26 Aug 2022
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
58
46
0
24 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLM
VLM
ViT
54
629
0
22 Aug 2022
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Xizhe Xue
Dongdong Yu
Lingqiao Liu
Yu Liu
Satoshi Tsutsui
Ying Li
Zehuan Yuan
Ping Song
Mike Zheng Shou
ISeg
27
4
0
18 Aug 2022
Open-Vocabulary Universal Image Segmentation with MaskCLIP
Zheng Ding
Jieke Wang
Z. Tu
CLIP
ISeg
VLM
52
86
0
18 Aug 2022
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
Bolin Lai
Miao Liu
Fiona Ryan
James M. Rehg
ViT
40
33
0
08 Aug 2022
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
27
11
0
08 Aug 2022
Previous
1
2
3
...
25
26
27
28
Next