2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,408 papers shown
Title
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOS
MLLM
VLM
114
209
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
77
11
0
06 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
479
7,478
0
05 Apr 2023
Uncertainty estimation in Deep Learning for Panoptic segmentation
Michael J. Smith
F. Ferrie
OOD
UQCV
71
0
0
04 Apr 2023
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
Cong Han
Yujie Zhong
Dengjie Li
Kai Han
Lin Ma
VLM
SSeg
96
34
0
03 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
133
69
0
03 Apr 2023
Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization
Mingze Yuan
Yingda Xia
Hexin Dong
Zi Chen
Jiawen Yao
...
Bin Dong
Jing Zhou
Le Lu
Ling Zhang
Li Zhang
OOD
MedIm
57
23
0
01 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
214
48
0
30 Mar 2023
MobileInst: Video Instance Segmentation on the Mobile
Renhong Zhang
Tianheng Cheng
Shusheng Yang
Hao Jiang
Shuai Zhang
...
Xin Li
Xiaowen Ying
Dashan Gao
Wenyu Liu
Xinggang Wang
104
7
0
30 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffM
VLM
133
138
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yi Ding
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
69
8
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
76
23
0
30 Mar 2023
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Jie Qin
Jie Wu
Pengxiang Yan
Ming Li
Ren Yuxi
...
Yitong Wang
Rui Wang
Shilei Wen
X. Pan
Xingang Wang
SSeg
VLM
96
94
0
30 Mar 2023
Masked and Adaptive Transformer for Exemplar Based Image Translation
Changlong Jiang
Fei Gao
Biao Ma
Yuhao Lin
N. Wang
Gang Xu
84
18
0
30 Mar 2023
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
Finlay G. C. Hudson
W. Smith
ViT
128
1
0
30 Mar 2023
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video
Wenzheng Zeng
Yang Xiao
Sicheng Wei
Jinfang Gan
Xintao Zhang
Z. Cao
Zhiwen Fang
Qiufeng Wang
CVBM
51
11
0
28 Mar 2023
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
Zijian Zhou
Miaojing Shi
Holger Caesar
89
20
0
28 Mar 2023
Mask-Free Video Instance Segmentation
Lei Ke
Martin Danelljan
Henghui Ding
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
81
23
0
28 Mar 2023
OpenInst: A Simple Query-Based Method for Open-World Instance Segmentation
Cheng Wang
Guoli Wang
Qian Zhang
Pengning Guo
Wenyu Liu
Xinggang Wang
ISeg
VLM
67
7
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
157
98
0
27 Mar 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
108
60
0
26 Mar 2023
Affordance Grounding from Demonstration Video to Target Image
Joya Chen
Difei Gao
Kevin Qinghong Lin
Mike Zheng Shou
70
27
0
26 Mar 2023
BoxVIS: Video Instance Segmentation with Box Annotations
Minghan Li
Lei Zhang
ISeg
VOS
78
1
0
26 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
86
10
0
25 Mar 2023
OPDMulti: Openable Part Detection for Multiple Objects
Xiaohao Sun
Hanxiao Jiang
Manolis Savva
Angel X. Chang
AI4CE
75
17
0
24 Mar 2023
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon
Sangeek Hyun
S. Park
Dongchan Park
Jae-Pil Heo
ViT
107
115
0
24 Mar 2023
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning
Zhenyu Xie
Zaiyu Huang
Xin Dong
Fuwei Zhao
Haoye Dong
Xijin Zhang
Feida Zhu
Xiaodan Liang
3DH
83
100
0
24 Mar 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
107
17
0
24 Mar 2023
Category Query Learning for Human-Object Interaction Classification
Chi Xie
Fangao Zeng
Yue Hu
Shuang Liang
Yichen Wei
VLM
78
21
0
24 Mar 2023
Position-Guided Point Cloud Panoptic Segmentation Transformer
Zeqi Xiao
Wenwei Zhang
Tai Wang
Chen Change Loy
Dahua Lin
Jiangmiao Pang
ViT
3DPC
87
14
0
23 Mar 2023
Zero-guidance Segmentation Using Zero Segment Labels
Pitchaporn Rewatbowornwong
Nattanat Chatthee
Ekapol Chuangsuwanich
Supasorn Suwajanakorn
VLM
60
12
0
23 Mar 2023
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
Xiangtai Li
Haobo Yuan
Wenwei Zhang
Guangliang Cheng
Jiangmiao Pang
Chen Change Loy
ViT
VOS
107
21
0
22 Mar 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu
Yuzhong Zhao
Mike Zheng Shou
Hong Zhou
Chunhua Shen
139
148
0
21 Mar 2023
BoxSnake: Polygonal Instance Segmentation with Box Supervision
Rui Yang
Lin Song
Yixiao Ge
Xiu Li
ISeg
89
20
0
21 Mar 2023
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Ruiqi Wang
A. Patil
Fenggen Yu
Hao Zhang
64
2
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
127
289
0
20 Mar 2023
Open-vocabulary Panoptic Segmentation with Embedding Modulation
Xi Chen
Shuang Li
Ser-Nam Lim
Antonio Torralba
Hengshuang Zhao
VLM
84
34
0
20 Mar 2023
Generative Semantic Segmentation
Jia-Qing Chen
Jiachen Lu
Xiatian Zhu
Li Zhang
GAN
ISeg
VLM
79
40
0
20 Mar 2023
Reliability in Semantic Segmentation: Are We on the Right Track?
Pau de Jorge
Riccardo Volpi
Philip Torr
Grégory Rogez
UQCV
65
21
0
20 Mar 2023
Neural Refinement for Absolute Pose Regression with Feature Synthesis
Shuai Chen
Yash Bhalgat
Xinghui Li
Jiawang Bian
Kejie Li
Zirui Wang
V. Prisacariu
97
19
0
17 Mar 2023
LERF: Language Embedded Radiance Fields
Justin Kerr
Chung Min Kim
Ken Goldberg
Angjoo Kanazawa
Matthew Tancik
76
377
0
16 Mar 2023
MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation
Nicolás Ayobi
Alejandra Pérez-Rondón
Santiago Rodríguez
Pablo Arbelaez
MedIm
98
22
0
16 Mar 2023
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
Zhibo Yang
Sounak Mondal
Seoyoung Ahn
Ruoyu Xue
G. Zelinsky
Minh Hoai
Dimitris Samaras
56
12
0
16 Mar 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
Kunyang Han
Yong-Jin Liu
Jun Hao Liew
Henghui Ding
Yunchao Wei
...
Yitong Wang
Yansong Tang
Yujiu Yang
Jiashi Feng
Yao-Min Zhao
VLM
103
40
0
16 Mar 2023
RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters
Wenqi Ouyang
Yi Dong
Xiaoyang Kang
Peiran Ren
Xin Xu
Xuansong Xie
79
8
0
15 Mar 2023
High-level Feature Guided Decoding for Semantic Segmentation
Ye Huang
Di Kang
Shenghua Gao
Wen Li
Lixin Duan
121
0
0
15 Mar 2023
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
Junjie He
Pengyu Li
Yifeng Geng
Xuansong Xie
ISeg
VLM
77
53
0
15 Mar 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang
Feng Li
Xueyan Zou
Siyi Liu
Chun-yue Li
Jianfeng Gao
Jianwei Yang
Lei Zhang
ObjD
VLM
93
162
0
14 Mar 2023
LoG-CAN: local-global Class-aware Network for semantic segmentation of remote sensing images
Xiaowen Ma
Mengting Ma
Chenlu Hu
Zhiyuan Song
Zi-Shu Zhao
Tian Feng
Wei Zhang
100
13
0
14 Mar 2023