ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,408 papers shown
Title
On Efficient Training of Large-Scale Deep Learning Models: A Literature
  Review
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
SegGPT: Segmenting Everything In Context
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOSMLLMVLM
114
209
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot
  Keypoint Detection
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
77
11
0
06 Apr 2023
Segment Anything
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLMVLM
479
7,478
0
05 Apr 2023
Uncertainty estimation in Deep Learning for Panoptic segmentation
Uncertainty estimation in Deep Learning for Panoptic segmentation
Michael J. Smith
F. Ferrie
OODUQCV
71
0
0
04 Apr 2023
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
Cong Han
Yujie Zhong
Dengjie Li
Kai Han
Lin Ma
VLMSSeg
96
34
0
03 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World
  3D Scene Understanding
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
133
69
0
03 Apr 2023
Devil is in the Queries: Advancing Mask Transformers for Real-world
  Medical Image Segmentation and Out-of-Distribution Localization
Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization
Mingze Yuan
Yingda Xia
Hexin Dong
Zi Chen
Jiawen Yao
...
Bin Dong
Jing Zhou
Le Lu
Ling Zhang
Li Zhang
OODMedIm
57
23
0
01 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution
  Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
214
48
0
30 Mar 2023
MobileInst: Video Instance Segmentation on the Mobile
MobileInst: Video Instance Segmentation on the Mobile
Renhong Zhang
Tianheng Cheng
Shusheng Yang
Hao Jiang
Shuai Zhang
...
Xin Li
Xiaowen Ying
Dashan Gao
Wenyu Liu
Xinggang Wang
104
7
0
30 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffMVLM
133
138
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yi Ding
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
69
8
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
76
23
0
30 Mar 2023
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Jie Qin
Jie Wu
Pengxiang Yan
Ming Li
Ren Yuxi
...
Yitong Wang
Rui Wang
Shilei Wen
X. Pan
Xingang Wang
SSegVLM
96
94
0
30 Mar 2023
Masked and Adaptive Transformer for Exemplar Based Image Translation
Masked and Adaptive Transformer for Exemplar Based Image Translation
Changlong Jiang
Fei Gao
Biao Ma
Yuhao Lin
N. Wang
Gang Xu
84
18
0
30 Mar 2023
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
Finlay G. C. Hudson
W. Smith
ViT
128
1
0
30 Mar 2023
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed
  Video
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video
Wenzheng Zeng
Yang Xiao
Sicheng Wei
Jinfang Gan
Xintao Zhang
Z. Cao
Zhiwen Fang
Qiufeng Wang
CVBM
51
11
0
28 Mar 2023
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic
  Scene Graph Generation
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
Zijian Zhou
Miaojing Shi
Holger Caesar
89
20
0
28 Mar 2023
Mask-Free Video Instance Segmentation
Mask-Free Video Instance Segmentation
Lei Ke
Martin Danelljan
Henghui Ding
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
81
23
0
28 Mar 2023
OpenInst: A Simple Query-Based Method for Open-World Instance
  Segmentation
OpenInst: A Simple Query-Based Method for Open-World Instance Segmentation
Cheng Wang
Guoli Wang
Qian Zhang
Pengning Guo
Wenyu Liu
Xinggang Wang
ISegVLM
67
7
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based
  Real-time Mobile Vision Applications
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
157
98
0
27 Mar 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
108
60
0
26 Mar 2023
Affordance Grounding from Demonstration Video to Target Image
Affordance Grounding from Demonstration Video to Target Image
Joya Chen
Difei Gao
Kevin Qinghong Lin
Mike Zheng Shou
70
27
0
26 Mar 2023
BoxVIS: Video Instance Segmentation with Box Annotations
BoxVIS: Video Instance Segmentation with Box Annotations
Minghan Li
Lei Zhang
ISegVOS
78
1
0
26 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded
  Instances on Challenging Videos
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
86
10
0
25 Mar 2023
OPDMulti: Openable Part Detection for Multiple Objects
OPDMulti: Openable Part Detection for Multiple Objects
Xiaohao Sun
Hanxiao Jiang
Manolis Savva
Angel X. Chang
AI4CE
75
17
0
24 Mar 2023
Query-Dependent Video Representation for Moment Retrieval and Highlight
  Detection
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon
Sangeek Hyun
S. Park
Dongchan Park
Jae-Pil Heo
ViT
107
115
0
24 Mar 2023
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative
  Local-Flow Global-Parsing Learning
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning
Zhenyu Xie
Zaiyu Huang
Xin Dong
Fuwei Zhao
Haoye Dong
Xijin Zhang
Feida Zhu
Xiaodan Liang
3DH
83
100
0
24 Mar 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient
  Vision Transformers
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
107
17
0
24 Mar 2023
Category Query Learning for Human-Object Interaction Classification
Category Query Learning for Human-Object Interaction Classification
Chi Xie
Fangao Zeng
Yue Hu
Shuang Liang
Yichen Wei
VLM
78
21
0
24 Mar 2023
Position-Guided Point Cloud Panoptic Segmentation Transformer
Position-Guided Point Cloud Panoptic Segmentation Transformer
Zeqi Xiao
Wenwei Zhang
Tai Wang
Chen Change Loy
Dahua Lin
Jiangmiao Pang
ViT3DPC
87
14
0
23 Mar 2023
Zero-guidance Segmentation Using Zero Segment Labels
Zero-guidance Segmentation Using Zero Segment Labels
Pitchaporn Rewatbowornwong
Nattanat Chatthee
Ekapol Chuangsuwanich
Supasorn Suwajanakorn
VLM
60
12
0
23 Mar 2023
Tube-Link: A Flexible Cross Tube Framework for Universal Video
  Segmentation
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
Xiangtai Li
Haobo Yuan
Wenwei Zhang
Guangliang Cheng
Jiangmiao Pang
Chen Change Loy
ViTVOS
107
21
0
22 Mar 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic
  Segmentation Using Diffusion Models
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu
Yuzhong Zhao
Mike Zheng Shou
Hong Zhou
Chunhua Shen
139
148
0
21 Mar 2023
BoxSnake: Polygonal Instance Segmentation with Box Supervision
BoxSnake: Polygonal Instance Segmentation with Box Supervision
Rui Yang
Lin Song
Yixiao Ge
Xiu Li
ISeg
89
20
0
21 Mar 2023
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Ruiqi Wang
A. Patil
Fenggen Yu
Hao Zhang
64
2
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLMViTCLIP
127
289
0
20 Mar 2023
Open-vocabulary Panoptic Segmentation with Embedding Modulation
Open-vocabulary Panoptic Segmentation with Embedding Modulation
Xi Chen
Shuang Li
Ser-Nam Lim
Antonio Torralba
Hengshuang Zhao
VLM
84
34
0
20 Mar 2023
Generative Semantic Segmentation
Generative Semantic Segmentation
Jia-Qing Chen
Jiachen Lu
Xiatian Zhu
Li Zhang
GANISegVLM
79
40
0
20 Mar 2023
Reliability in Semantic Segmentation: Are We on the Right Track?
Reliability in Semantic Segmentation: Are We on the Right Track?
Pau de Jorge
Riccardo Volpi
Philip Torr
Grégory Rogez
UQCV
65
21
0
20 Mar 2023
Neural Refinement for Absolute Pose Regression with Feature Synthesis
Neural Refinement for Absolute Pose Regression with Feature Synthesis
Shuai Chen
Yash Bhalgat
Xinghui Li
Jiawang Bian
Kejie Li
Zirui Wang
V. Prisacariu
97
19
0
17 Mar 2023
LERF: Language Embedded Radiance Fields
LERF: Language Embedded Radiance Fields
Justin Kerr
Chung Min Kim
Ken Goldberg
Angjoo Kanazawa
Matthew Tancik
76
377
0
16 Mar 2023
MATIS: Masked-Attention Transformers for Surgical Instrument
  Segmentation
MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation
Nicolás Ayobi
Alejandra Pérez-Rondón
Santiago Rodríguez
Pablo Arbelaez
MedIm
98
22
0
16 Mar 2023
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
Zhibo Yang
Sounak Mondal
Seoyoung Ahn
Ruoyu Xue
G. Zelinsky
Minh Hoai
Dimitris Samaras
56
12
0
16 Mar 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
Kunyang Han
Yong-Jin Liu
Jun Hao Liew
Henghui Ding
Yunchao Wei
...
Yitong Wang
Yansong Tang
Yujiu Yang
Jiashi Feng
Yao-Min Zhao
VLM
103
40
0
16 Mar 2023
RSFNet: A White-Box Image Retouching Approach using Region-Specific
  Color Filters
RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters
Wenqi Ouyang
Yi Dong
Xiaoyang Kang
Peiran Ren
Xin Xu
Xuansong Xie
79
8
0
15 Mar 2023
High-level Feature Guided Decoding for Semantic Segmentation
High-level Feature Guided Decoding for Semantic Segmentation
Ye Huang
Di Kang
Shenghua Gao
Wen Li
Lixin Duan
121
0
0
15 Mar 2023
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation
Junjie He
Pengyu Li
Yifeng Geng
Xuansong Xie
ISegVLM
77
53
0
15 Mar 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang
Feng Li
Xueyan Zou
Siyi Liu
Chun-yue Li
Jianfeng Gao
Jianwei Yang
Lei Zhang
ObjDVLM
93
162
0
14 Mar 2023
LoG-CAN: local-global Class-aware Network for semantic segmentation of
  remote sensing images
LoG-CAN: local-global Class-aware Network for semantic segmentation of remote sensing images
Xiaowen Ma
Mengting Ma
Chenlu Hu
Zhiyuan Song
Zi-Shu Zhao
Tian Feng
Wei Zhang
100
13
0
14 Mar 2023
Previous
123...232425...272829
Next