ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXivPDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,365 papers shown
Title
Occupancy Planes for Single-view RGB-D Human Reconstruction
Occupancy Planes for Single-view RGB-D Human Reconstruction
Xiaoming Zhao
Yuan-Ting Hu
Zhongzheng Ren
A. Schwing
3DH
31
9
0
04 Aug 2022
MinVIS: A Minimal Video Instance Segmentation Framework without
  Video-based Training
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
De-An Huang
Zhiding Yu
Anima Anandkumar
VLM
48
78
0
03 Aug 2022
Connection Reduction of DenseNet for Image Recognition
Connection Reduction of DenseNet for Image Recognition
Ruikang Ju
Jen-Shiun Chiang
Chih-Chia Chen
Yu-Shian Lin
29
1
0
02 Aug 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated
  Convolutions
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
ViT
22
251
0
28 Jul 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
49
120
0
26 Jul 2022
DETRs with Hybrid Matching
DETRs with Hybrid Matching
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
26
182
0
26 Jul 2022
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision
  Transformers for Panoramic Semantic Segmentation
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Haowen Shi
Simon Reiß
Kunyu Peng
Chaoxiang Ma
Haodong Fu
Philip H. S. Torr
Kaiwei Wang
Rainer Stiefelhagen
ViT
MDE
36
36
0
25 Jul 2022
Active Pointly-Supervised Instance Segmentation
Active Pointly-Supervised Instance Segmentation
Chufeng Tang
Lingxi Xie
Gang Zhang
Xiaopeng Zhang
Qi Tian
Xiaolin Hu
ISeg
40
15
0
23 Jul 2022
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation
  on Point Clouds
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds
Weiwei Sun
Daniel Rebain
Renjie Liao
V. Tankovich
S. Yazdani
K. M. Yi
Andrea Tagliasacchi
3DPC
18
13
0
20 Jul 2022
Tracking Objects as Pixel-wise Distributions
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
31
54
0
12 Jul 2022
kMaX-DeepLab: k-means Mask Transformer
kMaX-DeepLab: k-means Mask Transformer
Qihang Yu
Huiyu Wang
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
40
18
0
08 Jul 2022
Dual Decision Improves Open-Set Panoptic Segmentation
Dual Decision Improves Open-Set Panoptic Segmentation
Hainan Xu
Hao Chen
Lingqiao Liu
Yufei Yin
VLM
24
6
0
06 Jul 2022
UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer
  via Hierarchical Mask Calibration
UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration
Jingyi Zhang
Jiaxing Huang
Xiaoqin Zhang
Shijian Lu
21
7
0
30 Jun 2022
The Third Place Solution for CVPR2022 AVA Accessibility Vision and
  Autonomy Challenge
The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge
Bo Yan
Leilei Cao
Zhuang Li
Hongbin Wang
32
0
0
28 Jun 2022
MaskRange: A Mask-classification Model for Range-view based LiDAR
  Segmentation
MaskRange: A Mask-classification Model for Range-view based LiDAR Segmentation
Yinjuan Gu
Yuming Huang
Chengzhong Xu
Hui Kong
ISeg
VLM
3DPC
30
10
0
24 Jun 2022
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Shangchen Zhou
Kelvin C. K. Chan
Chongyi Li
Chen Change Loy
CVBM
21
220
0
22 Jun 2022
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for
  Mobile Agents via Unsupervised Contrastive Learning
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning
A. Jaus
Kailun Yang
Rainer Stiefelhagen
41
17
0
21 Jun 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary
  Algorithm
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Yibo Yang
Yong Liu
Dacheng Tao
ViT
34
32
0
19 Jun 2022
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
Jaehyuk Heo
YongGi Jeong
Sunwoo Kim
Jaehee Kim
Pilsung Kang
18
0
0
18 Jun 2022
Forecasting of depth and ego-motion with transformers and
  self-supervision
Forecasting of depth and ego-motion with transformers and self-supervision
Houssem-eddine Boulahbal
A. Voicila
Andrew I. Comport
ViT
MDE
27
3
0
15 Jun 2022
Consistent Video Instance Segmentation with Inter-Frame Recurrent
  Attention
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention
Quanzeng You
Jiang Wang
Peng Chu
Andre Abrantes
Zicheng Liu
VOS
27
1
0
14 Jun 2022
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional
  MoEs
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
Jinguo Zhu
Xizhou Zhu
Wenhai Wang
Xiaohua Wang
Hongsheng Li
Xiaogang Wang
Jifeng Dai
MoMe
MoE
26
66
0
09 Jun 2022
VITA: Video Instance Segmentation via Object Token Association
VITA: Video Instance Segmentation via Object Token Association
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VOS
23
88
0
09 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object
  Detection and Segmentation
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
59
367
0
06 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
EfficientFormer: Vision Transformers at MobileNet Speed
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
23
347
0
02 Jun 2022
Differentiable Soft-Masked Attention
Differentiable Soft-Masked Attention
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
17
0
0
01 Jun 2022
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense
  Prediction
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
Han Cai
Junyan Li
Muyan Hu
Chuang Gan
Song Han
34
49
0
29 May 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Bruno Sauvalle
A. de La Fortelle
3DPC
52
12
0
26 May 2022
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
Alexander Kolesnikov
André Susano Pinto
Lucas Beyer
Xiaohua Zhai
Jeremiah Harmsen
N. Houlsby
103
67
0
20 May 2022
HCFormer: Unified Image Segmentation with Hierarchical Clustering
HCFormer: Unified Image Segmentation with Hierarchical Clustering
Teppei Suzuki
27
0
0
20 May 2022
Vision Transformer Adapter for Dense Predictions
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
45
543
0
17 May 2022
Transformer Scale Gate for Semantic Segmentation
Transformer Scale Gate for Semantic Segmentation
Hengcan Shi
Munawar Hayat
Jianfei Cai
ViT
32
22
0
14 May 2022
Where in the World is this Image? Transformer-based Geo-localization in
  the Wild
Where in the World is this Image? Transformer-based Geo-localization in the Wild
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
21
30
0
29 Apr 2022
Joint Forecasting of Panoptic Segmentations with Difference Attention
Joint Forecasting of Panoptic Segmentations with Difference Attention
Colin Graber
Cyril Jazra
Wenjie Luo
Liangyan Gui
A. Schwing
AI4TS
32
1
0
14 Apr 2022
Fashionformer: A simple, Effective and Unified Baseline for Human
  Fashion Segmentation and Recognition
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
23
27
0
10 Apr 2022
Learning Local and Global Temporal Contexts for Video Semantic
  Segmentation
Learning Local and Global Temporal Contexts for Video Semantic Segmentation
Guolei Sun
Yun Liu
Henghui Ding
Min Wu
Luc Van Gool
30
32
0
07 Apr 2022
End-to-End Instance Edge Detection
End-to-End Instance Edge Detection
Xueyan Zou
Haotian Liu
Yong Jae Lee
29
2
0
06 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
47
265
0
04 Apr 2022
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Haoyu He
Jianfei Cai
Zizheng Pan
Jing Liu
Jing Zhang
Dacheng Tao
Bohan Zhuang
34
17
0
04 Apr 2022
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
Zhenyu Li
Xuyang Wang
Xianming Liu
Junjun Jiang
MDE
24
191
0
03 Apr 2022
StructToken : Rethinking Semantic Segmentation with Structural Prior
StructToken : Rethinking Semantic Segmentation with Structural Prior
Fangjian Lin
Zhanhao Liang
Miao Zheng
Junjun He
Kaibing Chen
Sheng Tian
23
49
0
23 Mar 2022
GOSS: Towards Generalized Open-set Semantic Segmentation
GOSS: Towards Generalized Open-set Semantic Segmentation
Jie Hong
Weihong Li
Junlin Han
Jiyang Zheng
Pengfei Fang
Mehrtash Harandi
L. Petersson
VLM
30
19
0
23 Mar 2022
Focal Modulation Networks
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
33
263
0
22 Mar 2022
Test-time Adaptation with Slot-Centric Models
Test-time Adaptation with Slot-Centric Models
Mihir Prabhudesai
Anirudh Goyal
S. Paul
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gaurav Aggarwal
Thomas Kipf
Deepak Pathak
Katerina Fragkiadaki
TTA
26
9
0
21 Mar 2022
Active Token Mixer
Active Token Mixer
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
15
0
11 Mar 2022
NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
Yinhuai Wang
Shu-Yi Yang
Yu Fei Hu
Jian Zhang
22
11
0
10 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with
  Transformers
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
34
301
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for
  Segmentation
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
30
13
0
08 Mar 2022
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Jean-Michel Fortin
Olivier Gamache
Vincent Grondin
F. Pomerleau
Philippe Giguère
27
22
0
03 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
Feng Li
Hao Zhang
Shi-guang Liu
Jian Guo
L. Ni
Lei Zhang
ViT
52
648
0
02 Mar 2022
Previous
123...262728
Next