Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,408 papers shown
Title
Completing Visual Objects via Bridging Generation and Segmentation
Xiang Li
Yinpeng Chen
Chung-Ching Lin
Hao Chen
Kai Hu
Rita Singh
Bhiksha Raj
Lijuan Wang
Zicheng Liu
DiffM
110
3
0
01 Oct 2023
PharmacoNet: Accelerating Large-Scale Virtual Screening by Deep Pharmacophore Modeling
Seonghwan Seo
Woo Youn Kim
70
4
0
01 Oct 2023
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations
Alina Elena Baia
Valentina Poggioni
Andrea Cavallaro
AAML
61
1
0
30 Sep 2023
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan
Sungwoo Park
Alexander Schubert
Anthony Philippakis
Ahmed Alaa
VLM
111
25
0
30 Sep 2023
Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation
Zhan Xiong
Junling He
Pieter Valkema
Tri Q. Nguyen
M. Naesens
J. Kers
F. Verbeek
MedIm
60
0
0
29 Sep 2023
Investigating Shift Equivalence of Convolutional Neural Networks in Industrial Defect Segmentation
Yunsheng Tian
Jieliang Luo
Yichen Li
Zhengtao Zhang
Hui Li
72
4
0
29 Sep 2023
Superpixel Transformers for Efficient Semantic Segmentation
Xiao Han
Jieru Mei
Lu Zhang
Hang Yan
Yongkai Wu
Liang-Chieh Chen
Henrik Kretzschmar
ViT
61
11
0
28 Sep 2023
Radar Instance Transformer: Reliable Moving Instance Segmentation in Sparse Radar Point Clouds
Matthias Zeller
Vardeep S. Sandhu
Benedikt Mersch
D. Hristopulos
Michael Heidingsfeld
Cyrill Stachniss
106
10
0
28 Sep 2023
Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling
Ke Yu
Yuanmin Tang
Giulia DeSalvo
Suraj Kothawade
Abdullah Rashwan
S. Tavakkol
Kayhan Batmanghelich
Xiaoqi Yin
ISeg
78
0
0
28 Sep 2023
Mask4Former: Mask Transformer for 4D Panoptic Segmentation
Kadir Yilmaz
Jonas Schult
Alexey Nekrasov
Bastian Leibe
ISeg
3DPC
86
11
0
28 Sep 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
Jiawei Han
Guiguang Ding
ViT
58
6
0
27 Sep 2023
The Robust Semantic Segmentation UNCV2023 Challenge Results
Xuanlong Yu
Yi Zuo
Zitao Wang
Xiaowen Zhang
Jiaxuan Zhao
...
Angela Yao
Wenlong Chen
Ivor J. A. Simpson
Neill D. F. Campbell
Gianni Franchi
UQCV
89
5
0
27 Sep 2023
DECO: Dense Estimation of 3D Human-Scene Contact In The Wild
Shashank Tripathi
Agniv Chatterjee
Jean-Claude Passy
Hongwei Yi
Dimitrios Tzionas
Michael J. Black
3DH
85
23
0
26 Sep 2023
MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection
Kemal Oksuz
Selim Kuzucu
Tom Joy
P. Dokania
MoE
127
7
0
26 Sep 2023
Volumetric Semantically Consistent 3D Panoptic Mapping
Yang Miao
Iro Armeni
Marc Pollefeys
Dániel Baráth
3DPC
88
9
0
26 Sep 2023
Dynamic Scene Graph Representation for Surgical Video
Felix Holm
Ghazal Ghazaei
Tobias Czempiel
Ege Özsoy
Stefan Saur
Nassir Navab
MedIm
64
16
0
25 Sep 2023
Assessment of a new GeoAI foundation model for flood inundation mapping
Wenwen Li
Hyunho Lee
Sizhe Wang
Chia-Yu Hsu
S. Arundel
AI4CE
66
18
0
25 Sep 2023
3D Indoor Instance Segmentation in an Open-World
Mohamed El Amine Boudjoghra
Salwa K. Al Khatib
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Fahad Khan
3DV
ISeg
68
6
0
25 Sep 2023
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation
Quang H. Nguyen
T. Vu
Anh Tran
Kim Dan Nguyen
DiffM
121
89
0
25 Sep 2023
A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and Weeds Competition
K. Nguyen
T. Phung
Hoang-Giang Cao
58
7
0
24 Sep 2023
LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning
Liulei Li
Wenguan Wang
Yi Yang
NAI
VLM
95
27
0
24 Sep 2023
I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses
Trong-Thang Pham
Jacob Brecheisen
Anh Nguyen
Hien Nguyen
Ngan Le
69
15
0
24 Sep 2023
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation
Ke Fan
Jingshi Lei
Xuelin Qian
Miaopeng Yu
Tianjun Xiao
Tong He
Zheng Zhang
Yanwei Fu
VOS
53
4
0
23 Sep 2023
ClusterFormer: Clustering As A Universal Visual Learner
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
VLM
96
10
0
22 Sep 2023
NTO3D: Neural Target Object 3D Reconstruction with Segment Anything
Xi Wei
Renrui Zhang
Jiarui Wu
Jiaming Liu
Ming Lu
Yandong Guo
Shanghang Zhang
74
6
0
22 Sep 2023
Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling
Leon Sick
Dominik Engel
Pedro Hermosilla
Timo Ropinski
82
8
0
21 Sep 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
58
13
0
21 Sep 2023
A Vision-Centric Approach for Static Map Element Annotation
Jiaxin Zhang
Shiyuan Chen
Haoran Yin
Ruohong Mei
Xuan Liu
Cong Yang
Qian Zhang
Wei Sui
3DV
61
3
0
21 Sep 2023
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation
Nian Liu
Kepan Nan
Wangbo Zhao
Yuanwei Liu
Xiwen Yao
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
Junwei Han
Fahad Shahbaz Khan
VOS
81
7
0
20 Sep 2023
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing
Jiahang Li
Yikang Zhang
Peng Yun
Guangliang Zhou
Qijun Chen
Rui Fan
ViT
OffRL
98
29
0
19 Sep 2023
PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes
Xiao Fu
Shangzhan Zhang
Tianrun Chen
Yichong Lu
Xiaowei Zhou
Andreas Geiger
Yiyi Liao
3DPC
166
9
0
19 Sep 2023
Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object Detection with Repeated Labels
David Tschirschwitz
C. Benz
Morris Florek
Henrik Norderhus
Benno Stein
Volker Rodehorst
61
1
0
18 Sep 2023
Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
Shaofei Huang
Han Li
Yuqing Wang
Hongji Zhu
Jiao Dai
Jizhong Han
Wenge Rong
Si Liu
VOS
53
19
0
18 Sep 2023
Uncertainty-aware 3D Object-Level Mapping with Deep Shape Priors
Ziwei Liao
Jun Yang
Jingxing Qian
Angela P. Schoellig
Steven L. Waslander
82
5
0
17 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation
Zhaochong An
Guolei Sun
Zongwei Wu
Hao Tang
Luc Van Gool
VOS
68
5
0
14 Sep 2023
NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches
Chi-en Amy Tai
Matthew Keller
Saeejith Nair
Yuhao Chen
Yifan Wu
...
Krish Parmar
Pengcheng Xi
Heather H. Keller
Sharon I Kirkpatrick
Alexander Wong
61
2
0
14 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
56
3
0
13 Sep 2023
MPI-Flow: Learning Realistic Optical Flow with Multiplane Images
Yingping Liang
Jiaming Liu
Debing Zhang
Ying Fu
77
7
0
13 Sep 2023
ASPED: An Audio Dataset for Detecting Pedestrians
Pavan Seshadri
Chaeyeon Han
B. Koo
Noah Posner
S. Guhathakurta
Alexander Lerch
31
2
0
12 Sep 2023
IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation
Qiyu Sun
Huilin Chen
Meng Zheng
Ziyan Wu
Michael Felsberg
Yang Tang
96
3
0
12 Sep 2023
Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields
Teppei Suzuki
AI4CE
98
8
0
12 Sep 2023
Panoptic Vision-Language Feature Fields
Haoran Chen
Kenneth Blomqvist
Francesco Milano
Roland Siegwart
VLM
86
14
0
11 Sep 2023
Toward a Deeper Understanding: RetNet Viewed through Convolution
Chenghao Li
Chaoning Zhang
ViT
75
7
0
11 Sep 2023
PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics
Claus Smitt
Michael Halstead
Patrick Zimmer
Thomas Labe
Esra Guclu
C. Stachniss
Chris McCool
58
18
0
11 Sep 2023
Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation
Shyam Nandan Rai
Fabio Cermelli
Barbara Caputo
Carlo Masone
ISeg
ViT
74
5
0
08 Sep 2023
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding
Ozan Unal
Daniel Gehrig
Suman Saha
Luc Van Gool
87
17
0
08 Sep 2023
Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving
Thomas E. Huang
Yifan Liu
Luc Van Gool
Fisher Yu
118
5
0
08 Sep 2023
Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes
Youssef Shoeb
Robin Shing Moon Chan
Gesina Schwalbe
Azarm Nowzard
Fatma Guney
Hanno Gottschalk
64
6
0
08 Sep 2023
MMSFormer: Multimodal Transformer for Material and Semantic Segmentation
Md Kaykobad Reza
Ashley Prater-Bennette
M. Salman Asif
169
15
0
07 Sep 2023
Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
VOS
VLM
107
138
0
07 Sep 2023