ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXivPDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,369 papers shown
Title
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision
  Transformers
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
36
0
0
14 Apr 2024
Coreset Selection for Object Detection
Coreset Selection for Object Detection
Hojun Lee
Suyoung Kim
Junhoo Lee
Jaeyoung Yoo
Nojun Kwak
38
4
0
14 Apr 2024
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part
  Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Binghua Li
Jie Mao
Zhe Sun
Chao Li
Qibin Zhao
Toshihisa Tanaka
28
0
0
13 Apr 2024
COCONut: Modernizing COCO Segmentation
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
48
16
0
12 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
49
14
0
12 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency
  Feedback
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
Chong Chen
45
63
0
11 Apr 2024
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
Jihao Liu
Jinliang Zheng
Yu Liu
Hongsheng Li
VLM
29
3
0
11 Apr 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Jingxuan Xu
Wuyang Chen
Yao-Min Zhao
Yunchao Wei
VLM
44
2
0
11 Apr 2024
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
Poulami Sinhamahapatra
Franziska Schwaiger
Shirsha Bose
Huiyu Wang
Karsten Roscher
Stephan Guennemann
25
3
0
11 Apr 2024
Identification of Fine-grained Systematic Errors via Controlled Scene
  Generation
Identification of Fine-grained Systematic Errors via Controlled Scene Generation
Valentyn Boreiko
Matthias Hein
J. H. Metzen
35
1
0
10 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic
  Segmentation
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Pandeng Li
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
31
3
0
08 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
45
5
0
08 Apr 2024
NeRF2Points: Large-Scale Point Cloud Generation From Street Views'
  Radiance Field Optimization
NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization
Peng Tu
Xun Zhou
Mingming Wang
Xiaojun Yang
Bo Peng
Ping Chen
Xiu Su
Yawen Huang
Yefeng Zheng
Chang Xu
40
1
0
07 Apr 2024
Joint Reconstruction of 3D Human and Object via Contact-Based Refinement
  Transformer
Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer
Hyeongjin Nam
Daniel Sungho Jung
Gyeongsik Moon
Kyoung Mu Lee
3DH
36
10
0
07 Apr 2024
Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal
  Remote Sensing Image Interpretation
Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation
Danpei Zhao
Bo Yuan
Ziqiang Chen
Tian Li
Zhuoran Liu
Wentao Li
Yue Gao
60
10
0
06 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
78
2
0
06 Apr 2024
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor
  and Connector
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector
Junbo Li
Keyan Chen
Gengju Tian
Lu Li
Z. Shi
52
1
0
05 Apr 2024
Deep Learning for Satellite Image Time Series Analysis: A Review
Deep Learning for Satellite Image Time Series Analysis: A Review
Lynn Miller
Charlotte Pelletier
Geoffrey I. Webb
35
18
0
05 Apr 2024
Decoupling Static and Hierarchical Motion Perception for Referring Video
  Segmentation
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
Shuting He
Henghui Ding
VOS
37
23
0
04 Apr 2024
HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid,
  Asymmetric, and Progressive Heterogeneous Feature Fusion
HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion
Jiahang Li
Peng Yun
Qijun Chen
Rui Fan
46
8
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
71
2
0
04 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic
  Dataset in Crowded Human Environments
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
34
2
0
02 Apr 2024
What is Point Supervision Worth in Video Instance Segmentation?
What is Point Supervision Worth in Video Instance Segmentation?
Shuaiyi Huang
De-An Huang
Zhiding Yu
Shiyi Lan
Subhashree Radhakrishnan
Jose M. Alvarez
Abhinav Shrivastava
A. Anandkumar
VOS
32
3
0
01 Apr 2024
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Beomyoung Kim
Donghyeon Kim
Sung Ju Hwang
31
0
0
01 Apr 2024
MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction
MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction
Xiaolu Liu
Song Wang
Wentong Li
Ruizi Yang
Junbo Chen
Jianke Zhu
52
19
0
01 Apr 2024
Transformer based Pluralistic Image Completion with Reduced Information
  Loss
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
73
11
0
31 Mar 2024
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
Yang Miao
Francis Engelmann
Olga Vysotska
Federico Tombari
Marc Pollefeys
Daniel Barath
3DPC
60
7
0
30 Mar 2024
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and
  Intra-Class Regions for Weakly-Supervised Semantic Segmentation
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sang-Kee Jo
Fei Pan
In-Jae Yu
Kyungsu Kim
38
2
0
30 Mar 2024
Image-to-Image Matching via Foundation Models: A New Perspective for
  Open-Vocabulary Semantic Segmentation
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
Yuan Wang
Rui Sun
Naisong Luo
Yuwen Pan
Tianzhu Zhang
VLM
48
9
0
30 Mar 2024
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with
  Visual Prompt Tuning
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
Beomyoung Kim
Joonsang Yu
Sung Ju Hwang
VLM
CLL
44
11
0
29 Mar 2024
Mixed-precision Supernet Training from Vision Foundation Models using
  Low Rank Adapter
Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter
Yuiko Sakuma
Masakazu Yoshimura
Junji Otsuka
Atsushi Irie
Takeshi Ohashi
MQ
40
0
0
29 Mar 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques
  and Insights
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
53
7
0
28 Mar 2024
Efficient 3D Instance Mapping and Localization with Neural Fields
Efficient 3D Instance Mapping and Localization with Neural Fields
George Tang
Krishna Murthy Jatavallabhula
Antonio Torralba
ISeg
39
5
0
28 Mar 2024
GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond
GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond
Chongjie Ye
Y. Nie
Jiahao Chang
Yuantao Chen
Yihao Zhi
Xiaoguang Han
3DGS
93
15
0
28 Mar 2024
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Xiaoyang Lyu
Chirui Chang
Peng Dai
Yang-tian Sun
Xiaojuan Qi
3DGS
46
3
0
28 Mar 2024
WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for
  Reconstructing Dynamic Objects under Occlusion
WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion
Khiem Vuong
N. D. Reddy
R. Tamburo
S. Narasimhan
32
1
0
27 Mar 2024
Benchmarking Object Detectors with COCO: A New Path Forward
Benchmarking Object Detectors with COCO: A New Path Forward
Shweta Singh
Aayan Yadav
Jitesh Jain
Humphrey Shi
Justin Johnson
Karan Desai
33
7
0
27 Mar 2024
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical
  Decoding
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding
Zhiheng Cheng
Qingyue Wei
Hongru Zhu
Yan Wang
Liangqiong Qu
Wei Shao
Yuyin Zhou
MedIm
41
27
0
27 Mar 2024
EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation
EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation
Chenhongyi Yang
Anastasia Tkach
Shreyas Hampali
Linguang Zhang
Elliot J. Crowley
Cem Keskin
42
1
0
26 Mar 2024
The Need for Speed: Pruning Transformers with One Recipe
The Need for Speed: Pruning Transformers with One Recipe
Samir Khaki
Konstantinos N. Plataniotis
42
10
0
26 Mar 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
39
89
0
26 Mar 2024
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using
  Heuristics-Guided Segmentation
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation
Jiahao Chen
Yipeng Qin
Lingjie Liu
Jiangbo Lu
Guanbin Li
48
11
0
26 Mar 2024
Activity-Biometrics: Person Identification from Daily Activities
Activity-Biometrics: Person Identification from Daily Activities
Shehreen Azad
Yogesh S Rawat
29
3
0
26 Mar 2024
Clustering Propagation for Universal Medical Image Segmentation
Clustering Propagation for Universal Medical Image Segmentation
Yuhang Ding
Liulei Li
Wenguan Wang
Yi Yang
48
9
0
25 Mar 2024
V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster
V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster
Si Liu
Zihan Ding
Jiahui Fu
Hongyu Li
Siheng Chen
Shifeng Zhang
Xu Zhou
47
3
0
25 Mar 2024
Towards Large-Scale Training of Pathology Foundation Models
Towards Large-Scale Training of Pathology Foundation Models
kaiko.ai
N. Aben
Edwin D. de Jong
Ioannis Gatopoulos
Nicolas Kanzig
Mikhail Karasikov
Axel Lagré
Roman Moser
J. Doorn
Fei Tang
MedIm
AI4CE
39
9
0
24 Mar 2024
3D-TransUNet for Brain Metastases Segmentation in the BraTS2023
  Challenge
3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Siwei Yang
Xianhang Li
Jieru Mei
Jieneng Chen
Cihang Xie
Yuyin Zhou
MedIm
44
4
0
23 Mar 2024
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian
  Splatting
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting
Jun Guo
Xiaojian Ma
Yue Fan
Huaping Liu
Qing Li
3DGS
44
27
0
22 Mar 2024
Your Image is My Video: Reshaping the Receptive Field via Image-To-Video
  Differentiable AutoAugmentation and Fusion
Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion
S. Casarin
C. Ugwu
Sergio Escalera
Oswald Lanz
36
0
0
22 Mar 2024
Towards a Comprehensive, Efficient and Promptable Anatomic Structure
  Segmentation Model using 3D Whole-body CT Scans
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans
Heng Guo
Jianfeng Zhang
Jiaxing Huang
Tony C. W. Mok
Dazhou Guo
Ke Yan
Le Lu
Dakai Jin
Minfeng Xu
MedIm
24
5
0
22 Mar 2024
Previous
123...111213...262728
Next