arXiv: 2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,365 papers shown
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
Dingwen Zhang
Hao Li
Diqi He
Nian Liu
Lechao Cheng
Jingdong Wang
Junwei Han
VLM
49
0
0
22 May 2024
Influence of Water Droplet Contamination for Transparency Segmentation
Volker Knauthe
Paul Weitz
Thomas Pollabauer
Tristan Wirth
Arne Rak
Arjan Kuijper
Dieter W. Fellner
46
1
0
21 May 2024
DLAFormer: An End-to-End Transformer For Document Layout Analysis
Jiawei Wang
Kai Hu
Qiang Huo
3DV
ViT
32
3
0
20 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
48
21
0
19 May 2024
HARIS: Human-Like Attention for Reference Image Segmentation
Mengxi Zhang
Heqing Lian
Yiming Liu
Jie Chen
VLM
21
0
0
17 May 2024
NeRO: Neural Road Surface Reconstruction
Ruibo Wang
Song Zhang
Ping Huang
Donghai Zhang
Haoyu Chen
3DV
40
1
0
17 May 2024
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
57
24
0
16 May 2024
4D Panoptic Scene Graph Generation
Jingkang Yang
Jun Cen
Wenxuan Peng
Shuai Liu
Fangzhou Hong
Xiangtai Li
Kaiyang Zhou
Qifeng Chen
Ziwei Liu
45
13
0
16 May 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
Chengxiang Fan
Muzhi Zhu
Hao Chen
Yang Liu
Weijia Wu
Huaqi Zhang
Chunhua Shen
DiffM
62
11
0
16 May 2024
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Jie Liang
Radu Timofte
Qiaosi Yi
Shuai Liu
Lingchen Sun
Rongyuan Wu
Xindong Zhang
Huiyu Zeng
Lei Zhang
55
18
0
16 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
66
0
0
15 May 2024
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving
Daniel Bogdoll
Iramm Hamdard
Lukas Namgyu Rößler
Felix Geisler
Muhammed Bayram
...
Miguel de Campos
Anushervon Tabarov
Yitian Yang
Hanno Gottschalk
J. Marius Zöllner
42
5
0
13 May 2024
Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach
Elham Ravanbakhsh
Cheng Niu
Yongqing Liang
J. Ramanujam
Xin Li
VLM
54
0
0
10 May 2024
Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
Zhenliang Ni
Xinghao Chen
Yingjie Zhai
Yehui Tang
Yunhe Wang
46
15
0
10 May 2024
Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview
Yuhang Ming
Xingrui Yang
Weihan Wang
Zheng Chen
Jinglun Feng
Yifan Xing
Guofeng Zhang
40
12
0
09 May 2024
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
Minh-Triet Tran
Adrian de Luis
Haitao Liao
Ying Huang
Roy McCann
Alan Mantooth
Jack Cothren
Ngan Le
90
0
0
07 May 2024
Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
Yanan Zhang
Jinqing Zhang
Zengran Wang
Junhao Xu
Di Huang
32
16
0
04 May 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
You-Chen Liu
Lingdong Kong
Xiaoyang Wu
Runnan Chen
Xin Li
Liang Pan
Ziwei Liu
Yuexin Ma
3DPC
51
17
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
39
6
0
02 May 2024
Garbage Segmentation and Attribute Analysis by Robotic Dogs
Nuo Xu
Jianfeng Liao
Qiwei Meng
Wei Song
31
0
0
28 Apr 2024
Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes
Zelong Zeng
Kaname Tomite
32
1
0
27 Apr 2024
Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM
Laksh Nanwani
Kumaraditya Gupta
Aditya Mathur
Swayam Agrawal
A. H. A. Hafez
K. M. Krishna
40
0
0
27 Apr 2024
Instance-free Text to Point Cloud Localization with Relative Position Awareness
Lichao Wang
Zhihao Yuan
Jinke Ren
Shuguang Cui
Zhen Li
49
0
0
27 Apr 2024
Features Fusion for Dual-View Mammography Mass Detection
Arina Varlamova
Valery Belotsky
Grigory Novikov
Anton Konushin
Evgeny Sidorov
MedIm
22
1
0
25 Apr 2024
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Haotian Yan
Ming Wu
Chuang Zhang
40
12
0
25 Apr 2024
MaGGIe: Masked Guided Gradual Human Instance Matting
Chuong Huynh
Seoung Wug Oh
Abhinav Shrivastava
Joon-Young Lee
VOS
38
8
0
24 Apr 2024
Efficient Transformer Encoders for Mask2Former-style models
Manyi Yao
Abhishek Aich
Yumin Suh
Amit Roy-Chowdhury
Christian Shelton
Manmohan Chandraker
41
0
0
23 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
56
0
0
23 Apr 2024
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
Zhangjing Yang
Dun Liu
Wensheng Cheng
Jinqiao Wang
Yi Wu
VLM
31
2
0
22 Apr 2024
HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild
Supreeth Narasimhaswamy
Huy Anh Nguyen
Lihan Huang
Minh Hoai
40
4
0
22 Apr 2024
Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
Guanlong Jiao
Chenyangguang Zhang
Haonan Yin
Yu Mo
Biqing Huang
Hui Pan
Yi Luo
Jingxian Liu
35
0
0
21 Apr 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
42
33
0
21 Apr 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Chuofan Ma
Yi-Xin Jiang
Jiannan Wu
Zehuan Yuan
Xiaojuan Qi
VLM
ObjD
37
53
0
19 Apr 2024
FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving
Xingtai Gui
Tengteng Huang
Haonan Shao
Haotian Yao
Chi Zhang
44
3
0
19 Apr 2024
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
Shengcao Cao
Jiuxiang Gu
Jason Kuen
Hao Tan
Ruiyi Zhang
Handong Zhao
A. Nenkova
Liangyan Gui
Tong Sun
Yu-Xiong Wang
VLM
OCL
46
3
0
18 Apr 2024
How to Benchmark Vision Foundation Models for Semantic Segmentation?
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
VLM
34
7
0
18 Apr 2024
MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification
Weikang Yu
Xiaokang Zhang
Samiran Das
Xiao Xiang Zhu
Pedram Ghamisi
39
9
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
34
8
0
18 Apr 2024
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Mir Rayat Imtiaz Hossain
Mennatullah Siam
Leonid Sigal
James J. Little
VLM
38
6
0
17 Apr 2024
CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect
Minh Q. Tran
Sang Truong
Arthur F. A. Fernandes
Michael Kidd
Ngan Le
ViT
40
3
0
17 Apr 2024
StyleCity: Large-Scale 3D Urban Scenes Stylization
Yingshu Chen
Huajian Huang
Tuan-Anh Vu
Ka Chun Shum
Sai-Kit Yeung
51
0
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
52
4
0
15 Apr 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation
Yu-Ju Tsai
Jin-Cheng Jhang
Jingjing Zheng
Wei Wang
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
Ming-Hsuan Yang
3DV
41
4
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li-Na Song
Wenjun Zhang
Zhiwu Huang
MLLM
44
0
0
15 Apr 2024
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation
Gabriele Rosi
Claudia Cuttano
Niccolò Cavagnero
Giuseppe Averta
Fabio Cermelli
SSeg
64
3
0
15 Apr 2024
SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction
Pin Tang
Zhongdao Wang
Guoqing Wang
Jilai Zheng
Xiangxuan Ren
Bailan Feng
Chao Ma
55
38
0
15 Apr 2024
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
36
0
0
14 Apr 2024
Coreset Selection for Object Detection
Hojun Lee
Suyoung Kim
Junhoo Lee
Jaeyoung Yoo
Nojun Kwak
38
4
0
14 Apr 2024
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Binghua Li
Jie Mao
Zhe Sun
Chao Li
Qibin Zhao
Toshihisa Tanaka
28
0
0
13 Apr 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
48
16
0
12 Apr 2024