ResearchTrend.AI
Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,408 papers shown
Title
Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation
Tuan-Anh Vu
Duc Thanh Nguyen
Qing Guo
Binh-Son Hua
N. Chung
Ivor W. Tsang
Sai-Kit Yeung
DiffM
83
3
0
29 Dec 2023
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
Xin Zhang
Jinheng Xie
Yuan Yuan
Michael Bi Mi
Robby T. Tan
VOS OCL VLM
141
4
0
29 Dec 2023
Amodal Ground Truth and Completion in the Wild
Guanqi Zhan
Chuanxia Zheng
Weidi Xie
Andrew Zisserman
82
22
0
28 Dec 2023
Unsupervised Universal Image Segmentation
Dantong Niu
Xudong Wang
Xinyang Han
Long Lian
Roei Herzig
Trevor Darrell
VLM
94
20
0
28 Dec 2023
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model
Senqiao Yang
Tianyuan Qu
Xin Lai
Zhuotao Tian
Bohao Peng
Shu Liu
Jiaya Jia
VLM
122
32
0
28 Dec 2023
Fully Sparse 3D Occupancy Prediction
Haisong Liu
Yang Chen
Haiguang Wang
Zetong Yang
Tianyu Li
Jia Zeng
Li Chen
Hongyang Li
Limin Wang
128
19
0
28 Dec 2023
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving
Tianyu Li
Peijin Jia
Bangjun Wang
Li Chen
Kun Jiang
Junchi Yan
Hongyang Li
88
38
0
26 Dec 2023
Semantic-aware SAM for Point-Prompted Instance Segmentation
Zhaoyang Wei
Pengfei Chen
Xuehui Yu
Guorong Li
Jianbin Jiao
Zhenjun Han
VLM
101
6
0
26 Dec 2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
106
18
0
25 Dec 2023
WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments
Kavisha Vidanapathirana
Joshua Knights
Stephen Hausler
Mark Cox
Milad Ramezani
...
Ethan Griffiths
Shaheer Mohamed
Sridha Sridharan
Clinton Fookes
Peyman Moghadam
3DV
89
9
0
23 Dec 2023
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
MDE VLM
105
16
0
22 Dec 2023
SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation
Wenxi Yue
Jing Zhang
Kun Hu
Qiuxia Wu
Zongyuan Ge
Yong Xia
Jiebo Luo
Zhiyong Wang
80
3
0
22 Dec 2023
UniHuman: A Unified Model for Editing Human Images in the Wild
Nannan Li
Qing Liu
Krishna Kumar Singh
Yilin Wang
Jianming Zhang
Bryan A. Plummer
Zhe Lin
56
10
0
22 Dec 2023
Leveraging Habitat Information for Fine-grained Bird Identification
Tin Nguyen
Peijie Chen
Anh Totti Nguyen
VLM
113
0
0
22 Dec 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
76
31
0
21 Dec 2023
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
Han Shu
Wenshuo Li
Yehui Tang
Yiman Zhang
Yihao Chen
Houqiang Li
Yunhe Wang
Xinghao Chen
VLM
124
21
0
21 Dec 2023
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
Tariq Berrada
Jakob Verbeek
Camille Couprie
Alahari Karteek
93
9
0
20 Dec 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
Meng Wang
Henghui Ding
Jun Hao Liew
Jiajun Liu
Yao-Min Zhao
Yunchao Wei
DiffM
109
19
0
19 Dec 2023
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg ObjD
143
21
0
19 Dec 2023
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation
Sangyun Shin
Kaichen Zhou
M. Vankadari
Andrew Markham
Niki Trigoni
3DPC
80
11
0
18 Dec 2023
MatchDet: A Collaborative Framework for Image Matching and Object Detection
Jinxiang Lai
Wenlong Wu
Bin-Bin Gao
Jun Liu
Jiawei Zhan
Congchong Nie
Yi Zeng
Chengjie Wang
VLM
85
0
0
18 Dec 2023
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
P. Nguyen
T.D. Ngo
E. Kalogerakis
Chuang Gan
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
ISeg
154
55
0
17 Dec 2023
DETER: Detecting Edited Regions for Deterring Generative Manipulations
Sai Wang
Ye Zhu
Ruoyu Wang
Amaya Dharmasiri
Olga Russakovsky
Yu Wu
71
2
0
16 Dec 2023
Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification
Shang Gao
Chenyang Yu
Pingping Zhang
Huchuan Lu
90
4
0
15 Dec 2023
Collaborating Foundation Models for Domain Generalized Semantic Segmentation
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
139
14
0
15 Dec 2023
From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior
Jaeho Moon
J. P. Bello
Byeongjun Kwon
Munchurl Kim
61
7
0
15 Dec 2023
General Object Foundation Model for Images and Videos at Scale
Junfeng Wu
Yi Jiang
Qihao Liu
Zehuan Yuan
Xiang Bai
Song Bai
VOS VLM
111
41
0
14 Dec 2023
Tokenize Anything via Prompting
Ting Pan
Lulu Tang
Xinlong Wang
Shiguang Shan
VLM
68
23
0
14 Dec 2023
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Yuhang Yang
Wei Zhai
Hongcheng Luo
Yang Cao
Zheng-Jun Zha
124
26
0
14 Dec 2023
TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking
Raghav Goyal
Wan-Cyuan Fan
Mennatullah Siam
Leonid Sigal
VOS
82
3
0
13 Dec 2023
SAM-guided Graph Cut for 3D Instance Segmentation
Haoyu Guo
He Zhu
Sida Peng
Yuang Wang
Yujun Shen
Ruizhen Hu
Xiaowei Zhou
3DV
104
18
0
13 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM MLLM
115
21
0
13 Dec 2023
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie Yang
Yun Gu
128
2
0
13 Dec 2023
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution
Qi Tang
Yao-Min Zhao
Meiqin Liu
Jian Jin
Chao Yao
86
6
0
13 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip Torr
Xiuye Gu
Siyang Li
VLM CLIP
140
34
0
12 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
69
3
0
12 Dec 2023
Adaptive Human Trajectory Prediction via Latent Corridors
Neerja Thakkar
K. Mangalam
Andrea V. Bajcsy
Jitendra Malik
81
5
0
11 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
99
75
0
11 Dec 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
102
2
0
11 Dec 2023
Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
Qi Yang
Xing Nie
Tong Li
Pengfei Gao
Ying Guo
Cheng Zhen
Pengfei Yan
Shiming Xiang
VOS
87
14
0
11 Dec 2023
NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos
Jinxi Li
Ziyang Song
Bo Yang
3DH
78
15
0
11 Dec 2023
U-MixFormer: UNet-like Transformer with Mix-Attention for Efficient Semantic Segmentation
Seul-Ki Yeom
Julian von Klitzing
ViT
88
8
0
11 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
117
5
0
11 Dec 2023
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD VLM
72
6
0
10 Dec 2023
EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation
Mengnan Zhao
Lihe Zhang
Yuqiu Kong
Baocai Yin
88
1
0
09 Dec 2023
VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
Hanjung Kim
Jaehyun Kang
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Seon Joo Kim
88
0
0
08 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
141
201
0
07 Dec 2023
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Zhixiang Wei
Lin Chen
Yi Jin
Xiaoxiao Ma
Tianle Liu
Pengyang Lin
Ben Wang
H. Chen
Jinjin Zheng
103
48
0
07 Dec 2023
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting
Yankai Jiang
Zhongzhen Huang
Rongzhao Zhang
Xiaofan Zhang
Shaoting Zhang
VLM
97
13
0
07 Dec 2023
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
Yong Liu
Sule Bai
Guanbin Li
Yitong Wang
Yansong Tang
VLM
97
33
0
07 Dec 2023