2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,408 papers shown
Title
Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation
Tuan-Anh Vu
Duc Thanh Nguyen
Qing Guo
Binh-Son Hua
N. Chung
Ivor W. Tsang
Sai-Kit Yeung
DiffM
83
3
0
29 Dec 2023
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
Xin Zhang
Jinheng Xie
Yuan Yuan
Michael Bi Mi
Robby T. Tan
VOS
OCL
VLM
141
4
0
29 Dec 2023
Amodal Ground Truth and Completion in the Wild
Guanqi Zhan
Chuanxia Zheng
Weidi Xie
Andrew Zisserman
82
22
0
28 Dec 2023
Unsupervised Universal Image Segmentation
Dantong Niu
Xudong Wang
Xinyang Han
Long Lian
Roei Herzig
Trevor Darrell
VLM
94
20
0
28 Dec 2023
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model
Senqiao Yang
Tianyuan Qu
Xin Lai
Zhuotao Tian
Bohao Peng
Shu Liu
Jiaya Jia
VLM
122
32
0
28 Dec 2023
Fully Sparse 3D Occupancy Prediction
Haisong Liu
Yang Chen
Haiguang Wang
Zetong Yang
Tianyu Li
Jia Zeng
Li Chen
Hongyang Li
Limin Wang
128
19
0
28 Dec 2023
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving
Tianyu Li
Peijin Jia
Bangjun Wang
Li Chen
Kun Jiang
Junchi Yan
Hongyang Li
88
38
0
26 Dec 2023
Semantic-aware SAM for Point-Prompted Instance Segmentation
Zhaoyang Wei
Pengfei Chen
Xuehui Yu
Guorong Li
Jianbin Jiao
Zhenjun Han
VLM
101
6
0
26 Dec 2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
106
18
0
25 Dec 2023
WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments
Kavisha Vidanapathirana
Joshua Knights
Stephen Hausler
Mark Cox
Milad Ramezani
...
Ethan Griffiths
Shaheer Mohamed
Sridha Sridharan
Clinton Fookes
Peyman Moghadam
3DV
89
9
0
23 Dec 2023
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
MDE
VLM
105
16
0
22 Dec 2023
SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation
Wenxi Yue
Jing Zhang
Kun Hu
Qiuxia Wu
Zongyuan Ge
Yong Xia
Jiebo Luo
Zhiyong Wang
80
3
0
22 Dec 2023
UniHuman: A Unified Model for Editing Human Images in the Wild
Nannan Li
Qing Liu
Krishna Kumar Singh
Yilin Wang
Jianming Zhang
Bryan A. Plummer
Zhe Lin
56
10
0
22 Dec 2023
Leveraging Habitat Information for Fine-grained Bird Identification
Tin Nguyen
Peijie Chen
Anh Totti Nguyen
VLM
113
0
0
22 Dec 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
76
31
0
21 Dec 2023
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
Han Shu
Wenshuo Li
Yehui Tang
Yiman Zhang
Yihao Chen
Houqiang Li
Yunhe Wang
Xinghao Chen
VLM
124
21
0
21 Dec 2023
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
Tariq Berrada
Jakob Verbeek
Camille Couprie
Alahari Karteek
93
9
0
20 Dec 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
Meng Wang
Henghui Ding
Jun Hao Liew
Jiajun Liu
Yao-Min Zhao
Yunchao Wei
DiffM
109
19
0
19 Dec 2023
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg
ObjD
143
21
0
19 Dec 2023
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation
Sangyun Shin
Kaichen Zhou
M. Vankadari
Andrew Markham
Niki Trigoni
3DPC
80
11
0
18 Dec 2023
MatchDet: A Collaborative Framework for Image Matching and Object Detection
Jinxiang Lai
Wenlong Wu
Bin-Bin Gao
Jun Liu
Jiawei Zhan
Congchong Nie
Yi Zeng
Chengjie Wang
VLM
85
0
0
18 Dec 2023
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
P. Nguyen
T.D. Ngo
E. Kalogerakis
Chuang Gan
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
ISeg
154
55
0
17 Dec 2023
DETER: Detecting Edited Regions for Deterring Generative Manipulations
Sai Wang
Ye Zhu
Ruoyu Wang
Amaya Dharmasiri
Olga Russakovsky
Yu Wu
71
2
0
16 Dec 2023
Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification
Shang Gao
Chenyang Yu
Pingping Zhang
Huchuan Lu
90
4
0
15 Dec 2023
Collaborating Foundation Models for Domain Generalized Semantic Segmentation
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
139
14
0
15 Dec 2023
From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior
Jaeho Moon
J. P. Bello
Byeongjun Kwon
Munchurl Kim
61
7
0
15 Dec 2023
General Object Foundation Model for Images and Videos at Scale
Junfeng Wu
Yi Jiang
Qihao Liu
Zehuan Yuan
Xiang Bai
Song Bai
VOS
VLM
111
41
0
14 Dec 2023
Tokenize Anything via Prompting
Ting Pan
Lulu Tang
Xinlong Wang
Shiguang Shan
VLM
68
23
0
14 Dec 2023
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Yuhang Yang
Wei Zhai
Hongcheng Luo
Yang Cao
Zheng-Jun Zha
124
26
0
14 Dec 2023
TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking
Raghav Goyal
Wan-Cyuan Fan
Mennatullah Siam
Leonid Sigal
VOS
82
3
0
13 Dec 2023
SAM-guided Graph Cut for 3D Instance Segmentation
Haoyu Guo
He Zhu
Sida Peng
Yuang Wang
Yujun Shen
Ruizhen Hu
Xiaowei Zhou
3DV
104
18
0
13 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
115
21
0
13 Dec 2023
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie Yang
Yun Gu
128
2
0
13 Dec 2023
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution
Qi Tang
Yao-Min Zhao
Meiqin Liu
Jian Jin
Chao Yao
86
6
0
13 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip Torr
Xiuye Gu
Siyang Li
VLM
CLIP
140
34
0
12 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
69
3
0
12 Dec 2023
Adaptive Human Trajectory Prediction via Latent Corridors
Neerja Thakkar
K. Mangalam
Andrea V. Bajcsy
Jitendra Malik
81
5
0
11 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
99
75
0
11 Dec 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
102
2
0
11 Dec 2023
Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
Qi Yang
Xing Nie
Tong Li
Pengfei Gao
Ying Guo
Cheng Zhen
Pengfei Yan
Shiming Xiang
VOS
87
14
0
11 Dec 2023
NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos
Jinxi Li
Ziyang Song
Bo Yang
3DH
78
15
0
11 Dec 2023
U-MixFormer: UNet-like Transformer with Mix-Attention for Efficient Semantic Segmentation
Seul-Ki Yeom
Julian von Klitzing
ViT
88
8
0
11 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
117
5
0
11 Dec 2023
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD
VLM
72
6
0
10 Dec 2023
EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation
Mengnan Zhao
Lihe Zhang
Yuqiu Kong
Baocai Yin
88
1
0
09 Dec 2023
VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
Hanjung Kim
Jaehyun Kang
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Seon Joo Kim
88
0
0
08 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
141
201
0
07 Dec 2023
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Zhixiang Wei
Lin Chen
Yi Jin
Xiaoxiao Ma
Tianle Liu
Pengyang Lin
Ben Wang
H. Chen
Jinjin Zheng
103
48
0
07 Dec 2023
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting
Yankai Jiang
Zhongzhen Huang
Rongzhao Zhang
Xiaofan Zhang
Shaoting Zhang
VLM
97
13
0
07 Dec 2023
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
Yong Liu
Sule Bai
Guanbin Li
Yitong Wang
Yansong Tang
VLM
97
33
0
07 Dec 2023