2112.01527
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,408 papers shown
Title
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
105
23
0
06 Dec 2023
Foundation Model Assisted Weakly Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
91
27
0
06 Dec 2023
AI-SAM: Automatic and Interactive Segment Anything Model
Yimu Pan
Sitao Zhang
Alison D. Gernand
Jeffery A. Goldstein
J. Z. Wang
VLM
64
4
0
05 Dec 2023
RotaTR: Detection Transformer for Dense and Rotated Object
Yuke Zhu
Yumeng Ruan
Lei Yang
Sheng Guo
79
0
0
05 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
117
5
0
05 Dec 2023
Lenna: Language Enhanced Reasoning Detection Assistant
Fei Wei
Xinyu Zhang
Ailing Zhang
Bo Zhang
Xiangxiang Chu
MLLM
LRM
99
25
0
05 Dec 2023
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness
Anh-Quan Cao
Angela Dai
Raoul de Charette
UQCV
71
22
0
04 Dec 2023
Aligning and Prompting Everything All at Once for Universal Visual Perception
Yunhang Shen
Chaoyou Fu
Peixian Chen
Mengdan Zhang
Ke Li
Xing Sun
Yunsheng Wu
Shaohui Lin
Rongrong Ji
VLM
ObjD
122
39
0
04 Dec 2023
UniGS: Unified Representation for Image Generation and Segmentation
Lu Qi
Lehan Yang
Weidong Guo
Yu-Syuan Xu
Bo Du
Varun Jampani
Ming-Hsuan Yang
98
19
0
04 Dec 2023
Unveiling Objects with SOLA: An Annotation-Free Image Search on the Object Level for Automotive Data Sets
Philipp Rigoll
Jacob Langner
Eric Sax
84
4
0
04 Dec 2023
Effective Adapter for Face Recognition in the Wild
Yunhao Liu
Yu-Ju Tsai
Kelvin C. K. Chan
Xiangtai Li
Lu Qi
Ming-Hsuan Yang
CVBM
64
1
0
04 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
117
20
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan Yuille
VLM
146
66
0
04 Dec 2023
SANeRF-HQ: Segment Anything for NeRF in High Quality
Yichen Liu
Benran Hu
Chi-Keung Tang
Yu-Wing Tai
97
13
0
03 Dec 2023
Segment and Caption Anything
Xiaoke Huang
Jianfeng Wang
Yansong Tang
Zheng Zhang
Han Hu
Jiwen Lu
Lijuan Wang
Zicheng Liu
MLLM
VLM
94
21
0
01 Dec 2023
Sequential Modeling Enables Scalable Learning for Large Vision Models
Yutong Bai
Xinyang Geng
K. Mangalam
Amir Bar
Alan Yuille
Trevor Darrell
Jitendra Malik
Alexei A. Efros
MLLM
VLM
88
169
0
01 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
107
159
0
01 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGen
DiffM
66
0
0
01 Dec 2023
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation
Rongyao Fang
Shilin Yan
Zhaoyang Huang
Jingqiu Zhou
Hao Tian
Jifeng Dai
Hongsheng Li
MLLM
106
14
0
30 Nov 2023
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation
Yau Shing Jonathan Cheung
Xi Chen
Lihe Yang
Hengshuang Zhao
75
1
0
30 Nov 2023
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
Ju He
Qihang Yu
Inkyu Shin
XueQing Deng
Alan Yuille
Xiaohui Shen
Liang-Chieh Chen
VOS
120
2
0
30 Nov 2023
One-Shot Open Affordance Learning with Foundation Models
Gen Li
Deqing Sun
Laura Sevilla-Lara
Varun Jampani
VLM
118
26
0
29 Nov 2023
Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
Yuan Wang
Naisong Luo
Tianzhu Zhang
100
13
0
29 Nov 2023
Continual Learning for Image Segmentation with Dynamic Query
Weijia Wu
Yuzhong Zhao
Zhuang Li
Lianlei Shan
Hong Zhou
Mike Zheng Shou
VLM
CLL
90
17
0
29 Nov 2023
How does spatial structure affect psychological restoration? A method based on Graph Neural Networks and Street View Imagery
Haoran Ma
Yan Zhang
Pengyuan Liu
Fan Zhang
Pengyu Zhu
47
12
0
29 Nov 2023
Panoptic Video Scene Graph Generation
Jingkang Yang
Wen-Hsiao Peng
Xiangtai Li
Zujin Guo
Liangyu Chen
...
Zheng Ma
Kaiyang Zhou
Wayne Zhang
Chen Change Loy
Ziwei Liu
VOS
120
43
0
28 Nov 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
Dai Shi
ViT
93
98
0
28 Nov 2023
LLaFS: When Large Language Models Meet Few-Shot Segmentation
Lanyun Zhu
Tianrun Chen
Deyi Ji
Jieping Ye
Jun Liu
VLM
122
42
0
28 Nov 2023
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation
Zijian Zhou
Miaojing Shi
Holger Caesar
VLM
125
12
0
27 Nov 2023
SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields
Quentin Herau
Nathan Piasco
Moussâb Bennehar
Luis Roldão
D. Tsishkou
Cyrille Migniot
Pascal Vasseur
C. Demonceaux
91
12
0
27 Nov 2023
Stable Segment Anything Model
Qi Fan
Xin Tao
Lei Ke
Mingqiao Ye
Yuanhui Zhang
Pengfei Wan
Zhong-ming Wang
Yu-Wing Tai
Chi-Keung Tang
VLM
85
6
0
27 Nov 2023
RISAM: Referring Image Segmentation via Mutual-Aware Attention Features
Mengxi Zhang
Yiming Liu
Xiangjun Yin
Huanjing Yue
Jingyu Yang
124
1
0
27 Nov 2023
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Bin Xie
Jiale Cao
Jin Xie
Fahad Shahbaz Khan
Yanwei Pang
VLM
125
48
0
27 Nov 2023
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
Thanh-Dat Truong
Utsav Prabhu
Bhiksha Raj
Jackson Cothren
Khoa Luu
CLL
168
3
0
27 Nov 2023
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation
Lingchen Meng
Shiyi Lan
Hengduo Li
Jose M. Alvarez
Zuxuan Wu
Yu-Gang Jiang
VLM
ISeg
MLLM
79
9
0
24 Nov 2023
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
Maksim Kolodiazhnyi
Anna Vorontsova
Anton Konushin
D. Rukhovich
ViT
96
52
0
24 Nov 2023
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
Benjamin Kiefer
Lojze Žust
Matej Kristan
J. Pers
Matija Tersek
...
Magdalena Šumunec
Nadir Kapetanović
A. Michel
Wolfgang Gross
Martin Weinmann
64
4
0
23 Nov 2023
Visual In-Context Prompting
Feng Li
Qing Jiang
Hao Zhang
Tianhe Ren
Shilong Liu
...
Hongyang Li
Chun-yue Li
Jianwei Yang
Lei Zhang
Jianfeng Gao
VLM
LRM
MLLM
89
36
0
22 Nov 2023
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
104
14
0
22 Nov 2023
Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning
Xin Zhang
Jiawei Du
Yunsong Li
Weiying Xie
Qiufeng Wang
95
14
0
22 Nov 2023
Exploring Lip Segmentation Techniques in Computer Vision: A Comparative Analysis
Pietro Masur
Francisco Braulio Oliveira
Lucas Moreira Medino
Emanuel Huber
Milene Haraguchi Padilha
Cássio De Alcantara
Renata Sellaro
35
1
0
20 Nov 2023
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Hao Li
Dingwen Zhang
Yalun Dai
Nian Liu
Lechao Cheng
Jingfeng Li
Jingdong Wang
Junwei Han
103
16
0
20 Nov 2023
OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning
Haiyang Ying
Yixuan Yin
Jinzhi Zhang
Fan Wang
Tao Yu
Ruqi Huang
Lu Fang
48
33
0
20 Nov 2023
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model
Chunming He
Chengyu Fang
Yulun Zhang
Chenyu You
Kai Li
Longxiang Tang
Fengyang Xiao
Xiu Li
Z. Guo
148
32
0
20 Nov 2023
Open-Vocabulary Camouflaged Object Segmentation
Youwei Pang
Xiaoqi Zhao
Jiaming Zuo
Lihe Zhang
Huchuan Lu
VLM
ObjD
100
7
0
19 Nov 2023
Enhancing Transformer-Based Segmentation for Breast Cancer Diagnosis using Auto-Augmentation and Search Optimisation Techniques
Leon Hamnett
M. Adewunmi
M. Abayomi
Kayode Raheem
Fahad Ahmed
ViT
29
1
0
18 Nov 2023
Segment Anything in Defect Detection
Bozhen Hu
Bin Gao
Cheng Tan
Tongle Wu
Stan Z. Li
38
7
0
17 Nov 2023
Towards Open-Ended Visual Recognition with Large Language Model
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
VLM
74
8
0
14 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
127
175
0
10 Nov 2023
VioLA: Aligning Videos to 2D LiDAR Scans
Jun-Jee Chao
Selim Engin
Nikhil Chavan-Dafle
Bhoram Lee
Volkan Isler
VGen
61
0
0
08 Nov 2023