2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,408 papers shown
Title
When Do We Not Need Larger Vision Models?
Baifeng Shi
Ziyang Wu
Maolin Mao
Xin Wang
Trevor Darrell
VLM
LRM
119
47
0
19 Mar 2024
PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation
Haruya Ishikawa
Takumi Iida
Yoshinori Konishi
Yoshimitsu Aoki
73
2
0
19 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLM
CLIP
115
3
0
19 Mar 2024
Fusion Transformer with Object Mask Guidance for Image Forgery Analysis
Dimitrios Karageorgiou
Giorgos Kordopatis-Zilos
Symeon Papadopoulos
ViT
61
7
0
18 Mar 2024
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery
Yuqi Zhang
Guanying Chen
Jiaxing Chen
Shuguang Cui
73
2
0
18 Mar 2024
EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding
Wenhua Wu
Qi Wang
Guangming Wang
Junping Wang
Tiankun Zhao
Yang Liu
Dongchao Gao
Yanfeng Guo
Hesheng Wang
AI4CE
3DV
105
11
0
18 Mar 2024
Video Object Segmentation with Dynamic Query Modulation
Hantao Zhou
Runze Hu
Xiu Li
VOS
81
1
0
18 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
78
7
0
14 Mar 2024
Renovating Names in Open-Vocabulary Segmentation Benchmarks
Haiwen Huang
Songyou Peng
Dan Zhang
Andreas Geiger
VLM
76
3
0
14 Mar 2024
The NeRFect Match: Exploring NeRF Features for Visual Localization
Qunjie Zhou
Maxim Maximov
Or Litany
Laura Leal-Taixé
88
16
0
14 Mar 2024
WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity
Qiyuan Wang
Y. Liu
Shang Zhao
Rong Liu
S. Kevin Zhou
86
0
0
14 Mar 2024
Faceptor: A Generalist Model for Face Perception
Lixiong Qin
Mei Wang
Xuannan Liu
Yuhang Zhang
Weihong Deng
Xiaoshuai Song
Weiran Xu
Weihong Deng
CVBM
68
6
0
14 Mar 2024
RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes
Thang-Anh-Quan Nguyen
Luis Roldão
Nathan Piasco
Moussâb Bennehar
D. Tsishkou
134
6
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
101
13
0
14 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
97
18
0
14 Mar 2024
When Semantic Segmentation Meets Frequency Aliasing
Linwei Chen
Lin Gu
Ying Fu
110
6
0
14 Mar 2024
HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction
Yi Zhou
Hui Zhang
Jiaqian Yu
Yifan Yang
Sangil Jung
Seungsang Park
ByungIn Yoo
3DPC
110
20
0
13 Mar 2024
Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation
Daniel Honerkamp
Martin Buchner
Fabien Despinoy
Tim Welschehold
Abhinav Valada
LM&Ro
117
34
0
13 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
97
2
0
13 Mar 2024
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
Dingbang Li
Wenzhou Chen
Xin Lin
LLMAG
LM&Ro
77
4
0
13 Mar 2024
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Byung-Kwan Lee
Beomchan Park
Chae Won Kim
Yonghyun Ro
MLLM
VLM
133
23
0
12 Mar 2024
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Chunlong Xia
Xinliang Wang
Feng Lv
Xin Hao
Yifeng Shi
ViT
89
57
0
12 Mar 2024
Chart4Blind: An Intelligent Interface for Chart Accessibility Conversion
Omar Moured
Morris Baumgarten-Egemole
Alina Roitberg
Karin Muller
Thorsten Schwarz
Rainer Stiefelhagen
79
9
0
11 Mar 2024
Query-guided Prototype Evolution Network for Few-Shot Segmentation
Runmin Cong
Hang Xiong
Jinpeng Chen
Wei Zhang
Qingming Huang
Yao Zhao
105
15
0
11 Mar 2024
Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning
Woojin Ahn
G. Yang
H. Choi
M. Lim
61
8
0
10 Mar 2024
FrameQuant: Flexible Low-Bit Quantization for Transformers
Harshavardhan Adepu
Zhanpeng Zeng
Li Zhang
Vikas Singh
MQ
60
8
0
10 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Linwei Chen
Lin Gu
Ying Fu
120
29
0
08 Mar 2024
InstructGIE: Towards Generalizable Image Editing
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
99
9
0
08 Mar 2024
ComFe: An Interpretable Head for Vision Transformers
Evelyn J. Mannix
Howard Bondell
VLM
ViT
99
1
0
07 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
84
6
0
06 Mar 2024
DINOv2 based Self Supervised Learning For Few Shot Medical Image Segmentation
Lev Ayzenberg
Raja Giryes
H. Greenspan
66
4
0
05 Mar 2024
Deep Common Feature Mining for Efficient Video Semantic Segmentation
Yaoyan Zheng
Hongyu Yang
Di Huang
77
0
0
05 Mar 2024
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
Zijin Yin
Kongming Liang
Bing Li
Zhanyu Ma
Jun Guo
VLM
134
2
0
02 Mar 2024
A citizen science toolkit to collect human perceptions of urban environments using open street view images
Matthew Danish
SM Labib
Britta Ricker
Marco Helbich
62
9
0
29 Feb 2024
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
Niccolò Cavagnero
Gabriele Rosi
Claudia Cuttano
Francesca Pistilli
Marco Ciccone
Giuseppe Averta
Fabio Cermelli
108
23
0
29 Feb 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
107
18
0
28 Feb 2024
Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling
David S. W. Williams
Matthew Gadd
Paul Newman
Daniele De Martini
UQCV
37
1
0
27 Feb 2024
An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Guodong Sun
Yuting Peng
Lei Cheng
Mengya Xu
An-Chi Wang
Bo Wu
Hongliang Ren
Yang Zhang
75
2
0
27 Feb 2024
A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track
Zehui Chen
Qiuchen Wang
Zhenyu Li
Jiaming Liu
Shanghang Zhang
Feng Zhao
74
1
0
27 Feb 2024
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
Yichi Zhang
Ziqiao Ma
Xiaofeng Gao
Suhaila Shakiah
Qiaozi Gao
Joyce Chai
MLLM
VLM
133
47
0
26 Feb 2024
ConSept: Continual Semantic Segmentation via Adapter-based Vision Transformer
Bowen Dong
Guanglei Yang
W. Zuo
Lei Zhang
93
1
0
26 Feb 2024
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
Pau de Jorge
Riccardo Volpi
P. Dokania
Philip Torr
Grégory Rogez
DiffM
118
5
0
26 Feb 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
91
3
0
23 Feb 2024
Outlier detection by ensembling uncertainty with negative objectness
Anja Delić
Matej Grcić
Sinisa Segvic
UQCV
118
15
0
23 Feb 2024
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
Lianghui Zhu
Junwei Zhou
Yan Liu
Xin Hao
Wenyu Liu
Xinggang Wang
VLM
69
7
0
22 Feb 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
282
9
0
22 Feb 2024
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Hiroshi Murase
VLM
131
1
0
21 Feb 2024
Cell Graph Transformer for Nuclei Classification
Wei Lou
Guanbin Li
Xiang Wan
Haofeng Li
ViT
MedIm
118
5
0
20 Feb 2024
Object-level Geometric Structure Preserving for Natural Image Stitching
Wenxiao Cai
Wankou Yang
55
5
0
20 Feb 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
208
64
0
20 Feb 2024