Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,408 papers shown
Title
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation
Tao Zhang
Jinyong Wen
Zhen Chen
Kun Ding
Di Zhang
Chunhong Pan
262
1
0
04 Feb 2025
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis
B. Alawode
I. I. Ganapathi
S. Javed
Naoufel Werghi
Mohammed Bennamoun
Arif Mahmood
CLIP
VLM
110
1
0
03 Feb 2025
Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation
Lin Chen
Qi Yang
Kun Ding
Zhu Li
Gang Shen
Fei Li
Qiyuan Cao
Shiming Xiang
VLM
80
0
0
29 Jan 2025
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification
Lanyun Zhu
Tianrun Chen
Deyi Ji
Jieping Ye
Jing Liu
156
2
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
122
6
0
28 Jan 2025
Neural Radiance Fields for the Real World: A Survey
Wenhui Xiao
Remi Chierchia
Rodrigo Santa Cruz
Xuesong Li
David Ahmedt-Aristizabal
Olivier Salvado
Clinton Fookes
Léo Lebrat
AI4CE
180
0
0
22 Jan 2025
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
151
1
0
22 Jan 2025
Towards Accurate Unified Anomaly Segmentation
Wenxin Ma
Qingsong Yao
Xiang Zhang
Zhelong Huang
Zihang Jiang
S. Kevin Zhou
136
2
0
21 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
77
1
0
20 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
288
27
0
17 Jan 2025
Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation
Xingxin He
Yifan Hu
Zhaoye Zhou
Mohamed Jarraya
Fang Liu
VLM
MedIm
105
2
0
17 Jan 2025
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding
Haomiao Xiong
Yunzhi Zhuge
Jiawen Zhu
Lu Zhang
Huchuan Lu
79
3
0
14 Jan 2025
Static Segmentation by Tracking: A Label-Efficient Approach for Fine-Grained Specimen Image Segmentation
Zhenyang Feng
Zihe Wang
Saul Ibaven Bueno
Saul Ibaven Bueno
Tomasz Frelek
...
Hilmar Lapp
Charles V. Stewart
T. Berger-Wolf
Yu-Chuan Su
Wei-Lun Chao
94
0
0
12 Jan 2025
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
Duc-Hai Pham
Duc Dung Nguyen
Anh Pham
Ho Lai Tuan
P. Nguyen
Khoi Duc Minh Nguyen
Rang Nguyen
3DPC
170
1
0
10 Jan 2025
AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish
S. Bengtson
Daniel Lehotský
Vasiliki Ismiroglou
Niels Madsen
T. Moeslund
Malte Pedersen
64
0
0
08 Jan 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
Xianrui Li
Tao Zhang
Zilong Huang
Shilin Xu
S. Ji
Yunhai Tong
Lu Qi
Jiashi Feng
Ming-Hsuan Yang
VLM
195
25
0
07 Jan 2025
Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves
Madeleine Darbyshire
Elizabeth I. Sklar
Simon Parsons
138
0
0
03 Jan 2025
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Runnan Chen
Zhaoqing Wang
Jiepeng Wang
Yuexin Ma
Mingming Gong
Wenping Wang
Tongliang Liu
3DGS
106
3
0
03 Jan 2025
A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Dawen Yu
Shunping Ji
ViT
114
2
0
03 Jan 2025
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
S. Park
Subeen Lee
Hyun Seok Seong
Jaejoon Yoo
Jae-Pil Heo
129
1
0
03 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM
VLM
LRM
363
59
0
03 Jan 2025
Unlocking adaptive digital pathology through dynamic feature learning
Jiawen Li
Tian Guan
Qingxin Xia
Yanjie Wang
Xitong Ling
...
Xiu-Wu Bian
Ziyi Wang
Lingchuan Guo
Chao He
Yonghong He
AI4CE
72
0
0
31 Dec 2024
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
284
5
0
31 Dec 2024
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training
Fardin Ayar
Ehsan Javanmardi
Manabu Tsukada
Mahdi Javanmardi
Mohammad Rahmati
VOS
279
0
0
31 Dec 2024
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
299
1
0
29 Dec 2024
Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection
Yikang Zhang
Chuang-Wei Liu
Jiahang Li
Yingbing Chen
Jie Cheng
Rui Fan
80
0
0
23 Dec 2024
Segmentation of arbitrary features in very high resolution remote sensing imagery
Henry Cording
Yves Plancherel
Pablo Brito-Parada
120
0
0
20 Dec 2024
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer
Xinyue Chen
Miaojing Shi
Zijian Zhou
Lianghua He
Sophia Tsoka
125
0
0
20 Dec 2024
FashionComposer: Compositional Fashion Image Generation
S. Ji
Yiyang Wang
Xi Chen
Xiaogang Xu
Hao Luo
Hengshuang Zhao
151
0
0
18 Dec 2024
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
J. Zhang
Li Zhang
Shijian Li
VLM
177
0
0
18 Dec 2024
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
Cong Wei
Yujie Zhong
Haoxian Tan
Yingsen Zeng
Yong Liu
Zheng Zhao
Yujiu Yang
MLLM
VLM
VOS
152
3
0
18 Dec 2024
Locate n' Rotate: Two-stage Openable Part Detection with Foundation Model Priors
Siqi Li
Xiaoxue Chen
Haoyu Cheng
Guyue Zhou
Hao Zhao
Guanzhong Tian
197
0
0
17 Dec 2024
Open-World Panoptic Segmentation
Matteo Sodano
Federico Magistri
Jens Behley
Cyrill Stachniss
VLM
159
0
0
17 Dec 2024
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation
Hongwei Niu
Linhuang Xie
Jianghang Lin
Shengchuan Zhang
137
3
0
16 Dec 2024
Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD)
K. Oh
Leonardo Borgioli
Alberto Mangano
Valentina Valle
Marco Di Pangrazio
...
Luciano Ambrosini
Alvaro Ducas
Milos Zefran
Liaohai Chen
P. Giulianotti
124
1
0
16 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
143
2
0
16 Dec 2024
DINO-Foresight
\texttt{DINO-Foresight}
DINO-Foresight
: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
150
3
0
16 Dec 2024
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection
Zijian Gu
Jianwei Ma
Yan Huang
Honghao Wei
Zhanye Chen
Huatian Zhang
Wei Hong
159
2
0
16 Dec 2024
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Yunxiang Fu
Meng Lou
Yizhou Yu
324
1
0
16 Dec 2024
Mask Enhanced Deeply Supervised Prostate Cancer Detection on B-mode Micro-Ultrasound
Lichun Zhang
Steve Zhou
Moon Hyung Choi
Jeong Hoon Lee
Shengtian Sang
...
Wei Shao
Ahmed N. El Kaffas
Richard E. Fan
G. Sonn
M. Rusu
MedIm
129
0
0
14 Dec 2024
Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation
Jurica Runtas
Tomislav Petkovic
UQCV
130
0
0
14 Dec 2024
MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance
Wenjun Huang
Jianguo Hu
123
0
0
14 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
152
1
0
13 Dec 2024
Coherent 3D Scene Diffusion From a Single RGB Image
Manuel Dahnert
Angela Dai
Norman Muller
Matthias Nießner
127
0
0
13 Dec 2024
Continual Learning for Segment Anything Model Adaptation
Jinglong Yang
Yichen Wu
Jun Cen
Wenjian Huang
Hong Wang
Jianguo Zhang
TTA
CLL
120
0
0
09 Dec 2024
A Pipeline and NIR-Enhanced Dataset for Parking Lot Segmentation
Shirin Qiam
Saipraneeth Devunuri
Lewis J. Lehe
84
0
0
09 Dec 2024
Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization
Maxime Fontana
Michael W. Spratling
Miaojing Shi
MoE
VLM
153
0
0
04 Dec 2024
SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection
Joongwon Chae
Zhenyu Wang
Peiwu Qin
VLM
105
0
0
03 Dec 2024
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
Byung-Kwan Lee
Ryo Hachiuma
Yu-Chiang Frank Wang
Y. Ro
Yueh-Hua Wu
VLM
147
1
0
02 Dec 2024
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Anton Voronov
Denis Kuznedelev
Mikhail Khoroshikh
Valentin Khrulkov
Dmitry Baranchuk
267
4
0
02 Dec 2024
Previous
1
2
3
4
5
6
...
27
28
29
Next