Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01527
Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,365 papers shown
Title
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
42
0
0
14 Jun 2024
Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors
Chaeyeon Han
Pavan Seshadri
Yiwei Ding
Noah Posner
B. Koo
Animesh Agrawal
Alexander Lerch
S. Guhathakurta
26
2
0
14 Jun 2024
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers
Narges Norouzi
Svetlana Orlova
Daan de Geus
Gijs Dubbelman
ViT
FedML
48
4
0
14 Jun 2024
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
Xiangheng Shan
Dongyue Wu
Guilin Zhu
Yuanjie Shao
Nong Sang
Changxin Gao
VLM
29
16
0
14 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
59
332
0
13 Jun 2024
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Roman Bachmann
Oğuzhan Fatih Kar
David Mizrahi
Ali Garjani
Mingfei Gao
David Griffiths
Jiaming Hu
Afshin Dehghan
Amir Zamir
MoE
VLM
MLLM
41
14
0
13 Jun 2024
Scale-Invariant Monocular Depth Estimation via SSI Depth
S. M. H. Miangoleh
Mahesh Kumar Krishna Reddy
Yağız Aksoy
MDE
31
5
0
13 Jun 2024
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-xiong Wang
VOS
63
7
0
12 Jun 2024
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn
Christian Rupprecht
48
9
0
12 Jun 2024
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
Zhensong Xu
Jiangtao Yao
Chengjing Wu
Ting Liu
Luoqi Liu
23
1
0
12 Jun 2024
ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh
R. Tamburo
Shen Zheng
Juan R. Alvarez-Padilla
Hailiang Zhu
Michael Cardei
Nicholas Dunn
Christoph Mertz
Srinivasa G. Narasimhan
49
1
0
11 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
70
4
0
11 Jun 2024
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
Shijie Lian
Ziyi Zhang
Hua Li
Wenjie Li
Laurence Tianruo Yang
Sam Kwong
Runmin Cong
VLM
31
12
0
10 Jun 2024
Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Zhiyuan Cheng
Cheng Han
James Liang
Qifan Wang
Xiangyu Zhang
Dongfang Liu
AAML
40
4
0
09 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
80
12
0
09 Jun 2024
ProMotion: Prototypes As Motion Learners
Yawen Lu
Dongfang Liu
Qifan Wang
Cheng Han
Yiming Cui
Zhiwen Cao
Xueling Zhang
Yingjie Victor Chen
Heng Fan
DiffM
46
2
0
07 Jun 2024
Semantic Segmentation on VSPW Dataset through Masked Video Consistency
Chen Liang
Qiang Guo
Chongkai Yu
Chengjing Wu
Ting Liu
Luoqi Liu
50
1
0
07 Jun 2024
3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation
Feiyu Pan
Hao Fang
Xiankai Lu
34
3
0
07 Jun 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Yang Sui
Yanyu Li
Anil Kag
Yerlan Idelbayev
Junli Cao
Ju Hu
Dhritiman Sagar
Bo Yuan
Sergey Tulyakov
Jian Ren
MQ
44
18
0
06 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
40
22
0
06 Jun 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
36
0
0
06 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Lu Yang
Pu Cao
Liulei Li
Huadong Ma
46
1
0
06 Jun 2024
Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy
Yunho Kim
Jeong Hyun Lee
Choongin Lee
Juhyeok Mun
D. Youm
Jeongsoo Park
Jemin Hwangbo
37
1
0
05 Jun 2024
P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images
Tao Zhang
Shiqing Wei
Yikang Zhou
M. Luo
Wenling You
Shunping Ji
19
1
0
05 Jun 2024
FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping
Yuzhou Ji
He Zhu
Junshu Tang
Wuyi Liu
Zhizhong Zhang
Yuan Xie
Xin Tan
39
8
0
04 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Fahad Shahbaz Khan
VLM
ISeg
83
6
0
04 Jun 2024
Segmentation-Free Guidance for Text-to-Image Diffusion Models
K. Azarian
Debasmit Das
Qiqi Hou
Fatih Porikli
VLM
59
0
0
03 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
52
2
0
03 Jun 2024
MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images
Ke-Lei Wang
Pin-Hsuan Chou
Young-Ching Chou
Chia-Jen Liu
Cheng-Kuan Lin
Yu-Chee Tseng
31
0
0
03 Jun 2024
On the Nonlinearity of Layer Normalization
Yunhao Ni
Yuxin Guo
Junlong Jia
Lei Huang
47
4
0
03 Jun 2024
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
37
0
0
03 Jun 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Yunheng Li
Zhongyu Li
Quansheng Zeng
Qibin Hou
Ming-Ming Cheng
VLM
48
8
0
02 Jun 2024
Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation
Xinyue Chen
Miaojing Shi
48
0
0
01 Jun 2024
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Biao Wu
Diankai Zhang
Sihan Gao
Cheng-yong Zheng
Shaoli Liu
Ning Wang
27
0
0
01 Jun 2024
Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen
Jun Hao Liew
Long Mai
Lu Qi
Jiashi Feng
Jiaya Jia
DiffM
30
1
0
31 May 2024
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
21
2
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
44
4
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
81
6
0
30 May 2024
View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields
Haodi He
Colton Stearns
Adam W. Harley
Leonidas J. Guibas
3DV
41
2
0
30 May 2024
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation
Niclas Vodisch
Kürsat Petek
Markus Kappeler
Abhinav Valada
Wolfram Burgard
VLM
40
4
0
29 May 2024
BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation
Tengbo Wang
Yu Bai
ISeg
47
0
0
27 May 2024
Memorize What Matters: Emergent Scene Decomposition from Multitraverse
Yiming Li
Zehong Wang
Yue Wang
Zhiding Yu
Zan Gojcic
Marco Pavone
Chen Feng
Jose M. Alvarez
3DGS
57
1
0
27 May 2024
HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation
Zhuoyan Luo
Yinghao Wu
Yong-Jin Liu
Yicheng Xiao
Xiao-Ping Zhang
Yujiu Yang
38
0
0
24 May 2024
U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation
Bingyu Li
Da Zhang
Zhiyuan Zhao
Junyu Gao
Xuelong Li
30
5
0
24 May 2024
Synergistic Global-space Camera and Human Reconstruction from Videos
Yizhou Zhao
Tuanfeng Y. Wang
Bhiksha Raj
Min Xu
Jimei Yang
Chun-Hao Paul Huang
3DGS
3DH
38
1
0
23 May 2024
Efficient Robot Learning for Perception and Mapping
Niclas Vodisch
SSL
40
0
0
23 May 2024
RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting
Zhiheng Feng
Wenhua Wu
Hesheng Wang
3DGS
42
0
0
23 May 2024
Tuning-free Universally-Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
55
1
0
23 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu
Lingzhi Zhang
Jianbo Shi
58
12
0
23 May 2024
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Yuhang Yang
Wei Zhai
Chengfeng Wang
Chengjun Yu
Yang Cao
Zheng-jun Zha
44
5
0
22 May 2024
Previous
1
2
3
...
9
10
11
...
26
27
28
Next