Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.00759
Cited By
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"
50 / 329 papers shown
Title
Generalizable Entity Grounding via Assistance of Large Language Model
Lu Qi
Yi-Wen Chen
Lehan Yang
Tiancheng Shen
Xiangtai Li
Weidong Guo
Yu-Syuan Xu
Ming-Hsuan Yang
VLM
69
9
0
04 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
37
11
0
31 Jan 2024
OMG-Seg: Is One Model Good Enough For All Segmentation?
Xiangtai Li
Haobo Yuan
Wei Li
Henghui Ding
Size Wu
Wenwei Zhang
Yining Li
Kai Chen
Chen Change Loy
VLM
MLLM
ViT
80
53
0
18 Jan 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
46
11
0
18 Jan 2024
Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
38
5
0
15 Jan 2024
Low-Resource Vision Challenges for Foundation Models
Yunhua Zhang
Hazel Doughty
Cees G. M. Snoek
VLM
32
5
0
09 Jan 2024
SPFormer: Enhancing Vision Transformer with Superpixel Representation
Jieru Mei
Liang-Chieh Chen
Alan Yuille
Cihang Xie
ViT
MDE
21
4
0
05 Jan 2024
Unsupervised Universal Image Segmentation
Dantong Niu
Xudong Wang
Xinyang Han
Long Lian
Roei Herzig
Trevor Darrell
VLM
40
17
0
28 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip Torr
Xiuye Gu
Siyang Li
VLM
CLIP
39
32
0
12 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
31
5
0
11 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan Yuille
VLM
35
55
0
04 Dec 2023
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
Ju He
Qihang Yu
Inkyu Shin
XueQing Deng
Alan Yuille
Xiaohui Shen
Liang-Chieh Chen
VOS
42
2
0
30 Nov 2023
Semi-supervised Medical Image Segmentation via Query Distribution Consistency
Rong Wu
Dehua Li
Cong Zhang
33
1
0
21 Nov 2023
Towards Open-Ended Visual Recognition with Large Language Model
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
VLM
22
8
0
14 Nov 2023
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives
Rojina Kashefi
Leili Barekatain
Mohammad Sabokrou
Fatemeh Aghaeipoor
ViT
45
9
0
12 Nov 2023
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
36
14
0
09 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
34
2
0
01 Nov 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
31
5
0
25 Oct 2023
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding
Chunlei Wang
Wenquan Feng
Xiangtai Li
Guangliang Cheng
Shuchang Lyu
Binghao Liu
Lijiang Chen
Qi Zhao
ObjD
VLM
28
10
0
22 Oct 2023
VST++: Efficient and Stronger Visual Saliency Transformer
Nian Liu
Ziyang Luo
Ni Zhang
Junwei Han
ViT
31
15
0
18 Oct 2023
USDC: Unified Static and Dynamic Compression for Visual Transformer
Huan Yuan
Chao Liao
Jianchao Tan
Peng Yao
Jiyuan Jia
Bin Chen
Chengru Song
Di Zhang
ViT
25
0
0
17 Oct 2023
Rank-DETR for High Quality Object Detection
Yifan Pu
Weicong Liang
Yiduo Hao
Yuhui Yuan
Yukang Yang
Chao Zhang
Hanhua Hu
Gao Huang
46
57
0
13 Oct 2023
3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers
Jieneng Chen
Jieru Mei
Xianhang Li
Yongyi Lu
Qihang Yu
...
M. Lungren
Lei Xing
Le Lu
Alan Yuille
Yuyin Zhou
MedIm
ViT
41
36
0
11 Oct 2023
Completing Visual Objects via Bridging Generation and Segmentation
Xiang Li
Yinpeng Chen
Chung-Ching Lin
Hao Chen
Kai Hu
Rita Singh
Bhiksha Raj
Lijuan Wang
Zicheng Liu
DiffM
29
4
0
01 Oct 2023
Superpixel Transformers for Efficient Semantic Segmentation
Xiao Han
Jieru Mei
Lu Zhang
Hang Yan
Yongkai Wu
Liang-Chieh Chen
Henrik Kretzschmar
ViT
36
10
0
28 Sep 2023
Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning
Yang Liu
Chong Chen
Can Wang
Xulin King
Mengyuan Liu
3DPC
45
7
0
25 Sep 2023
ClusterFormer: Clustering As A Universal Visual Learner
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
VLM
39
9
0
22 Sep 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
41
12
0
21 Sep 2023
OccupancyDETR: Using DETR for Mixed Dense-sparse 3D Occupancy Prediction
Yupeng Jia
Jie He
Runze Chen
Fang Zhao
Haiyong Luo
3DPC
26
1
0
15 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation
Zhaochong An
Guolei Sun
Zongwei Wu
Hao Tang
Luc Van Gool
VOS
32
4
0
14 Sep 2023
Deep Video Restoration for Under-Display Camera
Xuanxi Chen
Tao Wang
Ziqian Shao
Kaihao Zhang
Wenhan Luo
Tong Lu
Zikun Liu
Tae-Kyun Kim
Hongdong Li
40
1
0
09 Sep 2023
Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation
Michael Jungo
Beat Wolf
Andrii Maksai
C. Musat
Andreas Fischer
47
2
0
06 Sep 2023
Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions
Yibo Wang
Yunhu Ye
Yuanpeng Mao
Yanwei Yu
Yuanping Song
41
2
0
25 Aug 2023
EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Zitao Wang
Qiguang Miao
Peipei Zhao
Yue Xi
ViT
30
2
0
24 Aug 2023
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
Junjiao Tian
Lavisha Aggarwal
Andrea Colaco
Z. Kira
Mar González-Franco
DiffM
33
77
0
23 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
33
8
0
22 Aug 2023
How Much Temporal Long-Term Context is Needed for Action Segmentation?
Emad Bahrami Rad
Gianpiero Francesca
Juergen Gall
ViT
27
26
0
22 Aug 2023
Vision Transformer Pruning Via Matrix Decomposition
Tianyi Sun
30
0
0
21 Aug 2023
LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark
Lojze Žust
J. Pers
Matej Kristan
VOS
26
15
0
18 Aug 2023
Agglomerative Transformer for Human-Object Interaction Detection
Danyang Tu
Wei Sun
Guangtao Zhai
Wei Shen
ViT
32
5
0
16 Aug 2023
RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs
Zhouxia Wang
Jiawei Zhang
Tianshui Chen
Wenping Wang
Ping Luo
41
16
0
14 Aug 2023
ACTIVE: Towards Highly Transferable 3D Physical Camouflage for Universal and Robust Vehicle Evasion
Naufal Suryanto
Yongsu Kim
Harashta Tatimma Larasati
Hyoeun Kang
Thi-Thu-Huong Le
Yoonyoung Hong
Hunmin Yang
Se-Yoon Oh
Howon Kim
AAML
38
23
0
14 Aug 2023
Revisiting Vision Transformer from the View of Path Ensemble
Shuning Chang
Pichao Wang
Haowen Luo
Fan Wang
Mike Zheng Shou
ViT
40
3
0
12 Aug 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLM
CLIP
45
136
0
04 Aug 2023
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
Wentong Li
Yu-Jie Yuan
Song Wang
Jianke Zhu
Jianshu Li
Jian Liu
Lei Zhang
3DPC
OT
32
19
0
03 Aug 2023
E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning
Cheng Han
Qifan Wang
Yiming Cui
Zhiwen Cao
Wenguan Wang
Siyuan Qi
Dongfang Liu
VPVLM
VLM
27
48
0
25 Jul 2023
Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
Yiming Cui
L. Yang
Hai-ping Yu
37
8
0
23 Jul 2023
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
Jinghao Wang
Zhengyu Wen
Xiangtai Li
Zujin Guo
Jingkang Yang
Ziwei Liu
48
17
0
17 Jul 2023
Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network
K. Yan
Xiaoli Yin
Yingda Xia
Fakai Wang
Shu Wang
...
Xiaoyu Bai
Jingren Zhou
Ling Zhang
Le Lu
Yu Shi
MedIm
40
5
0
17 Jul 2023
Unified Open-Vocabulary Dense Visual Prediction
Hengcan Shi
Munawar Hayat
Jianfei Cai
ObjD
VLM
43
19
0
17 Jul 2023
Previous
1
2
3
4
5
6
7
Next