Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.00759
Cited By
v1
v2
v3 (latest)
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1023★)
Papers citing
"MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"
50 / 323 papers shown
Title
ClusterFuG: Clustering Fully connected Graphs by Multicut
Ahmed Abbas
Paul Swoboda
87
3
0
28 Jan 2023
Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal Networks
Chen Pang
Xuequan Lu
Lei Lyu
95
22
0
27 Jan 2023
Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting
Kaiwen Zhang
Jialun Peng
Jingjing Fu
Dong Liu
ViT
91
9
0
24 Jan 2023
Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision
Jilan Xu
Junlin Hou
Yuejie Zhang
Rui Feng
Yi Wang
Yu Qiao
Weidi Xie
VLM
86
87
0
22 Jan 2023
MultiNet with Transformers: A Model for Cancer Diagnosis Using Images
H. Barzekar
Yash J. Patel
L. Tong
Zeyun Yu
MedIm
98
6
0
21 Jan 2023
Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation
S. D. Dao
Hengcan Shi
Dinh Q. Phung
Jianfei Cai
VLM
64
0
0
18 Jan 2023
Linguistic Query-Guided Mask Generation for Referring Image Segmentation
Zhichao Wei
Xiaohao Chen
Mingqiang Chen
Siyu Zhu
VLM
120
1
0
16 Jan 2023
Vision Transformers Are Good Mask Auto-Labelers
Shiyi Lan
Xitong Yang
Zhiding Yu
Zuxuan Wu
J. Álvarez
Anima Anandkumar
ISeg
ViT
MedIm
97
19
0
10 Jan 2023
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Haowei Wang
Jiayi Ji
Yiyi Zhou
Yongjian Wu
Xiaoshuai Sun
84
15
0
09 Jan 2023
InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation
Fei He
Haoyang Zhang
Naiyu Gao
Jian Jia
Yanhu Shan
Xin Zhao
Kaiqi Huang
ISeg
136
15
0
05 Jan 2023
Semi-MAE: Masked Autoencoders for Semi-supervised Vision Transformers
Haojie Yu
Kangnian Zhao
Xiaoming Xu
ViT
83
1
0
04 Jan 2023
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Huiyu Wang
Mitesh Singh
Lorenzo Torresani
EgoV
128
26
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
160
21
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
180
31
0
02 Jan 2023
PanDepth: Joint Panoptic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
3DPC
VLM
87
1
0
29 Dec 2022
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM
MLLM
ObjD
134
259
0
21 Dec 2022
LUMix: Improving Mixup by Better Modelling Label Uncertainty
Shuyang Sun
Jieneng Chen
Ruifei He
Alan Yuille
Philip Torr
Song Bai
UQCV
NoLa
74
5
0
29 Nov 2022
RbA: Segmenting Unknown Regions Rejected by All
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
102
47
0
25 Nov 2022
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
Fabio Cermelli
Matthieu Cord
Arthur Douillard
CLL
VLM
97
22
0
25 Nov 2022
Mutual Guidance and Residual Integration for Image Enhancement
Kun Zhou
Kenkun Liu
Wenbo Li
Xiaoguang Han
Jiangbo Lu
72
1
0
25 Nov 2022
High-Quality Entity Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Tiancheng Shen
Jiuxiang Gu
Jiaya Jia
Zhe Lin
Ming-Hsuan Yang
ISeg
112
55
0
10 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
81
349
0
10 Nov 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
Yuechen Wang
Wen-gang Zhou
Houqiang Li
AI4TS
63
13
0
21 Oct 2022
Instance Segmentation with Cross-Modal Consistency
A. Z. Zhu
Vincent Casser
R. Mahjourian
Henrik Kretzschmar
Soren Pirk
ISeg
90
1
0
14 Oct 2022
Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation
Yuanwei Liu
Nian Liu
Xiwen Yao
Junwei Han
65
63
0
13 Oct 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting-Li Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen
MLLM
124
104
0
12 Oct 2022
Fine-Grained Image Style Transfer with Visual Transformers
Jianbo Wang
Huan Yang
Jianlong Fu
T. Yamasaki
B. Guo
ViT
115
14
0
11 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
126
66
0
04 Oct 2022
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN Models
Songsong Xiong
Georgios Tziafas
Hamidreza Kasaei
ViT
57
3
0
03 Oct 2022
A Review of Modern Approaches for Coronary Angiography Imaging Analysis
Maxim Y Popov
Temirgali Aimyshev
Eldar Ismailov
Ablay Bulegenov
S. Fazli
37
3
0
28 Sep 2022
PointScatter: Point Set Representation for Tubular Structure Extraction
Dong Wang
Zhao Zhang
Zi-Long Zhao
Yuhang Liu
Yihong Chen
Liwei Wang
3DPC
84
12
0
13 Sep 2022
CenterFormer: Center-based Transformer for 3D Object Detection
Zixiang Zhou
Xian Zhao
Yu Wang
Panqu Wang
H. Foroosh
3DPC
ViT
114
142
0
12 Sep 2022
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge
Stefano Gasperini
Alvaro Marcos-Ramiro
Michael Schmidt
Nassir Navab
Benjamin Busam
F. Tombari
102
8
0
12 Sep 2022
SUNet: Scale-aware Unified Network for Panoptic Segmentation
Wei Yan
Yeqiang Qian
Chunxiang Wang
Ming Yang
SSeg
88
0
0
07 Sep 2022
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
Nadine Behrmann
S. Golestaneh
Zico Kolter
Juergen Gall
M. Noroozi
108
75
0
01 Sep 2022
VMFormer: End-to-End Video Matting with Transformer
Jiacheng Li
Vidit Goel
Marianna Ohanyan
Shant Navasardyan
Yunchao Wei
Humphrey Shi
ViT
87
19
0
26 Aug 2022
Multiple Instance Neuroimage Transformer
Ayush Singla
Qingyu Zhao
Daniel K. Do
Yuyin Zhou
K. Pohl
Ehsan Adeli
ViT
MedIm
69
11
0
19 Aug 2022
SO(3)-Pose: SO(3)-Equivariance Learning for 6D Object Pose Estimation
Haoran Pan
Jun Zhou
Yuanpeng Liu
Xuequan Lu
Weiming Wang
Xu Yan
Mingqiang Wei
69
5
0
17 Aug 2022
Flow-Guided Transformer for Video Inpainting
Kaiwen Zhang
Jingjing Fu
Dong Liu
ViT
84
72
0
14 Aug 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding
Zixiang Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Si Liu
96
14
0
11 Aug 2022
Multi-scale Feature Aggregation for Crowd Counting
Xiaoheng Jiang
Xinyi Wu
Hisham Cholakkal
Rao Muhammad Anwer
Jiale Xu
Bing Zhou
Yanwei Pang
Fahad Shahbaz Khan
63
1
0
10 Aug 2022
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
Bolin Lai
Miao Liu
Fiona Ryan
James M. Rehg
ViT
99
37
0
08 Aug 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
95
187
0
06 Aug 2022
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Cong Wang
Hongmin Xu
Xiong Zhang
Li Wang
Zhitong Zheng
Haifeng Liu
ViT
61
23
0
27 Jul 2022
DETRs with Hybrid Matching
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
85
200
0
26 Jul 2022
Transformer with Implicit Edges for Particle-based Physics Simulation
Yidi Shao
Chen Change Loy
Bo Dai
106
17
0
22 Jul 2022
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
Yaqian Liang
Shanshan Zhao
Baosheng Yu
Jing Zhang
Fazhi He
ViT
97
39
0
20 Jul 2022
Weakly Supervised Video Salient Object Detection via Point Supervision
Shuyong Gao
Hao Xing
Wei Zhang
Yan Wang
Qianyu Guo
Wenqiang Zhang
69
26
0
15 Jul 2022
Online Video Instance Segmentation via Robust Context Fusion
Xiang Li
Jinglu Wang
Xiaohao Xu
Bhiksha Raj
Yan Lu
74
5
0
12 Jul 2022
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Daniel Seichter
Söhnke Benedikt Fischedick
Mona Köhler
H. Groß
89
40
0
10 Jul 2022
Previous
1
2
3
4
5
6
7
Next