ResearchTrend.AI
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
    ViT
ArXiv (abs) | PDF | HTML | Github (1023★)

Papers citing "MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"

50 / 323 papers shown
ClusterFuG: Clustering Fully connected Graphs by Multicut
Ahmed Abbas
Paul Swoboda
87
3
0
28 Jan 2023
Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal Networks
Chen Pang
Xuequan Lu
Lei Lyu
95
22
0
27 Jan 2023
Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting
Kaiwen Zhang
Jialun Peng
Jingjing Fu
Dong Liu
ViT
91
9
0
24 Jan 2023
Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision
Jilan Xu
Junlin Hou
Yuejie Zhang
Rui Feng
Yi Wang
Yu Qiao
Weidi Xie
VLM
86
87
0
22 Jan 2023
MultiNet with Transformers: A Model for Cancer Diagnosis Using Images
H. Barzekar
Yash J. Patel
L. Tong
Zeyun Yu
MedIm
98
6
0
21 Jan 2023
Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation
S. D. Dao
Hengcan Shi
Dinh Q. Phung
Jianfei Cai
VLM
64
0
0
18 Jan 2023
Linguistic Query-Guided Mask Generation for Referring Image Segmentation
Zhichao Wei
Xiaohao Chen
Mingqiang Chen
Siyu Zhu
VLM
120
1
0
16 Jan 2023
Vision Transformers Are Good Mask Auto-Labelers
Shiyi Lan
Xitong Yang
Zhiding Yu
Zuxuan Wu
J. Álvarez
Anima Anandkumar
ISeg ViT MedIm
97
19
0
10 Jan 2023
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Haowei Wang
Jiayi Ji
Yiyi Zhou
Yongjian Wu
Xiaoshuai Sun
84
15
0
09 Jan 2023
InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation
Fei He
Haoyang Zhang
Naiyu Gao
Jian Jia
Yanhu Shan
Xin Zhao
Kaiqi Huang
ISeg
136
15
0
05 Jan 2023
Semi-MAE: Masked Autoencoders for Semi-supervised Vision Transformers
Haojie Yu
Kangnian Zhao
Xiaoming Xu
ViT
83
1
0
04 Jan 2023
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Huiyu Wang
Mitesh Singh
Lorenzo Torresani
EgoV
128
26
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
160
21
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
180
31
0
02 Jan 2023
PanDepth: Joint Panoptic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
3DPC VLM
87
1
0
29 Dec 2022
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM MLLM ObjD
134
259
0
21 Dec 2022
LUMix: Improving Mixup by Better Modelling Label Uncertainty
Shuyang Sun
Jieneng Chen
Ruifei He
Alan Yuille
Philip Torr
Song Bai
UQCV NoLa
74
5
0
29 Nov 2022
RbA: Segmenting Unknown Regions Rejected by All
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
102
47
0
25 Nov 2022
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation
Fabio Cermelli
Matthieu Cord
Arthur Douillard
CLL VLM
97
22
0
25 Nov 2022
Mutual Guidance and Residual Integration for Image Enhancement
Kun Zhou
Kenkun Liu
Wenbo Li
Xiaoguang Han
Jiangbo Lu
72
1
0
25 Nov 2022
High-Quality Entity Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Tiancheng Shen
Jiuxiang Gu
Jiaya Jia
Zhe Lin
Ming-Hsuan Yang
ISeg
112
55
0
10 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
81
349
0
10 Nov 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
Yuechen Wang
Wen-gang Zhou
Houqiang Li
AI4TS
63
13
0
21 Oct 2022
Instance Segmentation with Cross-Modal Consistency
A. Z. Zhu
Vincent Casser
R. Mahjourian
Henrik Kretzschmar
Soren Pirk
ISeg
90
1
0
14 Oct 2022
Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation
Yuanwei Liu
Nian Liu
Xiwen Yao
Junwei Han
65
63
0
13 Oct 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting-Li Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen MLLM
124
104
0
12 Oct 2022
Fine-Grained Image Style Transfer with Visual Transformers
Jianbo Wang
Huan Yang
Jianlong Fu
T. Yamasaki
B. Guo
ViT
115
14
0
11 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT MoE
126
66
0
04 Oct 2022
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN Models
Songsong Xiong
Georgios Tziafas
Hamidreza Kasaei
ViT
57
3
0
03 Oct 2022
A Review of Modern Approaches for Coronary Angiography Imaging Analysis
Maxim Y Popov
Temirgali Aimyshev
Eldar Ismailov
Ablay Bulegenov
S. Fazli
37
3
0
28 Sep 2022
PointScatter: Point Set Representation for Tubular Structure Extraction
Dong Wang
Zhao Zhang
Zi-Long Zhao
Yuhang Liu
Yihong Chen
Liwei Wang
3DPC
84
12
0
13 Sep 2022
CenterFormer: Center-based Transformer for 3D Object Detection
Zixiang Zhou
Xian Zhao
Yu Wang
Panqu Wang
H. Foroosh
3DPC ViT
114
142
0
12 Sep 2022
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge
Stefano Gasperini
Alvaro Marcos-Ramiro
Michael Schmidt
Nassir Navab
Benjamin Busam
F. Tombari
102
8
0
12 Sep 2022
SUNet: Scale-aware Unified Network for Panoptic Segmentation
Wei Yan
Yeqiang Qian
Chunxiang Wang
Ming Yang
SSeg
88
0
0
07 Sep 2022
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
Nadine Behrmann
S. Golestaneh
Zico Kolter
Juergen Gall
M. Noroozi
108
75
0
01 Sep 2022
VMFormer: End-to-End Video Matting with Transformer
Jiacheng Li
Vidit Goel
Marianna Ohanyan
Shant Navasardyan
Yunchao Wei
Humphrey Shi
ViT
87
19
0
26 Aug 2022
Multiple Instance Neuroimage Transformer
Ayush Singla
Qingyu Zhao
Daniel K. Do
Yuyin Zhou
K. Pohl
Ehsan Adeli
ViT MedIm
69
11
0
19 Aug 2022
SO(3)-Pose: SO(3)-Equivariance Learning for 6D Object Pose Estimation
Haoran Pan
Jun Zhou
Yuanpeng Liu
Xuequan Lu
Weiming Wang
Xu Yan
Mingqiang Wei
69
5
0
17 Aug 2022
Flow-Guided Transformer for Video Inpainting
Kaiwen Zhang
Jingjing Fu
Dong Liu
ViT
84
72
0
14 Aug 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding
Zixiang Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Si Liu
96
14
0
11 Aug 2022
Multi-scale Feature Aggregation for Crowd Counting
Xiaoheng Jiang
Xinyi Wu
Hisham Cholakkal
Rao Muhammad Anwer
Jiale Xu
Bing Zhou
Yanwei Pang
Fahad Shahbaz Khan
63
1
0
10 Aug 2022
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
Bolin Lai
Miao Liu
Fiona Ryan
James M. Rehg
ViT
99
37
0
08 Aug 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT MDE
95
187
0
06 Aug 2022
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Cong Wang
Hongmin Xu
Xiong Zhang
Li Wang
Zhitong Zheng
Haifeng Liu
ViT
61
23
0
27 Jul 2022
DETRs with Hybrid Matching
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
85
200
0
26 Jul 2022
Transformer with Implicit Edges for Particle-based Physics Simulation
Yidi Shao
Chen Change Loy
Bo Dai
106
17
0
22 Jul 2022
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
Yaqian Liang
Shanshan Zhao
Baosheng Yu
Jing Zhang
Fazhi He
ViT
97
39
0
20 Jul 2022
Weakly Supervised Video Salient Object Detection via Point Supervision
Shuyong Gao
Hao Xing
Wei Zhang
Yan Wang
Qianyu Guo
Wenqiang Zhang
69
26
0
15 Jul 2022
Online Video Instance Segmentation via Robust Context Fusion
Xiang Li
Jinglu Wang
Xiaohao Xu
Bhiksha Raj
Yan Lu
74
5
0
12 Jul 2022
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Daniel Seichter
Söhnke Benedikt Fischedick
Mona Köhler
H. Groß
89
40
0
10 Jul 2022