ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.00759
  4. Cited By
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
    ViT
ArXivPDFHTML

Papers citing "MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"

50 / 329 papers shown
Title
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
Golnaz Ghiasi
Xiuye Gu
Huayu Chen
Nayeon Lee
VLM
47
371
0
22 Dec 2021
MPViT: Multi-Path Vision Transformer for Dense Prediction
MPViT: Multi-Path Vision Transformer for Dense Prediction
Youngwan Lee
Jonghee Kim
Jeffrey Willette
Sung Ju Hwang
ViT
29
245
0
21 Dec 2021
Lite Vision Transformer with Enhanced Self-Attention
Lite Vision Transformer with Enhanced Self-Attention
Chenglin Yang
Yilin Wang
Jianming Zhang
He Zhang
Zijun Wei
Zhe-nan Lin
Alan Yuille
ViT
21
114
0
20 Dec 2021
Slot-VPS: Object-centric Representation Learning for Video Panoptic
  Segmentation
Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
Yi Zhou
Hui Zhang
Hana Lee
Shuyang Sun
Pingjun Li
Yangguang Zhu
ByungIn Yoo
Xiaojuan Qi
Jae-Joon Han
VOS
40
26
0
16 Dec 2021
QAHOI: Query-Based Anchors for Human-Object Interaction Detection
QAHOI: Query-Based Anchors for Human-Object Interaction Detection
Junwen Chen
Keiji Yanai
26
40
0
16 Dec 2021
DualFormer: Local-Global Stratified Transformer for Efficient Video
  Recognition
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
Keli Zhang
Pan Zhou
Roger Zimmermann
Shuicheng Yan
ViT
32
21
0
09 Dec 2021
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic
  Segmentation
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Haobo Yuan
Xiangtai Li
Yibo Yang
Guangliang Cheng
Jing Zhang
Yunhai Tong
Lefei Zhang
Dacheng Tao
MDE
47
42
0
05 Dec 2021
Hybrid Instance-aware Temporal Fusion for Online Video Instance
  Segmentation
Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation
Xiang Li
Jinglu Wang
Xiao Li
Yan Lu
38
19
0
03 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
144
2,281
0
02 Dec 2021
End-to-End Referring Video Object Segmentation with Multimodal
  Transformers
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Adam Botach
Evgenii Zheltonozhskii
Chaim Baskin
VOS
34
141
0
29 Nov 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point
  Modeling
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
51
655
0
29 Nov 2021
Efficient Self-Ensemble for Semantic Segmentation
Efficient Self-Ensemble for Semantic Segmentation
Walid Bousselham
Guillaume Thibault
Lucas Pagano
Archana Machireddy
Joe W. Gray
Y. Chang
Xubo B. Song
ViT
33
24
0
26 Nov 2021
PTQ4ViT: Post-training quantization for vision transformers with twin
  uniform quantization
PTQ4ViT: Post-training quantization for vision transformers with twin uniform quantization
Zhihang Yuan
Chenhao Xue
Yiqi Chen
Qiang Wu
Guangyu Sun
ViT
MQ
33
133
0
24 Nov 2021
Pruning Self-attentions into Convolutional Layers in Single Path
Pruning Self-attentions into Convolutional Layers in Single Path
Haoyu He
Jianfei Cai
Jing Liu
Zizheng Pan
Jing Zhang
Dacheng Tao
Bohan Zhuang
ViT
34
40
0
23 Nov 2021
DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
Renrui Zhang
Ziyao Zeng
Ziyu Guo
Xing Gao
Kexue Fu
Jianbo Shi
3DPC
24
26
0
19 Nov 2021
TransMix: Attend to Mix for Vision Transformers
TransMix: Attend to Mix for Vision Transformers
Jieneng Chen
Shuyang Sun
Ju He
Philip Torr
Alan Yuille
S. Bai
ViT
30
103
0
18 Nov 2021
A Survey of Visual Transformers
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
79
332
0
11 Nov 2021
Sampling Equivariant Self-attention Networks for Object Detection in
  Aerial Images
Sampling Equivariant Self-attention Networks for Object Detection in Aerial Images
Guo-Ye Yang
Xiang-Li Li
Ralph Robert Martin
Shimin Hu
3DPC
21
13
0
05 Nov 2021
DocTr: Document Image Transformer for Geometric Unwarping and
  Illumination Correction
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
Hao Feng
Yuechen Wang
Wen-gang Zhou
Jiajun Deng
Houqiang Li
ViT
8
58
0
25 Oct 2021
Video Instance Segmentation by Instance Flow Assembly
Video Instance Segmentation by Instance Flow Assembly
Xiang Li
Jinglu Wang
Xiao Li
Yan Lu
VOS
27
15
0
20 Oct 2021
ASFormer: Transformer for Action Segmentation
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
79
174
0
16 Oct 2021
The Center of Attention: Center-Keypoint Grouping via Attention for
  Multi-Person Pose Estimation
The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation
Guillem Brasó
Nikita Kister
Laura Leal-Taixé
3DPC
16
40
0
11 Oct 2021
ProTo: Program-Guided Transformer for Program-Guided Tasks
ProTo: Program-Guided Transformer for Program-Guided Tasks
Zelin Zhao
Karan Samel
Binghong Chen
Le Song
ViT
LM&Ro
34
30
0
02 Oct 2021
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with
  Transformers
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
Zhiqi Li
Wenhai Wang
Enze Xie
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
Tong Lu
ViT
34
135
0
08 Sep 2021
Voxel Transformer for 3D Object Detection
Voxel Transformer for 3D Object Detection
Jiageng Mao
Yujing Xue
Minzhe Niu
Haoyue Bai
Jiashi Feng
Xiaodan Liang
Hang Xu
Chunjing Xu
3DPC
ViT
38
404
0
06 Sep 2021
Searching for Efficient Multi-Stage Vision Transformers
Searching for Efficient Multi-Stage Vision Transformers
Yi-Lun Liao
S. Karaman
Vivienne Sze
ViT
24
19
0
01 Sep 2021
Trans4Trans: Efficient Transformer for Transparent Object and Semantic
  Scene Segmentation in Real-World Navigation Assistance
Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
46
69
0
20 Aug 2021
Fully Convolutional Networks for Panoptic Segmentation with Point-based
  Supervision
Fully Convolutional Networks for Panoptic Segmentation with Point-based Supervision
Yanwei Li
Hengshuang Zhao
Xiaojuan Qi
Yukang Chen
Lu Qi
Liwei Wang
Zeming Li
Jian Sun
Jiaya Jia
41
51
0
17 Aug 2021
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Lei Bai
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
73
33
0
07 Aug 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLM
VLM
GNN
22
567
0
30 Jul 2021
PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Bin Tan
Nan Xue
S. Bai
Tianfu Wu
Guisong Xia
ViT
17
39
0
27 Jul 2021
Image Fusion Transformer
Image Fusion Transformer
VS Vibashan
Jeya Maria Jose Valanarasu
Poojan Oza
Vishal M. Patel
ViT
30
116
0
19 Jul 2021
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Bowen Cheng
Alex Schwing
Alexander Kirillov
VLM
ViT
35
1,503
0
13 Jul 2021
Locally Enhanced Self-Attention: Combining Self-Attention and
  Convolution as Local and Context Terms
Locally Enhanced Self-Attention: Combining Self-Attention and Convolution as Local and Context Terms
Chenglin Yang
Siyuan Qiao
Adam Kortylewski
Alan Yuille
28
4
0
12 Jul 2021
Local-to-Global Self-Attention in Vision Transformers
Local-to-Global Self-Attention in Vision Transformers
Jinpeng Li
Yichao Yan
Tianran Ouyang
Xiaokang Yang
Ling Shao
ViT
25
29
0
10 Jul 2021
Trans4Trans: Efficient Transformer for Transparent Object Segmentation
  to Help Visually Impaired People Navigate in the Real World
Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
41
61
0
07 Jul 2021
Focal Self-attention for Local-Global Interactions in Vision
  Transformers
Focal Self-attention for Local-Global Interactions in Vision Transformers
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
ViT
47
428
0
01 Jul 2021
Looking Outside the Window: Wide-Context Transformer for the Semantic
  Segmentation of High-Resolution Remote Sensing Images
Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images
L. Ding
Dong Lin
Shaofu Lin
Jing Zhang
Xiaojie Cui
Yuebin Wang
Hao Tang
Lorenzo Bruzzone
ViT
33
98
0
29 Jun 2021
K-Net: Towards Unified Image Segmentation
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
32
358
0
28 Jun 2021
P2T: Pyramid Pooling Transformer for Scene Understanding
P2T: Pyramid Pooling Transformer for Scene Understanding
Yu-Huan Wu
Yun-Hai Liu
Xin Zhan
Mingg-Ming Cheng
ViT
29
220
0
22 Jun 2021
Efficient Self-supervised Vision Transformers for Representation
  Learning
Efficient Self-supervised Vision Transformers for Representation Learning
Chunyuan Li
Jianwei Yang
Pengchuan Zhang
Mei Gao
Bin Xiao
Xiyang Dai
Lu Yuan
Jianfeng Gao
ViT
40
209
0
17 Jun 2021
DeepLab2: A TensorFlow Library for Deep Labeling
DeepLab2: A TensorFlow Library for Deep Labeling
Mark Weber
Huiyu Wang
Siyuan Qiao
Jun Xie
Maxwell D. Collins
...
Laura Leal-Taixe
Alan Yuille
Florian Schroff
Hartwig Adam
Liang-Chieh Chen
VLM
27
46
0
17 Jun 2021
Improved Transformer for High-Resolution GANs
Improved Transformer for High-Resolution GANs
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
34
95
0
14 Jun 2021
CAT: Cross Attention in Vision Transformer
CAT: Cross Attention in Vision Transformer
Hezheng Lin
Xingyi Cheng
Xiangyu Wu
Fan Yang
Dong Shen
Zhongyuan Wang
Qing Song
Wei Yuan
ViT
35
149
0
10 Jun 2021
Salient Object Ranking with Position-Preserved Attention
Salient Object Ranking with Position-Preserved Attention
Haoyang Fang
Daoxin Zhang
Yi Zhang
Minghao Chen
Jiawei Li
Yao Hu
Deng Cai
Xiaofei He
23
20
0
09 Jun 2021
Chasing Sparsity in Vision Transformers: An End-to-End Exploration
Chasing Sparsity in Vision Transformers: An End-to-End Exploration
Tianlong Chen
Yu Cheng
Zhe Gan
Lu Yuan
Lei Zhang
Zhangyang Wang
ViT
24
216
0
08 Jun 2021
SIMONe: View-Invariant, Temporally-Abstracted Object Representations via
  Unsupervised Video Decomposition
SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition
Rishabh Kabra
Daniel Zoran
Goker Erdogan
Loic Matthey
Antonia Creswell
M. Botvinick
Alexander Lerchner
Christopher P. Burgess
OCL
52
77
0
07 Jun 2021
Video Instance Segmentation using Inter-Frame Communication Transformers
Video Instance Segmentation using Inter-Frame Communication Transformers
Sukjun Hwang
Miran Heo
Seoung Wug Oh
Seon Joo Kim
ViT
33
135
0
07 Jun 2021
Combinatorial Optimization for Panoptic Segmentation: A Fully
  Differentiable Approach
Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach
Ahmed Abbas
Paul Swoboda
27
14
0
06 Jun 2021
TransVOS: Video Object Segmentation with Transformers
TransVOS: Video Object Segmentation with Transformers
Jianbiao Mei
Mengmeng Wang
Yen-Yu Lin
Yi Yuan
Yong Liu
ViT
19
28
0
01 Jun 2021
Previous
1234567
Next