2012.00759
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
1 December 2020
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
Papers citing
"MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers"
50 / 329 papers shown
Title
Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation
Xiao Zhang
Xiangyu Han
Xiwen Lai
Yao Sun
Pei Zhang
Konrad Kording
34
0
0
08 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
42
0
0
31 Mar 2025
Your ViT is Secretly an Image Segmentation Model
Tommie Kerssies
Niccolò Cavagnero
Alexander Hermans
Narges Norouzi
Giuseppe Averta
Bastian Leibe
Gijs Dubbelman
Daan de Geus
ViT
VLM
69
1
0
24 Mar 2025
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
Gyeongrok Oh
Sungjune Kim
Heeju Ko
Hyung-Gun Chi
J. Kim
Dongwook Lee
Daehyun Ji
Sungjoon Choi
Sujin Jang
Sangpil Kim
41
0
0
19 Mar 2025
Dynamic Relation Inference via Verb Embeddings
Omri Suissa
Muhiim Ali
Ariana Azarbal
Hui Shen
Shekhar Pradhan
46
0
0
17 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
Xianrui Li
Jason Kuen
Hongyu Chen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe-nan Lin
Marios Savvides
62
0
0
11 Mar 2025
Dictionary-based Framework for Interpretable and Consistent Object Parsing
Tiezheng Zhang
Qihang Yu
Alan Yuille
Ju He
80
1
0
26 Feb 2025
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training
Fardin Ayar
Ehsan Javanmardi
Manabu Tsukada
Mahdi Javanmardi
Mohammad Rahmati
VOS
45
0
0
31 Dec 2024
Hierarchical Vector Quantization for Unsupervised Action Segmentation
Federico Spurio
Emad Bahrami
Gianpiero Francesca
Juergen Gall
44
0
0
23 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
94
1
0
13 Dec 2024
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Xianrui Li
Kai Qiu
Hongyu Chen
Jason Kuen
Jiuxiang Gu
Rongxiang Weng
Zhe-nan Lin
Bhiksha Raj
VLM
131
3
0
02 Dec 2024
MGNiceNet: Unified Monocular Geometric Scene Understanding
Markus Schön
Michael Buchholz
Klaus C. J. Dietmayer
3DPC
84
0
0
18 Nov 2024
DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT Integration for histopathological image analysis
Mahtab Ranjbar
Mehdi Mohebbi
Mahdi Cherakhloo
Bijan Vosoughi Vahdat
MedIm
26
0
0
24 Oct 2024
ImageFolder: Autoregressive Image Generation with Folded Tokens
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Jiuxiang Gu
Bhiksha Raj
Zhe-nan Lin
VLM
44
18
0
02 Oct 2024
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection
Liangyu Zhong
Joachim Sicking
Fabian Hüger
Hanno Gottschalk
VLM
42
0
0
25 Sep 2024
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Qilong Zhangli
Di Liu
Abhishek Aich
Dimitris Metaxas
S. Schulter
36
0
0
15 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
Fahad Shahbaz Khan
Yanwei Pang
DiffM
43
1
0
05 Sep 2024
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?
Chen Liang
Qiang Guo
Xiaochao Qu
Luoqi Liu
Ting Liu
VOS
34
0
0
20 Aug 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
48
0
0
12 Jul 2024
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
Cheng Han
Qifan Wang
S. Dianat
Majid Rabbani
Raghuveer M. Rao
Yi Fang
Qiang Guan
Lifu Huang
Dongfang Liu
VLM
41
4
0
05 Jul 2024
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Zihan Gao
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Yuwei Guo
Shuyuan Yang
34
0
0
01 Jul 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRM
MLLM
VLM
51
49
0
27 Jun 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
42
0
0
14 Jun 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
Qihang Yu
Mark Weber
XueQing Deng
Xiaohui Shen
Daniel Cremers
Liang-Chieh Chen
VLM
ViT
60
85
0
11 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
70
4
0
11 Jun 2024
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving
Yining Shi
Jiusi Li
Kun Jiang
Ke Wang
Yunlong Wang
Mengmeng Yang
Diange Yang
3DPC
52
5
0
11 Jun 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
36
0
0
06 Jun 2024
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Inkyu Shin
Qihang Yu
Xiaohui Shen
In So Kweon
Kuk-Jin Yoon
Liang-Chieh Chen
VGen
DiffM
71
1
0
04 Jun 2024
Prototypical Transformer as Unified Motion Learners
Cheng Han
Yawen Lu
Guohao Sun
James Liang
Zhiwen Cao
...
S. Dianat
Raghuveer M. Rao
Tong Geng
Zhiqiang Tao
Dongfang Liu
ViT
37
2
0
03 Jun 2024
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Biao Wu
Diankai Zhang
Sihan Gao
Cheng-yong Zheng
Shaoli Liu
Ning Wang
27
0
0
01 Jun 2024
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Weize Li
Zhaohui Hou
Haochen Bai
Zhicheng Zhao
50
0
0
24 May 2024
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
Dingwen Zhang
Hao Li
Diqi He
Nian Liu
Lechao Cheng
Jingdong Wang
Junwei Han
VLM
49
0
0
22 May 2024
HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval
Chao He
Hongxi Wei
30
6
0
13 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
55
1
0
13 May 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
You-Chen Liu
Lingdong Kong
Xiaoyang Wu
Runnan Chen
Xin Li
Liang Pan
Ziwei Liu
Yuexin Ma
3DPC
53
17
0
02 May 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
48
16
0
12 Apr 2024
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Elham Amin Mansour
Ozan Unal
Suman Saha
Benjamin Bejar
Luc Van Gool
48
1
0
04 Apr 2024
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Jieneng Chen
Qihang Yu
Xiaohui Shen
Alan Yuille
Liang-Chieh Chen
3DV
VLM
47
25
0
02 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
34
2
0
02 Apr 2024
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
Beomyoung Kim
Joonsang Yu
Sung Ju Hwang
VLM
CLL
44
11
0
29 Mar 2024
Benchmarking Object Detectors with COCO: A New Path Forward
Shweta Singh
Aayan Yadav
Jitesh Jain
Humphrey Shi
Justin Johnson
Karan Desai
33
7
0
27 Mar 2024
Inpainting-Driven Mask Optimization for Object Removal
Kodai Shimosato
Norimichi Ukita
3DPC
38
1
0
23 Mar 2024
3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Siwei Yang
Xianhang Li
Jieru Mei
Jieneng Chen
Cihang Xie
Yuyin Zhou
MedIm
44
4
0
23 Mar 2024
Depth-aware Panoptic Segmentation
Tuan Nguyen
M. Mehltretter
Franz Rottensteiner
MDE
30
0
0
21 Mar 2024
Accelerating ViT Inference on FPGA through Static and Dynamic Pruning
Dhruv Parikh
Shouyi Li
Bingyi Zhang
Rajgopal Kannan
Carl E. Busart
Viktor Prasanna
40
1
0
21 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
37
5
0
06 Mar 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
39
2
0
23 Feb 2024
Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision
Zhaoqing Wang
Xiaobo Xia
Ziye Chen
Xiao He
Yandong Guo
Biwei Huang
Tongliang Liu
VLM
29
11
0
14 Feb 2024
Lumos: Empowering Multimodal LLMs with Scene Text Recognition
Ashish Shenoy
Yichao Lu
Srihari Jayakumar
Debojeet Chatterjee
Mohsen Moslehpour
...
Shicong Zhao
Longfang Zhao
Ankit Ramchandani
Xin Luna Dong
Anuj Kumar
MLLM
37
2
0
12 Feb 2024
AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image
H. A. Amirkolaee
Miaojing Shi
Lianghua He
Mark Mulligan
28
1
0
05 Feb 2024