Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.02777
Cited By
v1
v2
v3 (latest)
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
6 June 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1325★)
Papers citing
"Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
50 / 235 papers shown
Title
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjD
VLM
81
7
0
08 Oct 2024
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding
Shenghao Li
65
1
0
06 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
101
7
0
03 Oct 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
192
14
0
23 Sep 2024
A Bottom-Up Approach to Class-Agnostic Image Segmentation
Sebastian Dille
Ari Blondal
Sylvain Paris
Yağız Aksoy
63
0
0
20 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
452
0
0
19 Sep 2024
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. E. Kalfaoglu
H. Öztürk
Ozsel Kilinc
A. Temi̇zel
3DPC
86
2
0
17 Sep 2024
A Likelihood Ratio-Based Approach to Segmenting Unknown Objects
Nazir Nayal
Youssef Shoeb
Fatma Güney
OODD
82
4
0
10 Sep 2024
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding
Ji Ha Jang
H. Seo
Se Young Chun
93
3
0
10 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
127
2
0
04 Sep 2024
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
116
0
0
29 Aug 2024
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
70
10
0
27 Aug 2024
VPOcc: Exploiting Vanishing Point for Monocular 3D Semantic Occupancy Prediction
Junsu Kim
Junhee Lee
Ukcheol Shin
Jean Oh
Kyungdon Joo
3DPC
68
0
0
07 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
117
4
0
31 Jul 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
110
28
0
29 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
85
3
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
202
1
0
23 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
100
9
0
18 Jul 2024
IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild
Shuaixian Wang
Haoran Xu
Yaokun Li
Jiwei Chen
Guang Tan
102
3
0
15 Jul 2024
A Fair Ranking and New Model for Panoptic Scene Graph Generation
Julian Lorenz
Alexander Pest
Daniel Kienzle
K. Ludwig
Rainer Lienhart
98
1
0
12 Jul 2024
Anatomy-guided Pathology Segmentation
A. Jaus
C. Seibold
Simon Reiß
Lukas Heine
Anton Schily
Moon Kim
F. Bahnsen
Ken Herrmann
Rainer Stiefelhagen
Jens Kleesiek
MedIm
64
3
0
08 Jul 2024
Improving Computer Vision Interpretability: Transparent Two-level Classification for Complex Scenes
Stefan Scholz
Nils B. Weidmann
Zachary C. Steinert-Threlkeld
Eda Keremoğlu
Bastian Goldlücke
60
1
0
04 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
150
0
0
01 Jul 2024
Rethinking Remote Sensing Change Detection With A Mask View
Xiaowen Ma
Zhenkai Wu
Rongrong Lian
Wei Zhang
Siyang Song
70
3
0
21 Jun 2024
Liveness Detection in Computer Vision: Transformer-based Self-Supervised Learning for Face Anti-Spoofing
Arman Keresh
Pakizar Shamoi
72
7
0
19 Jun 2024
Technique Report of CVPR 2024 PBDL Challenges
Ying Fu
Yu Li
Shaodi You
Boxin Shi
Linwei Chen
...
Songyin Dai
Sen Jia
Junpei Zhang
Puhua Chen
Qihang Li
90
0
0
15 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
119
5
0
11 Jun 2024
CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only
Junhee Cho
Jihoon Kim
Daseul Bae
Jinho Choo
Youngjune Gwon
Yeong-Dae Kwon
LLMAG
59
1
0
11 Jun 2024
ProMotion: Prototypes As Motion Learners
Yawen Lu
Dongfang Liu
Qifan Wang
Cheng Han
Yiming Cui
Zhiwen Cao
Xueling Zhang
Yingjie Victor Chen
Heng Fan
DiffM
122
3
0
07 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Lu Yang
Pu Cao
Liulei Li
Huadong Ma
94
1
0
06 Jun 2024
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
61
2
0
31 May 2024
BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation
Tengbo Wang
Yu Bai
ISeg
82
1
0
27 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
142
0
0
15 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
79
1
0
13 May 2024
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
81
3
0
06 May 2024
Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
Mohamad Al Al Mdfaa
Raghad Salameh
Sergey Zagoruyko
Gonzalo Ferrer
70
1
0
03 May 2024
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Siqi Yin
Lifan Jiang
47
0
0
03 May 2024
GraCo: Granularity-Controllable Interactive Segmentation
Yian Zhao
Kehan Li
Ze-Long Cheng
Pengchong Qiao
Xiawu Zheng
Rongrong Ji
Chang Liu
Li-ming Yuan
Jie Chen
114
9
0
01 May 2024
UniFS: Universal Few-shot Instance Perception with Point Representations
Sheng Jin
Ruijie Yao
Lumin Xu
Wentao Liu
Chao Qian
Ji Wu
Ping Luo
119
2
0
30 Apr 2024
UniRGB-IR: A Unified Framework for Visible-Infrared Semantic Tasks via Adapter Tuning
Maoxun Yuan
Bo Cui
Tianyi Zhao
Xingxing Wei
Shan Fu
Xue Yang
Xingxing Wei
112
0
0
26 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
165
0
0
23 Apr 2024
CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect
Minh Q. Tran
Sang Truong
Arthur F. A. Fernandes
Michael Kidd
Ngan Le
ViT
100
4
0
17 Apr 2024
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Zhongrui Gui
Shuyang Sun
Runjia Li
Jianhao Yuan
Zhaochong An
Karsten Roth
Ameya Prabhu
Philip Torr
VLM
CLL
81
7
0
15 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
113
17
0
12 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
123
2
0
06 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
137
2
0
04 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
106
4
0
02 Apr 2024
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Beomyoung Kim
Donghyeon Kim
Sung Ju Hwang
126
0
0
01 Apr 2024
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
Beomyoung Kim
Joonsang Yu
Sung Ju Hwang
VLM
CLL
105
13
0
29 Mar 2024
Leveraging Large Language Model-based Room-Object Relationships Knowledge for Enhancing Multimodal-Input Object Goal Navigation
Leyuan Sun
Asako Kanezaki
Guillaume Caron
Yusuke Yoshiyasu
LM&Ro
76
2
0
21 Mar 2024
Previous
1
2
3
4
5
Next