Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.02777
Cited By
v1
v2
v3 (latest)
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
6 June 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1325★)
Papers citing
"Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
50 / 235 papers shown
Title
Video Object Segmentation with Dynamic Query Modulation
Hantao Zhou
Runze Hu
Xiu Li
VOS
81
1
0
18 Mar 2024
Endora: Video Generation Models as Endoscopy Simulators
Chenxin Li
Hengyu Liu
Yifan Liu
Brandon Yushan Feng
Wuyang Li
Xinyu Liu
Zhen Chen
Jing Shao
Yixuan Yuan
VGen
MedIm
127
41
0
17 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
78
7
0
14 Mar 2024
MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion
Arul Selvam Periyasamy
Sven Behnke
3DPC
70
0
0
14 Mar 2024
Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs
Nikhil Mishra
Maximilian Sieb
Pieter Abbeel
Xi Chen
3DPC
68
1
0
07 Mar 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
107
18
0
28 Feb 2024
Vision Transformers with Natural Language Semantics
Young-Kyung Kim
Matías Di Martino
Guillermo Sapiro
ViT
63
5
0
27 Feb 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
91
3
0
23 Feb 2024
Outlier detection by ensembling uncertainty with negative objectness
Anja Delić
Matej Grcić
Sinisa Segvic
UQCV
118
15
0
23 Feb 2024
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Hiroshi Murase
VLM
131
1
0
21 Feb 2024
Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance
Raza Imam
Muhammad Huzaifa
Nabil Mansour
Shaher Bano Mirza
Fouad Lamghari
121
1
0
10 Feb 2024
SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite Images
Pengming Feng
Mingjie Xie
Hongning Liu
Xuanjia Zhao
Guangjun He
Xueliang Zhang
Jian Guan
59
1
0
06 Feb 2024
GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis
Jing Hao
Moyun Liu
Kuo Feng Hung
DiffM
59
2
0
27 Jan 2024
EEND-M2F: Masked-attention mask transformers for speaker diarization
Marc Härkönen
Samuel J. Broughton
Lahiru Samarakoon
109
9
0
23 Jan 2024
Stream Query Denoising for Vectorized HD Map Construction
Shuo Wang
Fan Jia
Yingfei Liu
Yucheng Zhao
Zehui Chen
Tiancai Wang
Chi Zhang
Xiangyu Zhang
Feng Zhao
90
20
0
17 Jan 2024
An Efficient Instance Segmentation Framework Based on Oriented Bounding Boxes
Zhen Zhou
Junfeng Fan
Yunkai Ma
Sihan Zhao
Fengshui Jing
M. Tan
ISeg
73
0
0
16 Jan 2024
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding
Bowen Shi
Peisen Zhao
Zichen Wang
Yuhang Zhang
Yaoming Wang
...
Wenrui Dai
Junni Zou
Hongkai Xiong
Qi Tian
Xiaopeng Zhang
VLM
63
8
0
12 Jan 2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
Yuwen Xiong
Zhiqi Li
Yuntao Chen
Feng Wang
Xizhou Zhu
...
Hongsheng Li
Yu Qiao
Lewei Lu
Jie Zhou
Jifeng Dai
69
63
0
11 Jan 2024
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
Haobo Yuan
Xiangtai Li
Chong Zhou
Yining Li
Kai Chen
Chen Change Loy
VLM
118
51
0
05 Jan 2024
Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance Segmentation
Baris Can Cam
Kemal Oksuz
Fehmi Kahraman
Z. S. Baltaci
Sinan Kalkan
Emre Akbas
62
0
0
28 Dec 2023
TAO-Amodal: A Benchmark for Tracking Any Object Amodally
Cheng-Yen Hsieh
Kaihua Chen
Achal Dave
Tarasha Khurana
Deva Ramanan
115
0
0
19 Dec 2023
General Object Foundation Model for Images and Videos at Scale
Junfeng Wu
Yi Jiang
Qihao Liu
Zehuan Yuan
Xiang Bai
Song Bai
VOS
VLM
111
41
0
14 Dec 2023
Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection
Teodora Popordanoska
A. Tiulpin
Matthew B. Blaschko
113
8
0
11 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
117
5
0
11 Dec 2023
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD
VLM
72
6
0
10 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
117
5
0
09 Dec 2023
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Zhixiang Wei
Lin Chen
Yi Jin
Xiaoxiao Ma
Tianle Liu
Pengyang Lin
Ben Wang
H. Chen
Jinjin Zheng
103
48
0
07 Dec 2023
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Mingqiao Ye
Martin Danelljan
Fisher Yu
Lei Ke
3DGS
DiffM
137
188
0
01 Dec 2023
Learning Part Segmentation from Synthetic Animals
Jiawei Peng
Ju He
Prakhar Kaushik
Zihao Xiao
Jiteng Mu
Alan Yuille
72
3
0
30 Nov 2023
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation
Lingchen Meng
Shiyi Lan
Hengduo Li
Jose M. Alvarez
Zuxuan Wu
Yu-Gang Jiang
VLM
ISeg
MLLM
79
9
0
24 Nov 2023
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
Maksim Kolodiazhnyi
Anna Vorontsova
Anton Konushin
D. Rukhovich
ViT
96
52
0
24 Nov 2023
Visual In-Context Prompting
Feng Li
Qing Jiang
Hao Zhang
Tianhe Ren
Shilong Liu
...
Hongyang Li
Chun-yue Li
Jianwei Yang
Lei Zhang
Jianfeng Gao
VLM
LRM
MLLM
89
36
0
22 Nov 2023
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
104
14
0
22 Nov 2023
SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis
Hanrong Ye
Jason Kuen
Qing Liu
Zhe Lin
Brian L. Price
Dan Xu
VLM
130
12
0
06 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring
Arthur Ouaknine
T. Kattenborn
Etienne Laliberté
David Rolnick
163
6
0
01 Nov 2023
A Self-Supervised Approach to Land Cover Segmentation
Charles Moore
Dakota Hester
59
0
0
27 Oct 2023
Open-NeRF: Towards Open Vocabulary NeRF Decomposition
Hao Zhang
Fang Li
Narendra Ahuja
90
12
0
25 Oct 2023
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li
Yupeng Deng
Yun-long Kong
Diyou Liu
Jingbo Chen
Yu Meng
Junxian Ma
Chenhao Wang
258
1
0
25 Oct 2023
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Jianwei Yang
Hao Zhang
Feng Li
Xueyan Zou
Chun-yue Li
Jianfeng Gao
MLLM
VLM
130
189
0
17 Oct 2023
A Framework For Automated Dissection Along Tissue Boundary
K. Oh
Leonardo Borgioli
Milos Zefran
Liaohai Chen
P. Giulianotti
68
7
0
14 Oct 2023
Rank-DETR for High Quality Object Detection
Yifan Pu
Weicong Liang
Yiduo Hao
Yuhui Yuan
Yukang Yang
Chao Zhang
Hanhua Hu
Gao Huang
101
63
0
13 Oct 2023
Cross-Task Data Augmentation by Pseudo-label Generation for Region Based Coronary Artery Instance Segmentation
Sandesh Pokhrel
Sanjay Bhandari
Eduard Vazquez
Yash Raj Shrestha
Binod Bhattarai
38
0
0
08 Oct 2023
Low-Resolution Self-Attention for Semantic Segmentation
Yu-Huan Wu
Shi-Chen Zhang
Yun-Hai Liu
Le Zhang
Xin Zhan
Daquan Zhou
Jiashi Feng
Ming-Ming Cheng
Liangli Zhen
ViT
222
3
0
08 Oct 2023
LoCUS: Learning Multiscale 3D-consistent Features from Posed Images
Dominik A. Kloepfer
Dylan Campbell
João F. Henriques
3DPC
3DV
76
0
0
02 Oct 2023
Deep Learning-Based Connector Detection for Robotized Assembly of Automotive Wire Harnesses
Hao Wang
Björn Johansson
44
10
0
24 Sep 2023
RTrack: Accelerating Convergence for Visual Object Tracking via Pseudo-Boxes Exploration
Guotian Zeng
Bi Zeng
Kuanqi Cai
Jianqi Liu
Qingmao Wei
61
1
0
23 Sep 2023
ClusterFormer: Clustering As A Universal Visual Learner
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
VLM
98
10
0
22 Sep 2023
Haystack: A Panoptic Scene Graph Dataset to Evaluate Rare Predicate Classes
Julian Lorenz
Florian Barthel
Daniel Kienzle
Rainer Lienhart
64
5
0
05 Sep 2023
DeNISE: Deep Networks for Improved Segmentation Edges
S. Jyhne
Per-Arne Andersen
Morten Goodwin Olsen
49
0
0
05 Sep 2023
Mask-Attention-Free Transformer for 3D Instance Segmentation
Xin Lai
Yuhui Yuan
Ruihang Chu
Yukang Chen
Han Hu
Jiaya Jia
MedIm
ISeg
3DPC
100
31
0
04 Sep 2023
Previous
1
2
3
4
5
Next