Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.05499
Cited By
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
9 March 2023
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
Jie-jin Yang
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
41 / 1,341 papers shown
Title
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
34
12
0
27 May 2023
Building One-class Detector for Anything: Open-vocabulary Zero-shot OOD Detection Using Text-image Models
Yunhao Ge
Jie Jessie Ren
Jiaping Zhao
Kaifeng Chen
Andrew Gallagher
Laurent Itti
Balaji Lakshminarayanan
VLM
ObjD
26
1
0
26 May 2023
Sim-Suction: Learning a Suction Grasp Policy for Cluttered Environments Using a Synthetic Benchmark
Juncheng Li
D. Cappelleri
3DPC
22
11
0
25 May 2023
ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs
Zihao Zhao
Sheng Wang
Jinchen Gu
Yitao Zhu
Lanzhuju Mei
Zixu Zhuang
Zhiming Cui
Qian Wang
Dinggang Shen
LM&MA
37
36
0
25 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
38
50
0
24 May 2023
AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
Barry Menglong Yao
Yu Chen
Qifan Wang
Sijia Wang
Minqian Liu
Zhiyang Xu
Licheng Yu
Lifu Huang
26
7
0
24 May 2023
DetGPT: Detect What You Need via Reasoning
Renjie Pi
Jiahui Gao
Shizhe Diao
Rui Pan
Hanze Dong
...
Lewei Yao
Jianhua Han
Hang Xu
Lingpeng Kong Tong Zhang
Tong Zhang
LRM
LM&Ro
27
92
0
23 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
52
66
0
23 May 2023
Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D Scans
Taiki Miyanishi
Daich Azuma
Shuhei Kurita
M. Kawanabe
44
2
0
23 May 2023
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Yang Liu
Muzhi Zhu
Hengtao Li
Hao Chen
Xinlong Wang
Chunhua Shen
VLM
MLLM
88
86
0
22 May 2023
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration
Qifan Yu
Juncheng Li
Wentao Ye
Siliang Tang
Yueting Zhuang
36
13
0
22 May 2023
Going Denser with Open-Vocabulary Part Segmentation
Pei Sun
Shoufa Chen
Chenchen Zhu
Fanyi Xiao
Ping Luo
Saining Xie
Zhicheng Yan
ObjD
VLM
27
45
0
18 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
48
116
0
18 May 2023
Segment Any Anomaly without Training via Hybrid Prompt Regularization
Yunkang Cao
Xiaohao Xu
Chen Sun
Y. Cheng
Zongwei Du
Liang Gao
Nong Sang
VLM
37
71
0
18 May 2023
OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields
Youtan Yin
Zhoujie Fu
Fan Yang
Guosheng Lin
50
29
0
17 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffM
VGen
38
33
0
15 May 2023
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Zhimin Chen
Longlong Jing
Yingwei Li
Bing Li
32
31
0
15 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
43
90
0
14 May 2023
A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering
Chaoning Zhang
Fachrina Dewi Puspitasari
Sheng Zheng
Chenghao Li
Yu Qiao
...
Caiyan Qin
François Rameau
Lik-Hang Lee
Sung-Ho Bae
Choong Seon Hong
VLM
84
63
0
12 May 2023
Segment and Track Anything
Yangming Cheng
Liulei Li
Yuanyou Xu
Xiaodi Li
Zongxin Yang
Wenguan Wang
Yi Yang
VOS
30
193
0
11 May 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu
Yinan He
Wenhai Wang
Weiyun Wang
Yi Wang
...
Yali Wang
Limin Wang
Ping Luo
Jifeng Dai
Yu Qiao
LRM
MLLM
41
79
0
09 May 2023
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model
Di Wang
Jing Zhang
Bo Du
Minqiang Xu
Lin Liu
Dacheng Tao
Lefei Zhang
128
69
0
03 May 2023
An Alternative to WSSS? An Empirical Study of the Segment Anything Model (SAM) on Weakly-Supervised Semantic Segmentation Problems
Weixuan Sun
Zheyuan Liu
Yanhao Zhang
Yiran Zhong
Nick Barnes
VLM
83
20
0
02 May 2023
Attack-SAM: Towards Attacking Segment Anything Model With Adversarial Examples
Chenshuang Zhang
Chaoning Zhang
Taegoo Kang
Donghun Kim
Sung-Ho Bae
In So Kweon
AAML
VLM
44
3
0
01 May 2023
Learnable Ophthalmology SAM
Zhongxi Qiu
Yan Hu
Heng Li
Jiang-Dong Liu
VLM
MedIm
32
24
0
26 Apr 2023
Segment Anything in 3D with Radiance Fields
Jiazhong Cen
Jiemin Fang
Zanwei Zhou
Chen Yang
Lingxi Xie
Xiaopeng Zhang
Wei-Ming Shen
Qi Tian
43
43
0
24 Apr 2023
Segment Anything in Non-Euclidean Domains: Challenges and Opportunities
Yongcheng Jing
Xinchao Wang
Dacheng Tao
52
21
0
23 Apr 2023
Can SAM Count Anything? An Empirical Study on SAM Counting
Zhiheng Ma
Xiaopeng Hong
Qinnan Shangguan
VLM
30
18
0
21 Apr 2023
Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models
Jielu Zhang
Zhongliang Zhou
Gengchen Mai
Lan Mu
Mengxuan Hu
Sheng Li
VLM
34
46
0
20 Apr 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
151
4,325
0
17 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
79
78
0
13 Apr 2023
SATR: Zero-Shot Semantic Segmentation of 3D Shapes
Ahmed Abdelreheem
Ivan Skorokhodov
M. Ovsjanikov
Peter Wonka
3DPC
40
38
0
11 Apr 2023
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Ruiqi Wang
A. Patil
Fenggen Yu
Hao Zhang
21
2
0
21 Mar 2023
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Hsuan-Kung Yang
Tsung-Chih Chiang
Tingxin Liu
Chun-Wei Huang
Jou-Min Liu
Tsu-Ching Hsiao
Chun-Yi Lee
28
1
0
05 Mar 2023
Explainable Anomaly Detection in Images and Videos: A Survey
Yizhou Wang
Dongliang Guo
Sheng Li
Octavia Camps
Yun Fu
39
5
0
13 Feb 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
26
47
0
09 Feb 2023
CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation
Yang You
Wenhao He
Jin Liu
Hongkai Xiong
Weiming Wang
Cewu Lu
3DPC
40
4
0
24 Nov 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIP
VLM
115
153
0
20 Sep 2022
Complex Scene Image Editing by Scene Graph Comprehension
Zhongping Zhang
Huiwen He
Bryan A. Plummer
Z. Liao
Huayan Wang
DiffM
35
6
0
24 Mar 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
Xiaohu Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
161
729
0
28 Jan 2022
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Nayeon Lee
Weicheng Kuo
Huayu Chen
VLM
ObjD
225
899
0
28 Apr 2021
Previous
1
2
3
...
25
26
27