
v1v2 (latest)
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Papers citing "What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs"
50 / 72 papers shown
Title |
---|
![]() Grounded Language-Image Pre-training Liunian Harold Li Pengchuan Zhang Haotian Zhang Jianwei Yang Chunyuan Li ...Lu Yuan Lei Zhang Lei Li Kai-Wei Chang Jianfeng Gao |