
LocCa: Visual Pretraining with Location-aware Captioners
Papers citing "LocCa: Visual Pretraining with Location-aware Captioners"
50 / 63 papers shown
Title |
---|
![]() Grounded Language-Image Pre-training Liunian Harold Li Pengchuan Zhang Haotian Zhang Jianwei Yang Chunyuan Li ...Lu Yuan Lei Zhang Lei Li Kai-Wei Chang Jianfeng Gao |