LocCa: Visual Pretraining with Location-aware Captioners

LocCa: Visual Pretraining with Location-aware Captioners

    VLM

Papers citing "LocCa: Visual Pretraining with Location-aware Captioners"

50 / 63 papers shown
Title
ONE-PEACE: Exploring One General Representation Model Toward Unlimited
  Modalities
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
99
122
0
18 May 2023

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.