
v1v2 (latest)
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Papers citing "CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation"
46 / 46 papers shown
Title |
---|
![]() Grounded Language-Image Pre-training Liunian Harold Li Pengchuan Zhang Haotian Zhang Jianwei Yang Chunyuan Li ...Lu Yuan Lei Zhang Lei Li Kai-Wei Chang Jianfeng Gao |