Title |
---|
![]() Rephrase, Augment, Reason: Visual Grounding of Questions for
Vision-Language Models Archiki Prasad Elias Stengel-Eskin Mohit Bansal |
![]() GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised
Learning Mainak Singha Ankit Jha Biplab Banerjee |