
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World
Papers citing "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"
50 / 63 papers shown
Title |
---|
![]() End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting Yongqi Wang Xinxiao Wu Shuo Yang Jiebo Luo |