arXiv:2404.01151
Cited By
Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs
1 April 2024
Jialou Wang, Manli Zhu, Yulei Li, Honglei Li, Long Yang, Wai Lok Woo
Papers citing "Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs"
1 / 1 papers shown
Title
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen, Deyao Zhu, Xiaoqian Shen, Xiang Li, Zechun Liu, Pengchuan Zhang, Raghuraman Krishnamoorthi, Vikas Chandra, Yunyang Xiong, Mohamed Elhoseiny
MLLM
168
448
0
14 Oct 2023