
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Papers citing "VizWiz Grand Challenge: Answering Visual Questions from Blind People"
50 / 574 papers shown
Title |
---|
![]() AlignLLaVA: Cascaded Human and Large Language Model Preference
Alignment for Multi-modal Instruction Curation Hongzhe Huang Zhewen Yu Jiang Liu Li Cai Dian Jiao ...Siliang Tang Juncheng Li Hao Jiang Haoyuan Li Yueting Zhuang |
![]() MIO: A Foundation Model on Multimodal Tokens Zekun Wang King Zhu Chunpu Xu Wangchunshu Zhou Jiaheng Liu ...Yuanxing Zhang Ge Zhang Ke Xu Jie Fu Wenhao Huang |
![]() A Survey on Evaluation of Multimodal Large Language Models Jiaxing Huang Jingyi Zhang |