2208.03030
Cited By
ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
5 August 2022
Bingning Wang
Feiya Lv
Ting Yao
Yiming Yuan
Jin Ma
Yu Luo
Haijin Liang
Papers citing
"ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding"
4 / 4 papers shown
Title
ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Quan Van Nguyen
Dan Quang Tran
Huy Quang Pham
Thang Kien-Bao Nguyen
Nghia Hieu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
CoGe
39
3
0
16 Apr 2024
MORI-RAN: Multi-view Robust Representation Learning via Hybrid Contrastive Fusion
Guanzhou Ke
Yong-Nan Zhu
Yang Yu
37
7
0
26 Aug 2022
ComQA: Compositional Question Answering via Hierarchical Graph Neural Networks
Bingning Wang
Ting Yao
Weipeng Chen
Jingfang Xu
Xiaochuan Wang
CoGe
22
6
0
16 Jan 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
252
927
0
24 Sep 2019