2009.00145
Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering
31 August 2020
Jiahao Yu
Zihao Zhu
Yujing Wang
Weifeng Zhang
Yue Hu
Jianlong Tan
Papers citing
"Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering"
16 / 16 papers shown
Title
Graph Neural Networks in Vision-Language Image Understanding: A Survey
Henry Senior
Greg Slabaugh
Shanxin Yuan
Luca Rossi
GNN
92
21
0
07 Mar 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
101
4
0
04 Mar 2023
VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges
R. Zakari
Jim Wilson Owusu
Hailin Wang
Ke Qin
Zaharaddeen Karami Lawal
Yue-hong Dong
LRM
77
16
0
26 Dec 2022
Hierarchical multimodal transformers for Multi-Page DocVQA
Rubèn Pérez Tito
Dimosthenis Karatzas
Ernest Valveny
94
61
0
07 Dec 2022
A survey on knowledge-enhanced multimodal learning
Maria Lymperaiou
Giorgos Stamou
174
15
0
19 Nov 2022
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
Yang Ding
Jing Yu
Bangchang Liu
Yue Hu
Mingxin Cui
Qi Wu
58
64
0
17 Mar 2022
Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering
Mingxiao Li
Marie-Francine Moens
87
13
0
06 Mar 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Penglei Sun
Xuwu Wang
Yanghua Xiao
N. Yuan
73
167
0
11 Feb 2022
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering
Peixi Xiong
Quanzeng You
Pei Yu
Zicheng Liu
Ying Wu
65
5
0
25 Jan 2022
MoCA: Incorporating Multi-stage Domain Pretraining and Cross-guided Multimodal Attention for Textbook Question Answering
Fangzhi Xu
Qika Lin
Jing Liu
Lingling Zhang
Tianzhe Zhao
Qianyi Chai
Yudai Pan
55
2
0
06 Dec 2021
Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video Prediction
Wentao He
Jianfeng Ren
Ruibin Bai
Xudong Jiang
LRM
70
5
0
24 Nov 2021
Recent Advances and Trends in Multimodal Deep Learning: A Review
Jabeen Summaira
Xi Li
Amin Muhammad Shoib
Songyuan Li
Abdul Jabbar
HAI
237
59
0
24 May 2021
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering
Aman Jain
Mayank Kothyari
Vishwajeet Kumar
Preethi Jyothi
Ganesh Ramakrishnan
Soumen Chakrabarti
68
36
0
09 Mar 2021
Decomposing Generation Networks with Structure Prediction for Recipe Generation
Hao Wang
Guosheng Lin
Chunyan Miao
40
1
0
27 Jul 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
M. Farazi
Salman H. Khan
Nick Barnes
81
18
0
20 Jan 2020
Visual Question Answering using Deep Learning: A Survey and Performance Analysis
Yash Srivastava
Vaishnav Murali
S. Dubey
Snehasis Mukherjee
87
49
0
27 Aug 2019