ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.06039
  4. Cited By
MultiModalQA: Complex Question Answering over Text, Tables and Images

MultiModalQA: Complex Question Answering over Text, Tables and Images

13 April 2021
Alon Talmor
Ori Yoran
Amnon Catav
Dan Lahav
Yizhong Wang
Akari Asai
Gabriel Ilharco
Hannaneh Hajishirzi
Jonathan Berant
    LMTD
ArXivPDFHTML

Papers citing "MultiModalQA: Complex Question Answering over Text, Tables and Images"

50 / 100 papers shown
Title
Unified Language Representation for Question Answering over Text,
  Tables, and Images
Unified Language Representation for Question Answering over Text, Tables, and Images
Yu Bowen
Cheng Fu
Haiyang Yu
Fei Huang
Yongbin Li
LMTD
24
20
0
29 Jun 2023
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual
  Question Answering
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering
A. S. Penamakuri
Manish Gupta
Mithun Das Gupta
Anand Mishra
37
7
0
29 Jun 2023
Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual
  Question Answering
Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering
Alireza Salemi
Mahta Rafiee
Hamed Zamani
24
8
0
28 Jun 2023
ECG-QA: A Comprehensive Question Answering Dataset Combined With
  Electrocardiogram
ECG-QA: A Comprehensive Question Answering Dataset Combined With Electrocardiogram
Jungwoo Oh
Gyubok Lee
Seongsu Bae
Joon-Myoung Kwon
Edward Choi
18
16
0
21 Jun 2023
Table Meets LLM: Can Large Language Models Understand Structured Table
  Data? A Benchmark and Empirical Study
Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study
Yuan Sui
Mengyu Zhou
Mingjie Zhou
Shi Han
Dongmei Zhang
LMTD
24
71
0
22 May 2023
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from
  Linguistically Complex Text
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Yunxin Li
Baotian Hu
Yuxin Ding
Lin Ma
Hao Fei
23
5
0
03 May 2023
Multimodal Graph Transformer for Multimodal Question Answering
Multimodal Graph Transformer for Multimodal Question Answering
Xuehai He
Xin Eric Wang
36
7
0
30 Apr 2023
A Symmetric Dual Encoding Dense Retrieval Framework for
  Knowledge-Intensive Visual Question Answering
A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
Alireza Salemi
Juan Altmayer Pizzorno
Hamed Zamani
15
14
0
26 Apr 2023
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Ori Yoran
Tomer Wolfson
Ben Bogin
Uri Katz
Daniel Deutch
Jonathan Berant
ReLM
LRM
KELM
26
94
0
25 Apr 2023
MPMQA: Multimodal Question Answering on Product Manuals
MPMQA: Multimodal Question Answering on Product Manuals
Liangfu Zhang
Anwen Hu
Jing Zhang
Shuo Hu
Qin Jin
10
9
0
19 Apr 2023
VTQA: Visual Text Question Answering via Entity Alignment and
  Cross-Media Reasoning
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning
Kan Chen
Xiangqian Wu
CoGe
19
8
0
05 Mar 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
28
15
0
17 Feb 2023
MuG: A Multimodal Classification Benchmark on Game Data with Tabular,
  Textual, and Visual Fields
MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields
Jiaying Lu
Yongchen Qian
Shifan Zhao
Yuanzhe Xi
Carl Yang
VLM
27
3
0
06 Feb 2023
SlideVQA: A Dataset for Document Visual Question Answering on Multiple
  Images
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images
Ryota Tanaka
Kyosuke Nishida
Kosuke Nishida
Taku Hasegawa
Itsumi Saito
Kuniko Saito
25
72
0
12 Jan 2023
A Survey on Table-and-Text HybridQA: Concepts, Methods, Challenges and
  Future Directions
A Survey on Table-and-Text HybridQA: Concepts, Methods, Challenges and Future Directions
Dingzirui Wang
Longxu Dou
Wanxiang Che
36
5
0
27 Dec 2022
Enhancing Multi-modal and Multi-hop Question Answering via Structured
  Knowledge and Unified Retrieval-Generation
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation
Qian Yang
Qian Chen
Wen Wang
Baotian Hu
Min Zhang
37
24
0
16 Dec 2022
Training Vision-Language Models with Less Bimodal Supervision
Training Vision-Language Models with Less Bimodal Supervision
Elad Segal
Ben Bogin
Jonathan Berant
VLM
21
2
0
01 Nov 2022
PACIFIC: Towards Proactive Conversational Question Answering over
  Tabular and Textual Data in Finance
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
Yang Deng
Wenqiang Lei
Wenxuan Zhang
W. Lam
Tat-Seng Chua
44
52
0
17 Oct 2022
Large Language Models are few(1)-shot Table Reasoners
Large Language Models are few(1)-shot Table Reasoners
Wenhu Chen
LMTD
ReLM
LRM
22
138
0
13 Oct 2022
OpenCQA: Open-ended Question Answering with Charts
OpenCQA: Open-ended Question Answering with Charts
Shankar Kantharaj
Do Xuan Long
Rixie Tiffany Ko Leong
J. Tan
Enamul Hoque
Chenyu You
29
47
0
12 Oct 2022
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation
  for Video-based Action Anticipation
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation
Sayontan Ghosh
Tanvi Aggarwal
Minh Hoai
Niranjan Balasubramanian
VLM
12
4
0
12 Oct 2022
MuRAG: Multimodal Retrieval-Augmented Generator for Open Question
  Answering over Images and Text
MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text
Wenhu Chen
Hexiang Hu
Xi Chen
Pat Verga
William W. Cohen
RALM
16
143
0
06 Oct 2022
Binding Language Models in Symbolic Languages
Binding Language Models in Symbolic Languages
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
122
198
0
06 Oct 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured
  Mathematical Reasoning
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM
LRM
52
267
0
29 Sep 2022
A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future
  Directions
A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions
Bowen Qin
Binyuan Hui
Lihan Wang
Min Yang
Jinyang Li
...
Rongyu Cao
Jian Sun
Luo Si
Fei Huang
Yongbin Li
LMTD
35
52
0
29 Aug 2022
OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking
  Experience
OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking Experience
Miaoran Li
Baolin Peng
Jianfeng Gao
Zhu Zhang
69
9
0
24 Jun 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
72
527
0
13 Jun 2022
MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and
  Textual Data
MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data
Yilun Zhao
Yunxiang Li
Chenying Li
Rui Zhang
AIMat
39
97
0
03 Jun 2022
QAMPARI: An Open-domain Question Answering Benchmark for Questions with
  Many Answers from Multiple Paragraphs
QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
S. Amouyal
Tomer Wolfson
Ohad Rubin
Ori Yoran
Jonathan Herzig
Jonathan Berant
RALM
VLM
18
21
0
25 May 2022
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured
  Electronic Health Records For Medicine Related Queries
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Jayetri Bardhan
Anthony Colas
Kirk Roberts
D. Wang
CML
14
11
0
03 May 2022
Conversational Question Answering on Heterogeneous Sources
Conversational Question Answering on Heterogeneous Sources
Philipp Christmann
Rishiraj Saha Roy
Gerhard Weikum
24
42
0
25 Apr 2022
Learning to Answer Questions in Dynamic Audio-Visual Scenarios
Learning to Answer Questions in Dynamic Audio-Visual Scenarios
Guangyao Li
Yake Wei
Yapeng Tian
Chenliang Xu
Ji-Rong Wen
Di Hu
29
136
0
26 Mar 2022
Table Structure Recognition with Conditional Attention
Table Structure Recognition with Conditional Attention
Bin Xiao
Murat Simsek
B. Kantarci
Ala Abu Alkheir
LMTD
17
9
0
08 Mar 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding
  with Text-to-Text Language Models
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
Tianbao Xie
Chen Henry Wu
Peng Shi
Ruiqi Zhong
Torsten Scholak
...
Lingpeng Kong
Rui Zhang
Noah A. Smith
Luke Zettlemoyer
Tao Yu
LMTD
28
297
0
16 Jan 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
21
141
0
14 Jan 2022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media
  Knowledge Extraction and Grounding
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Revanth Reddy Gangi Reddy
Xilin Rui
Manling Li
Xudong Lin
Haoyang Wen
...
Joey Tianyi Zhou
Avirup Sil
Shih-Fu Chang
A. Schwing
Heng Ji
25
31
0
20 Dec 2021
Multimodal End-to-End Group Emotion Recognition using Cross-Modal
  Attention
Multimodal End-to-End Group Emotion Recognition using Cross-Modal Attention
Lev Evtodienko
11
5
0
10 Nov 2021
Logic-level Evidence Retrieval and Graph-based Verification Network for
  Table-based Fact Verification
Logic-level Evidence Retrieval and Graph-based Verification Network for Table-based Fact Verification
Qi Shi
Yu Zhang
Qingyu Yin
Ting Liu
43
19
0
14 Sep 2021
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
Michael J.Q. Zhang
Eunsol Choi
RALM
32
136
0
13 Sep 2021
MATE: Multi-view Attention for Table Transformer Efficiency
MATE: Multi-view Attention for Table Transformer Efficiency
Julian Martin Eisenschlos
Maharshi Gor
Thomas Müller
William W. Cohen
LMTD
75
95
0
09 Sep 2021
WebQA: Multihop and Multimodal QA
WebQA: Multihop and Multimodal QA
Yingshan Chang
M. Narang
Hisami Suzuki
Guihong Cao
Jianfeng Gao
Yonatan Bisk
LRM
10
78
0
01 Sep 2021
Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models
Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models
Bogdan Kostić
Julian Risch
Timo Moller
RALM
110
23
0
09 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
15
229
0
02 Aug 2021
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering
  and Reading Comprehension
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension
Anna Rogers
Matt Gardner
Isabelle Augenstein
27
163
0
27 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
Paul Pu Liang
Yiwei Lyu
Xiang Fan
Zetian Wu
Yun Cheng
...
Peter Wu
Michelle A. Lee
Yuke Zhu
Ruslan Salakhutdinov
Louis-Philippe Morency
VLM
32
159
0
15 Jul 2021
Turning Tables: Generating Examples from Semi-structured Tables for
  Endowing Language Models with Reasoning Skills
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM
LRM
183
53
0
15 Jul 2021
Question Decomposition with Dependency Graphs
Question Decomposition with Dependency Graphs
Matan Hasson
Jonathan Berant
GNN
30
9
0
17 Apr 2021
FeTaQA: Free-form Table Question Answering
FeTaQA: Free-form Table Question Answering
Linyong Nan
Chia-Hsuan Hsieh
Ziming Mao
Xi Lin
Neha Verma
...
Isabel Trindade
Renusree Bandaru
Jacob Cunningham
Caiming Xiong
Dragomir R. Radev
LMTD
20
144
0
01 Apr 2021
Challenges in Information-Seeking QA: Unanswerable Questions and
  Paragraph Retrieval
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
42
51
0
22 Oct 2020
Are We Modeling the Task or the Annotator? An Investigation of Annotator
  Bias in Natural Language Understanding Datasets
Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets
Mor Geva
Yoav Goldberg
Jonathan Berant
242
320
0
21 Aug 2019
Previous
12