Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.12567
Cited By
v1
v2
v3
v4 (latest)
FCMR: Robust Evaluation of Financial Cross-Modal Multi-Hop Reasoning
17 December 2024
Seunghee Kim
Changhyeon Kim
Taeuk Kim
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FCMR: Robust Evaluation of Financial Cross-Modal Multi-Hop Reasoning"
25 / 25 papers shown
Title
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems
Sejong Kim
Hyunseo Song
Hyunwoo Seo
Hyunjun Kim
RALM
121
3
0
19 Mar 2025
An Entailment Tree Generation Approach for Multimodal Multi-Hop Question Answering with Mixture-of-Experts and Iterative Feedback Mechanism
Qing Zhang
Haocheng Lv
Jie Liu
Zheyu Chen
Jianyong Duan
Hao Wang
Li He
Mingying Xv
136
2
0
08 Dec 2024
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Ziliang Gan
Yu Lu
D. Zhang
Haohan Li
Che Liu
...
Haipang Wu
Chaoyou Fu
Z. Xu
Rongjunchen Zhang
Yong Dai
110
13
0
05 Nov 2024
CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart
Bowen Zhao
Tianhao Cheng
Yuejie Zhang
Ying Cheng
Rui Feng
Xiaobo Zhang
LMTD
59
1
0
28 Oct 2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Yuan Yao
Tianyu Yu
Ao Zhang
Chongyi Wang
Junbo Cui
...
Xu Han
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
VLM
MLLM
149
481
0
03 Aug 2024
Uncovering Limitations of Large Language Models in Information Seeking from Tables
Chaoxu Pang
Yixuan Cao
Chunhao Yang
Ping Luo
RALM
LMTD
75
6
0
06 Jun 2024
TANQ: An open domain dataset of table answered questions
Mubashara Akhtar
Chenxi Pang
Andreea Marzoca
Yasemin Altun
Julian Martin Eisenschlos
LMTD
RALM
102
2
0
13 May 2024
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Kaining Ying
Fanqing Meng
Jin Wang
Zhiqiang Li
Han Lin
...
Yali Wang
Yuning Qiao
Ping Luo
Kaipeng Zhang
Wenqi Shao
99
99
0
24 Apr 2024
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
Ahmed Masry
Mehrad Shahmohammadi
Md. Rizwan Parvez
Enamul Hoque
Shafiq Joty
112
37
0
14 Mar 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Xiang Yue
Yuansheng Ni
Kai Zhang
Tianyu Zheng
Ruoqi Liu
...
Yibo Liu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
OSLM
ELM
VLM
471
960
0
27 Nov 2023
Improved Baselines with Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
249
2,834
0
05 Oct 2023
SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension
Bohao Li
Rui Wang
Guangzhi Wang
Yuying Ge
Yixiao Ge
Ying Shan
MLLM
ELM
136
572
0
30 Jul 2023
MMBench: Is Your Multi-modal Model an All-around Player?
Yuanzhan Liu
Haodong Duan
Yuanhan Zhang
Yue Liu
Songyang Zhang
...
Jiaqi Wang
Conghui He
Ziwei Liu
Kai-xiang Chen
Dahua Lin
204
1,060
0
12 Jul 2023
Unified Language Representation for Question Answering over Text, Tables, and Images
Yu Bowen
Cheng Fu
Haiyang Yu
Fei Huang
Yongbin Li
LMTD
85
23
0
29 Jun 2023
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
...
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
256
1,690
0
30 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
1.7K
13,558
0
27 Feb 2023
DePlot: One-shot visual language reasoning by plot-to-table translation
Fangyu Liu
Julian Martin Eisenschlos
Francesco Piccinno
Syrine Krichene
Chenxi Pang
Kenton Lee
Mandar Joshi
Wenhu Chen
Nigel Collier
Yasemin Altun
VLM
ReLM
LRM
111
102
0
20 Dec 2022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Revanth Reddy Gangi Reddy
Xilin Rui
Manling Li
Xudong Lin
Haoyang Wen
...
Joey Tianyi Zhou
Avirup Sil
Shih-Fu Chang
Alex Schwing
Heng Ji
82
32
0
20 Dec 2021
FinQA: A Dataset of Numerical Reasoning over Financial Data
Zhiyu Chen
Wenhu Chen
Charese Smiley
Sameena Shah
Iana Borova
...
Reema N Moussa
Matthew I. Beane
Ting-Hao 'Kenneth' Huang
Bryan R. Routledge
Wenjie Wang
AIMat
166
356
0
01 Sep 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
Fengbin Zhu
Wenqiang Lei
Youcheng Huang
Chao Wang
Shuo Zhang
Jiancheng Lv
Fuli Feng
Tat-Seng Chua
AIMat
137
305
0
17 May 2021
MultiModalQA: Complex Question Answering over Text, Tables and Images
Alon Talmor
Ori Yoran
Amnon Catav
Dan Lahav
Yizhong Wang
Akari Asai
Gabriel Ilharco
Hannaneh Hajishirzi
Jonathan Berant
LMTD
102
163
0
13 Apr 2021
Open Question Answering over Tables and Text
Wenhu Chen
Ming-Wei Chang
Eva Schlinger
Wenjie Wang
William W. Cohen
LMTD
RALM
112
202
0
20 Oct 2020
HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data
Wenhu Chen
Hanwen Zha
Zhiyu Zoey Chen
Wenhan Xiong
Hong Wang
Wenjie Wang
85
309
0
15 Apr 2020
ManyModalQA: Modality Disambiguation and QA over Diverse Inputs
Darryl Hannan
Akshay Jain
Joey Tianyi Zhou
AAML
93
60
0
22 Jan 2020
PAWS: Paraphrase Adversaries from Word Scrambling
Yuan Zhang
Jason Baldridge
Luheng He
90
545
0
01 Apr 2019
1