Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.06915
Cited By
DocFinQA: A Long-Context Financial Reasoning Dataset
12 January 2024
Varshini Reddy
Rik Koncel-Kedziorski
Viet Dac Lai
Michael Krumdick
Charles Lovering
Chris Tanner
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DocFinQA: A Long-Context Financial Reasoning Dataset"
16 / 16 papers shown
Title
LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs
Omar Choukrani
Idriss Malek
Daniil Orel
Zhuohan Xie
Zangir Iklassov
Martin Takáč
Salem Lahlou
LLMAG
ELM
LRM
17
0
0
17 May 2025
FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation
Chanyeol Choi
Jihoon Kwon
Jaeseon Ha
Hojun Choi
Chaewoon Kim
Yongjae Lee
Jy-yong Sohn
Alejandro Lopez-Lira
RALM
61
0
0
22 Apr 2025
The Use of Gaze-Derived Confidence of Inferred Operator Intent in Adjusting Safety-Conscious Haptic Assistance
Jeremy D. Webb
Michael Bowman
Songpo Li
Xiaoli Zhang
36
0
0
04 Apr 2025
Extract, Match, and Score: An Evaluation Paradigm for Long Question-context-answer Triplets in Financial Analysis
Bo Hu
Han Yuan
Vlad Pandelea
Wuqiong Luo
Yingzhu Zhao
Zheng Ma
60
0
0
20 Mar 2025
DocPuzzle: A Process-Aware Benchmark for Evaluating Realistic Long-Context Reasoning Capabilities
Tianyi Zhuang
Chuqiao Kuang
Xiaoguang Li
Yihua Teng
Jihao Wu
Yufei Wang
Lifeng Shang
RALM
ELM
LRM
72
0
0
25 Feb 2025
PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving
Mihir Parmar
Xin Liu
Palash Goyal
Yanfei Chen
L. Le
...
Hootan Nakhost
Chitta Baral
Chen-Yu Lee
Tomas Pfister
Hamid Palangi
46
1
0
22 Feb 2025
Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria
Joonwon Jang
Jaehee Kim
Wonbin Kweon
Hwanjo Yu
LRM
47
1
0
03 Jan 2025
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework
Yew Ken Chia
Liying Cheng
Hou Pong Chan
Chaoqun Liu
Maojia Song
Sharifah Mahani Aljunied
Soujanya Poria
Lidong Bing
RALM
VLM
52
5
0
09 Nov 2024
STRUX: An LLM for Decision-Making with Structured Explanations
Yiming Lu
Yebowen Hu
H. Foroosh
Wei Jin
Fei Liu
33
0
0
16 Oct 2024
MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs
Lei Wang
Shan Dong
Yuhui Xu
Hanze Dong
Yalu Wang
Amrita Saha
Ee-Peng Lim
Caiming Xiong
Doyen Sahoo
LRM
48
2
0
07 Oct 2024
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Omer Goldman
Alon Jacovi
Aviv Slobodkin
Aviya Maimon
Ido Dagan
Reut Tsarfaty
69
11
0
29 Jun 2024
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Ivan Rodkin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALM
ALM
LRM
ReLM
ELM
51
61
0
14 Jun 2024
Venn Diagram Prompting : Accelerating Comprehension with Scaffolding Effect
Sakshi Mahendru
Tejul Pandit
38
0
0
08 Jun 2024
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Hanshi Sun
Zhuoming Chen
Xinyu Yang
Yuandong Tian
Beidi Chen
46
49
0
18 Apr 2024
Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization
Shiyue Zhang
David Wan
Joey Tianyi Zhou
HILM
52
27
0
08 Sep 2022
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
253
701
0
27 Aug 2021
1