Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.17625
Cited By
Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports
23 May 2025
Hayato Aida
Kosuke Takahashi
Takahiro Omi
LMTD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports"
1 / 1 papers shown
Title
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Jinghui Lu
Haiyang Yu
Yanjie Wang
Yongjie Ye
Jingqun Tang
...
Qi Liu
Hao Feng
Han Wang
Hao Liu
Can Huang
178
23
0
02 Jul 2024
1