ResearchTrend.AI
Chart-to-Text: A Large-Scale Benchmark for Chart Summarization

12 March 2022
Shankar Kanthara
Rixie Tiffany Ko Leong
Xiang Lin
Ahmed Masry
Megh Thakkar
Enamul Hoque
Shafiq R. Joty

Papers citing "Chart-to-Text: A Large-Scale Benchmark for Chart Summarization"

50 / 96 papers shown
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding
Zheng Liu
Mengjie Liu
J. Chen
Jingwei Xu
Bin Cui
Conghui He
Wentao Zhang
MLLM
57
0
0
14 Apr 2025
ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering
Ahmed Masry
Mohammed Saidul Islam
Mahir Ahmed
Aayush Bajaj
Firoz Kabir
...
Mehrad Shahmohammadi
Megh Thakkar
Md. Rizwan Parvez
E. Hoque
Shafiq R. Joty
ELM
30
0
0
07 Apr 2025
Enhancing Chart-to-Code Generation in Multimodal Large Language Models via Iterative Dual Preference Learning
Zhihan Zhang
Yixin Cao
Lizi Liao
28
0
0
03 Apr 2025
RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning
Alexander Vogel
Omar Moured
Yufan Chen
Jiaming Zhang
Rainer Stiefelhagen
35
0
0
29 Mar 2025
Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping
Weili Zeng
Ziyuan Huang
Kaixiang Ji
Yichao Yan
VLM
42
1
0
26 Mar 2025
Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering
Zixin Chen
Sicheng Song
Kashun Shum
Yanna Lin
Rui Sheng
Huamin Qu
62
2
0
23 Mar 2025
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
A. Nassar
Andres Marafioti
Matteo Omenetti
Maksym Lysak
Nikolaos Livathinos
...
Yusik Kim
A. Said Gurbuz
Michele Dolfi
Miquel Farré
Peter W. J. Staar
55
3
0
14 Mar 2025
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks
Feng Ni
Kui Huang
Yao Lu
Wenyu Lv
Guanzhong Wang
Zeyu Chen
Y. Liu
VLM
48
0
0
06 Mar 2025
Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts
Xiangnan Chen
Yuancheng Fang
Qian Xiao
Juncheng Billy Li
J. Lin
Siliang Tang
Yi Yang
Yueting Zhuang
70
0
0
06 Mar 2025
M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance
Qingpei Guo
Kaiyou Song
Zipeng Feng
Ziping Ma
Qinglong Zhang
...
Yunxiao Sun
Tai-Wei Chang
Jingdong Chen
Ming Yang
Jun Zhou
MLLM
VLM
84
3
0
26 Feb 2025
End-to-End Chart Summarization via Visual Chain-of-Thought in Vision-Language Models
Raymond Choi
Frank Burns
Chase Lawrence
LRM
64
1
0
24 Feb 2025
Baichuan-Omni-1.5 Technical Report
Yadong Li
J. Liu
Tao Zhang
Tao Zhang
S. Chen
...
Jianhua Xu
Haoze Sun
Mingan Lin
Zenan Zhou
Weipeng Chen
AuLLM
72
10
0
28 Jan 2025
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
S. Kamath S
Nakul Sharma
Manish Gupta
Anand Mishra
48
1
0
28 Jan 2025
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
Xuanle Zhao
Xianzhen Luo
Qi Shi
C. L. P. Chen
Shuo Wang
Wanxiang Che
Zhiyuan Liu
Maosong Sun
MLLM
54
2
0
11 Jan 2025
ChartAdapter: Large Vision-Language Model for Chart Summarization
Peixin Xu
Yujuan Ding
Wenqi Fan
25
2
0
31 Dec 2024
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
...
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
MLLM
VLM
104
2
0
20 Dec 2024
Chimera: Improving Generalist Model with Domain-Specific Experts
Tianshuo Peng
M. Li
Hongbin Zhou
Renqiu Xia
Renrui Zhang
...
Aojun Zhou
Botian Shi
Tao Chen
Bo Zhang
Xiangyu Yue
88
4
0
08 Dec 2024
ChartKG: A Knowledge-Graph-Based Representation for Chart Images
Zhiguang Zhou
Haoxuan Wang
Zhengqing Zhao
Fengling Zheng
Yongheng Wang
Wei Chen
Yong Wang
29
0
0
13 Oct 2024
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback
Fatemeh Pesaran Zadeh
Juyeon Kim
Jin-Hwa Kim
Gunhee Kim
ALM
46
1
0
05 Oct 2024
Natural Language Generation for Visualizations: State of the Art, Challenges and Future Directions
Enamul Hoque
Mohammed Saidul Islam
31
2
0
29 Sep 2024
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement
Ishani Mondal
Zongxia Li
Yufang Hou
Anandhavelu Natarajan
Aparna Garimella
Jordan Boyd-Graber
31
3
0
28 Sep 2024
From Graphs to Words: A Computer-Assisted Framework for the Production of Accessible Text Descriptions
Qiang Xu
Thomas Hurtut
34
0
0
26 Sep 2024
SynChart: Synthesizing Charts from Language Models
Mengchen Liu
Qixiu Li
Dongdong Chen
Dong Chen
Jianmin Bao
Yunsheng Li
MLLM
23
0
0
25 Sep 2024
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization
Zhuoyue Wan
Yuanfeng Song
Shuaimin Li
Chen Jason Zhang
Raymond Chi-Wing Wong
VLM
37
1
0
14 Aug 2024
DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts
Mohammed Saidul Islam
Md Tahmid Rahman Laskar
Md. Rizwan Parvez
Enamul Hoque
Shafiq R. Joty
DiffM
37
6
0
09 Aug 2024
VITA: Towards Open-Source Interactive Omni Multimodal LLM
Chaoyou Fu
Haojia Lin
Zuwei Long
Yunhang Shen
Meng Zhao
...
Ran He
Rongrong Ji
Yunsheng Wu
Caifeng Shan
Xing Sun
MLLM
39
80
0
09 Aug 2024
Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning
Xingchen Zeng
Haichuan Lin
Yilin Ye
Wei Zeng
52
15
0
29 Jul 2024
On Pre-training of Multimodal Language Models Customized for Chart Understanding
Wan-Cyuan Fan
Yen-Chun Chen
Mengchen Liu
Lu Yuan
Leonid Sigal
40
5
0
19 Jul 2024
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Ahmed Masry
Megh Thakkar
Aayush Bajaj
Aaryaman Kartha
Enamul Hoque
Shafiq R. Joty
VLM
36
26
0
04 Jul 2024
MindBench: A Comprehensive Benchmark for Mind Map Structure Recognition and Analysis
Lei Chen
Feng Yan
Yujie Zhong
Shaoxiang Chen
Zequn Jie
Lin Ma
36
3
0
03 Jul 2024
VisEval: A Benchmark for Data Visualization in the Era of Large Language Models
Nan Chen
Yuge Zhang
Jiahang Xu
Kan Ren
Yuqing Yang
37
9
0
01 Jul 2024
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
Shengbang Tong
Ellis L Brown
Penghao Wu
Sanghyun Woo
Manoj Middepogu
...
Xichen Pan
Austin Wang
Rob Fergus
Yann LeCun
Saining Xie
3DV
MLLM
48
278
0
24 Jun 2024
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Cheng Yang
Chufan Shi
Yaxin Liu
Bo Shui
Junjie Wang
...
Yuxiang Zhang
Gongye Liu
Xiaomei Nie
Deng Cai
Yujiu Yang
MLLM
LRM
48
22
0
14 Jun 2024
MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
Tianle Gu
Zeyang Zhou
Kexin Huang
Dandan Liang
Yixu Wang
...
Keqing Wang
Yujiu Yang
Yan Teng
Yu Qiao
Yingchun Wang
ELM
44
12
0
11 Jun 2024
Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning? An Extensive Investigation into the Capabilities and Limitations of LVLMs
Mohammed Saidul Islam
Raian Rahman
Ahmed Masry
Md Tahmid Rahman Laskar
Mir Tafseer Nayeem
Enamul Hoque
LRM
ELM
36
4
0
01 Jun 2024
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Pengyuan Lyu
Yulin Li
Hao Zhou
Weihong Ma
Xingyu Wan
...
Liang Wu
Chengquan Zhang
Kun Yao
Errui Ding
Jingdong Wang
36
7
0
31 May 2024
Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation
Omar Moured
Shahid Ali Farooqui
Karin Muller
Sharifeh Fadaeijouybari
Thorsten Schwarz
Mohammed Javed
Rainer Stiefelhagen
27
1
0
29 May 2024
Faithful Chart Summarization with ChaTS-Pi
Syrine Krichene
Francesco Piccinno
Fangyu Liu
Julian Martin Eisenschlos
32
1
0
29 May 2024
AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks
Omar Moured
Jiaming Zhang
M. Sarfraz
Rainer Stiefelhagen
32
1
0
22 May 2024
Exploring the Capability of LLMs in Performing Low-Level Visual Analytic Tasks on SVG Data Visualizations
Zhongzhen Xu
Emily Wall
38
11
0
29 Apr 2024
Generative AI for Visualization: State of the Art and Future Directions
Yilin Ye
Jianing Hao
Yihan Hou
Zhan Wang
Shishi Xiao
Yuyu Luo
Wei Zeng
31
39
0
28 Apr 2024
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning
Liang Zhang
Anwen Hu
Haiyang Xu
Mingshi Yan
Yichen Xu
Qin Jin
Ji Zhang
Fei Huang
39
15
0
25 Apr 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
Zhuofan Zong
Bingqi Ma
Dazhong Shen
Guanglu Song
Hao Shao
Dongzhi Jiang
Hongsheng Li
Yu Liu
MoE
40
40
0
19 Apr 2024
OneChart: Purify the Chart Structural Extraction via One Auxiliary Token
Jinyue Chen
Lingyu Kong
Haoran Wei
Chenglong Liu
Zheng Ge
Liang Zhao
Jian‐Yuan Sun
Chunrui Han
Xiangyu Zhang
41
22
0
15 Apr 2024
Prompting for Numerical Sequences: A Case Study on Market Comment Generation
Masayuki Kawarada
Tatsuya Ishigaki
Hiroya Takamura
27
3
0
03 Apr 2024
SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings
Ting-Yao Hsu
Chieh-Yang Huang
Shih-Hong Huang
Ryan A. Rossi
Sungchul Kim
Tong Yu
C. Lee Giles
‘Kenneth’ Huang
19
6
0
26 Mar 2024
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
Zhuowan Li
Bhavan A. Jasani
Peng Tang
Shabnam Ghadar
LRM
32
8
0
25 Mar 2024
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Anwen Hu
Haiyang Xu
Jiabo Ye
Mingshi Yan
Liang Zhang
...
Chen Li
Ji Zhang
Qin Jin
Fei Huang
Jingren Zhou
VLM
45
105
0
19 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Kung-Hsiang Huang
Hou Pong Chan
Yi Ren Fung
Haoyi Qiu
Mingyang Zhou
Shafiq R. Joty
Shih-Fu Chang
Heng Ji
AI4TS
64
18
0
18 Mar 2024
ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization
Mengsha Liu
Daoyuan Chen
Yaliang Li
Guian Fang
Ying Shen
30
18
0
17 Mar 2024