ResearchTrend.AI
Large Language Models for Mathematical Reasoning: Progresses and Challenges

31 January 2024
Janice Ahn
Rishu Verma
Renze Lou
Di Liu
Rui Zhang
Wenpeng Yin
    LRM

Papers citing "Large Language Models for Mathematical Reasoning: Progresses and Challenges"

50 / 91 papers shown
Title
LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data Generation
Junyu Lai
Jiakun Zhang
Shuo Xu
Taolue Chen
Zihang Wang
Yao Yang
Jiarui Zhang
Chun Cao
Jingwei Xu
22
0
0
17 May 2025
CellVerse: Do Large Language Models Really Understand Cell Biology?
Fan Zhang
Tianyu Liu
Zhihong Zhu
Yu Wang
Haoyu Wang
Donghao Zhou
Yefeng Zheng
Kun Wang
X. Wu
Pheng-Ann Heng
ELM
41
0
0
09 May 2025
Understanding LLM Scientific Reasoning through Promptings and Model's Explanation on the Answers
Alice Rueda
Mohammed S. Hassan
Argyrios Perivolaris
Bazen G. Teferra
Reza Samavi
...
Y. Wu
Wenjie Qu
Bo Cao
Divya Sharma
Sridhar Krishnan Venkat Bhat
ELM
LRM
58
0
0
02 May 2025
Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
Bartosz Piotrowski
Witold Drzewakowski
Konrad Staniszewski
Piotr Miłoś
LRM
36
0
0
23 Apr 2025
Evaluating the Goal-Directedness of Large Language Models
Tom Everitt
Cristina Garbacea
Alexis Bellot
Jonathan G. Richens
Henry Papadatos
Simeon Campos
Rohin Shah
ELM
LM&MA
LM&Ro
LRM
82
0
0
16 Apr 2025
Mathematical Capabilities of Large Language Models in Finnish Matriculation Examination
Mika Setälä
Pieta Sikström
Ville Heilala
T. Karkkainen
ELM
LRM
36
1
0
15 Apr 2025
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
Ming Li
Yongqian Li
Ziyue Li
Tianyi Zhou
LRM
34
1
0
14 Apr 2025
A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis
Xin Gao
Qizhi Pei
Zinan Tang
Yongqian Li
Honglin Lin
Jiang Wu
Conghui He
Lijun Wu
SyDa
45
0
0
11 Apr 2025
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
Chenrui Fan
Ming Li
Lichao Sun
Tianyi Zhou
LRM
51
4
0
09 Apr 2025
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
Runjin Chen
Zhenyu Zhang
Junyuan Hong
Souvik Kundu
Zhangyang Wang
OffRL
LRM
55
6
0
07 Apr 2025
Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics
Hamed Mahdavi
Alireza Hashemi
Majid Daliri
Pegah Mohammadipour
Alireza Farhadi
Samira Malek
Yekta Yazdanifard
Amir Khasahmadi
V. Honavar
ELM
LRM
66
1
0
01 Apr 2025
CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation
Jixuan Leng
Chengsong Huang
Langlin Huang
Bill Yuchen Lin
William W. Cohen
Haohan Wang
Jiaxin Huang
LRM
56
0
0
30 Mar 2025
What Makes an Evaluation Useful? Common Pitfalls and Best Practices
Gil Gekker
Meirav Segal
Dan Lahav
Omer Nevo
ELM
52
0
0
30 Mar 2025
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning
Sheng Wang
Pengan Chen
Jingqi Zhou
Qintong Li
Jingwei Dong
Jiahui Gao
Boyang Xue
Jiyue Jiang
Lingpeng Kong
Chuan Wu
SyDa
71
0
0
21 Mar 2025
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
Felix Chen
Hangjie Yuan
Yunqiu Xu
Tao Feng
Jun Cen
Pengwei Liu
Zeying Huang
Yi Yang
LRM
50
1
0
19 Mar 2025
Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences
Kedi Chen
Zhikai Lei
Fan Zhang
Yinqi Zhang
Qin Chen
Jie Zhou
Liang He
Qipeng Guo
K. Chen
Wei-na Zhang
ELM
LRM
70
0
0
17 Mar 2025
Bridging Language Models and Financial Analysis
Alejandro Lopez-Lira
Jihoon Kwon
Sangwoon Yoon
Jy-yong Sohn
Chanyeol Choi
AIFin
46
0
0
14 Mar 2025
Evaluating Mathematical Reasoning Across Large Language Models: A Fine-Grained Approach
Afrar Jahin
Arif Hassan Zidan
Wei Zhang
Yu Bao
Tianming Liu
LRM
81
1
0
13 Mar 2025
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
Yi Yang
Xiaoxuan He
Hongkun Pan
Xiyan Jiang
Yan Deng
...
Dacheng Yin
Fengyun Rao
Minfeng Zhu
Bo Zhang
Wei Chen
VLM
LRM
58
29
1
13 Mar 2025
StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error
Steve Yang
C. Wang
Yidong Wang
Xiaotao Gu
Minlie Huang
J. Tang
LRM
LLMAG
68
0
0
13 Mar 2025
Performance Comparison of Large Language Models on Advanced Calculus Problems
In Hak Moon
LRM
ELM
64
0
0
05 Mar 2025
Graph-Augmented Reasoning: Evolving Step-by-Step Knowledge Graph Retrieval for LLM Reasoning
Wenjie Wu
Yongcheng Jing
Yingjie Wang
Wenbin Hu
Dacheng Tao
RALM
LRM
76
2
0
03 Mar 2025
Collective Reasoning Among LLMs A Framework for Answer Validation Without Ground Truth
Seyed Pouyan Mousavi Davoudi
Alireza Shafiee Fard
Alireza Amiri-Margavi
LRM
64
0
0
28 Feb 2025
Starjob: Dataset for LLM-Driven Job Shop Scheduling
Henrik Abgaryan
Tristan Cazenave
Ararat Harutyunyan
41
0
0
26 Feb 2025
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
G. Wang
Minyu Gao
Shuai Yang
Ya Zhang
Lizhi He
...
Yexuan Zhang
Wanyue Li
Lu Chen
Jintao Fei
Xin Li
200
1
0
25 Feb 2025
PersonaMath: Boosting Mathematical Reasoning via Persona-Driven Data Augmentation
Jing Luo
Longze Chen
Run Luo
Liang Zhu
Chang Ao
...
A. Argha
Hamid Alinejad-Rokny
Chengming Li
Shiwen Ni
Min Yang
SyDa
AIMat
92
0
0
24 Feb 2025
The Philosophical Foundations of Growing AI Like A Child
Dezhi Luo
Yijiang Li
Hokin Deng
ReLM
LRM
53
2
0
15 Feb 2025
TableMaster: A Recipe to Advance Table Understanding with Language Models
Lang Cao
Hanbing Liu
LMTD
RALM
316
1
1
31 Jan 2025
AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models
Haoyi Zhang
Shizhao Sun
Yibo Lin
Runsheng Wang
Jiang Bian
57
0
0
31 Dec 2024
TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models
Saipraneeth Devunuri
Lewis J. Lehe
LM&MA
73
1
0
07 Dec 2024
Neuro-Symbolic Data Generation for Math Reasoning
Zenan Li
Zhi-Hua Zhou
Yuan Yao
Yu-Feng Li
Chun Cao
Fan Yang
Xian Zhang
Xiaoxing Ma
OffRL
LRM
86
8
0
06 Dec 2024
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Feihu Che
Zengqi Wen
J. Tao
ReLM
LRM
115
10
0
27 Nov 2024
Enhancing Answer Reliability Through Inter-Model Consensus of Large Language Models
Alireza Amiri-Margavi
Iman Jebellat
Ehsan Jebellat
Seyed Pouyan Mousavi Davoudi
99
2
0
25 Nov 2024
DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs
Zhihan Liu
Shenao Zhang
Yongfei Liu
Boyi Liu
Yingxiang Yang
Zhaoran Wang
113
3
0
20 Nov 2024
Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints
Nishanth Kumar
F. Ramos
Dieter Fox
Tomás Lozano-Pérez
Leslie Pack Kaelbling
Caelan Reed Garrett
LRM
LM&Ro
68
3
0
13 Nov 2024
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing
Jiaru Zou
Qing Wang
Pratyush Thakur
Nickvash Kani
LRM
50
2
0
01 Nov 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
52
3
0
24 Oct 2024
Learning Mathematical Rules with Large Language Models
Antoine Gorceix
Bastien Le Chenadec
Ahmad Rammal
N. Vadori
Manuela Veloso
23
1
0
22 Oct 2024
Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning
Pengfei He
Zitao Li
Yue Xing
Yaling Li
Jiliang Tang
Bolin Ding
LLMAG
LRM
38
1
0
18 Oct 2024
SoK: Prompt Hacking of Large Language Models
Baha Rababah
Shang Wu
Matthew Kwiatkowski
Carson Leung
Cuneyt Gurcan Akcora
AAML
43
2
0
16 Oct 2024
Language Imbalance Driven Rewarding for Multilingual Self-improving
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
J.N. Zhang
ALM
LRM
74
4
0
11 Oct 2024
TraderTalk: An LLM Behavioural ABM applied to Simulating Human Bilateral Trading Interactions
Alicia Vidler
Toby Walsh
33
1
0
10 Oct 2024
Give me a hint: Can LLMs take a hint to solve math problems?
Vansh Agrawal
Pratham Singla
Amitoj Singh Miglani
Shivank Garg
Ayush Mangal
ReLM
LRM
28
5
0
08 Oct 2024
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Hang Li
Yangqiu Song
...
Kun Wang
Hui Xiong
Philip S. Yu
Xuming Hu
Qingsong Wen
LRM
41
15
0
06 Oct 2024
Domain-Oriented Time Series Inference Agents for Reasoning and Automated Analysis
Wen Ye
Wei Yang
Defu Cao
Yizhou Zhang
Lumingyuan Tang
Jie Cai
Yan Liu
AI4TS
BDL
CoGe
41
2
0
05 Oct 2024
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Murong Yue
Wenlin Yao
Haitao Mi
Dian Yu
Ziyu Yao
Dong Yu
LRM
48
4
0
04 Oct 2024
GraphRouter: A Graph-based Router for LLM Selections
Tao Feng
Yanzhen Shen
Jiaxuan You
100
10
0
04 Oct 2024
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Di Zhang
Jianbo Wu
Jingdi Lei
Tong Che
Jiatong Li
...
Shufei Zhang
Marco Pavone
Yuqiang Li
Wanli Ouyang
Dongzhan Zhou
LRM
41
48
0
03 Oct 2024
Agent-Oriented Planning in Multi-Agent Systems
Ao Li
Yuexiang Xie
Songze Li
Fugee Tsung
Bolin Ding
Yaliang Li
AIFin
188
6
0
03 Oct 2024
Not All LLM Reasoners Are Created Equal
Arian Hosseini
Alessandro Sordoni
Daniel Toyama
Rameswar Panda
Rishabh Agarwal
LRM
51
11
0
02 Oct 2024