Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.01557
Cited By
Analysing Mathematical Reasoning Abilities of Neural Models
2 April 2019
D. Saxton
Edward Grefenstette
Felix Hill
Pushmeet Kohli
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Analysing Mathematical Reasoning Abilities of Neural Models"
50 / 286 papers shown
Title
Synthesis by Design: Controlled Data Generation via Structural Guidance
Lei Xu
Sirui Chen
Yuxuan Huang
Chaochao Lu
24
0
0
09 Jun 2025
Inference-Time Hyper-Scaling with KV Cache Compression
Adrian Łańcucki
Konrad Staniszewski
Piotr Nawrot
Edoardo Ponti
74
0
0
05 Jun 2025
Adaptive Task Vectors for Large Language Models
Joonseong Kang
Soojeong Lee
Subeen Park
Sumin Park
Taero Kim
Jihee Kim
Ryunyi Lee
Kyungwoo Song
29
0
0
03 Jun 2025
TRACE for Tracking the Emergence of Semantic Representations in Transformers
Nura Aljaafari
Danilo S. Carvalho
André Freitas
92
0
0
23 May 2025
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
Yuchen Yan
Jin Jiang
Zhenbang Ren
Yijun Li
Xudong Cai
...
Mengdi Zhang
Jian Shao
Yongliang Shen
Jun Xiao
Yueting Zhuang
OffRL
ALM
LRM
132
0
0
21 May 2025
Do Language Models Use Their Depth Efficiently?
Róbert Csordás
Christopher D. Manning
Christopher Potts
208
2
0
20 May 2025
YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering
Jennifer D'Souza
Hamed Babaei Giglou
Quentin Münch
ELM
109
0
0
20 May 2025
Advancing Sequential Numerical Prediction in Autoregressive Models
Xiang Fei
Jinghui Lu
Qi Sun
Hao Feng
Yanjie Wang
Wei Shi
An-Lan Wang
Jingqun Tang
Can Huang
AI4TS
203
3
0
19 May 2025
Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
Bartosz Piotrowski
Witold Drzewakowski
Konrad Staniszewski
Piotr Miłoś
LRM
67
0
0
23 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
Volkan Cevher
AAML
170
2
0
17 Apr 2025
FLIP Reasoning Challenge
Andreas Plesner
Turlan Kuzhagaliyev
Roger Wattenhofer
AAML
VLM
LRM
187
0
0
16 Apr 2025
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation
Rafiqul Rabin
Sean McGregor
Nick Judd
AAML
PILM
87
0
0
27 Mar 2025
A Survey on Mathematical Reasoning and Optimization with Large Language Models
Ali Forootani
OffRL
LRM
AI4CE
118
1
0
22 Mar 2025
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer
Honglin Lin
Zhuoshi Pan
Yu Li
Qizhi Pei
Xin Gao
Mengzhang Cai
Zeang Sheng
Lijun Wu
OffRL
LRM
93
1
0
19 Mar 2025
The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation
Olivier Gouvert
Julie Hunter
Jérôme Louradour
Christophe Cerisara
Evan Dufraisse
Yaya Sy
Laura Rivière
Jean-Pierre Lorré
OpenLLM-France community
454
0
0
15 Mar 2025
Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM
Yongqiang Yao
Jingru Tan
Kaihuan Liang
Feizhao Zhang
Yazhe Niu
Jiahao Hu
Ruihao Gong
Dahua Lin
Ningyi Xu
98
0
0
10 Mar 2025
Deep Learning of the Evolution Operator Enables Forecasting of Out-of-Training Dynamics in Chaotic Systems
Ira J. S. Shokar
Peter H. Haynes
R. Kerswell
AI4TS
86
1
0
28 Feb 2025
Tokenization is Sensitive to Language Variation
Anna Wegmann
Dong Nguyen
David Jurgens
150
2
0
21 Feb 2025
Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models
Yingqian Cui
Pengfei He
Jingying Zeng
Hui Liu
Xianfeng Tang
...
Zhen Li
Suhang Wang
Yue Xing
Jiliang Tang
Qi He
LRM
131
19
0
18 Feb 2025
Advanced Weakly-Supervised Formula Exploration for Neuro-Symbolic Mathematical Reasoning
Yuxuan Wu
Hideki Nakayama
NAI
89
1
0
02 Feb 2025
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap
Hyunwoo Ko
Guijin Son
Dasol Choi
RALM
LRM
154
12
0
05 Jan 2025
Mathematical Language Models: A Survey
Wen Liu
Hanglei Hu
Jie Zhou
Yuyang Ding
Junsong Li
...
Mengliang He
Qin Chen
Bo Jiang
Aimin Zhou
Liang He
LRM
235
14
0
03 Jan 2025
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning
Shuguang Chen
Guang Lin
LRM
484
1
0
28 Dec 2024
Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle
Hui Dai
Ryan Teehan
Mengye Ren
KELM
ELM
AIFin
47
1
0
13 Nov 2024
Quantifying artificial intelligence through algorithmic generalization
Takuya Ito
Murray Campbell
L. Horesh
Tim Klinger
Parikshit Ram
ELM
122
0
0
08 Nov 2024
Number Cookbook: Number Understanding of Language Models and How to Improve It
Haotong Yang
Yi Hu
Shijia Kang
Zhouchen Lin
Muhan Zhang
LRM
112
8
0
06 Nov 2024
Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models
Jonas Zausinger
Lars Pennig
Anamarija Kozina
Sean Sdahl
Julian Sikora
...
Anna Ketteler
Thorben Prein
Vishwa Mohan Singh
Michael Morris Danziger
Jannis Born
82
3
0
04 Nov 2024
Interleaving Text and Number Embeddings to Solve Mathemathics Problems
Marvin Alberts
Gianmarco Gabrieli
Irina Espejo Morales
51
2
0
25 Oct 2024
Mixture of Parrots: Experts improve memorization more than reasoning
Samy Jelassi
Clara Mohri
David Brandfonbrener
Alex Gu
Nikhil Vyas
Nikhil Anand
David Alvarez-Melis
Yuanzhi Li
Sham Kakade
Eran Malach
MoE
113
5
0
24 Oct 2024
How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs
Guhao Feng
Kai-Bo Yang
Yuntian Gu
Xinyue Ai
Shengjie Luo
Jiacheng Sun
Di He
Zechao Li
Liwei Wang
LRM
87
13
0
17 Oct 2024
A Survey on Data Synthesis and Augmentation for Large Language Models
Ke Wang
Jiahui Zhu
Minjie Ren
Ziqiang Liu
Shiwei Li
...
Yiming Lei
Xiaoyu Wu
Qiqi Zhan
Qingjie Liu
Yunhong Wang
SyDa
177
21
0
16 Oct 2024
ELICIT: LLM Augmentation via External In-Context Capability
Futing Wang
Jianhao Yan
Yue Zhang
Tao Lin
137
1
0
12 Oct 2024
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency
Preferred Elements
:
Kenshin Abe
Kaizaburo Chubachi
Yasuhiro Fujita
...
Yoshihiko Ozaki
Shotaro Sano
Shuji Suzuki
Tianqi Xu
Toshihiko Yanase
87
0
0
10 Oct 2024
Chain-of-Sketch: Enabling Global Visual Reasoning
Aryo Lotfi
Enrico Fini
Samy Bengio
Moin Nabi
Emmanuel Abbe
LRM
92
0
0
10 Oct 2024
MoDEM: Mixture of Domain Expert Models
Toby Simonds
Kemal Kurniawan
Jey Han Lau
MoE
74
2
0
09 Oct 2024
RespDiff: An End-to-End Multi-scale RNN Diffusion Model for Respiratory Waveform Estimation from PPG Signals
Yuyang Miao
Zehua Chen
Chong Li
Danilo Mandic
DiffM
MedIm
79
9
0
06 Oct 2024
Consistent Autoformalization for Constructing Mathematical Libraries
Lan Zhang
Xin Quan
André Freitas
AI4CE
64
4
0
05 Oct 2024
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Murong Yue
Wenlin Yao
Haitao Mi
Dian Yu
Ziyu Yao
Dong Yu
LRM
72
7
0
04 Oct 2024
ProcBench: Benchmark for Multi-Step Reasoning and Following Procedure
Ippei Fujisawa
Sensho Nobe
Hiroki Seto
Rina Onda
Yoshiaki Uchida
Hiroki Ikoma
Pei-Chun Chien
Ryota Kanai
LRM
69
4
0
04 Oct 2024
Can Models Learn Skill Composition from Examples?
Haoyu Zhao
Simran Kaur
Dingli Yu
Anirudh Goyal
Sanjeev Arora
CoGe
MoE
126
8
0
29 Sep 2024
HybridFlow: A Flexible and Efficient RLHF Framework
Guangming Sheng
Chi Zhang
Zilingfeng Ye
Xibin Wu
Wang Zhang
Ru Zhang
Size Zheng
Haibin Lin
Chuan Wu
AI4CE
222
240
0
28 Sep 2024
HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows
Wenlin Yao
Haitao Mi
Dong Yu
LRM
AI4CE
104
8
0
25 Sep 2024
ControlMath: Controllable Data Generation Promotes Math Generalist Models
Nuo Chen
Ning Wu
Jianhui Chang
Jia Li
100
4
0
20 Sep 2024
Interpreting and Improving Large Language Models in Arithmetic Calculation
Wei Zhang
Chaoqun Wan
Yonggang Zhang
Yiu-ming Cheung
Xinmei Tian
Xu Shen
Jieping Ye
LRM
109
22
0
03 Sep 2024
S^3cMath: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners
Yuchen Yan
Jin Jiang
Yang Liu
Yixin Cao
Xin Xu
Hao Fei
Xunliang Cai
Jian Shao
ReLM
LRM
KELM
236
9
0
03 Sep 2024
Chatting Up Attachment: Using LLMs to Predict Adult Bonds
Paulo Soares
Sean McCurdy
Andrew J. Gerber
Peter Fonagy
90
1
0
31 Aug 2024
Can Transformers Do Enumerative Geometry?
Baran Hashemi
Roderic G. Corominas
Alessandro Giacchetto
542
5
0
27 Aug 2024
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
Wenwen Zhuang
Xin Huang
Xiantao Zhang
Jin Zeng
LRM
123
31
0
16 Aug 2024
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2
Tom Lieberum
Senthooran Rajamanoharan
Arthur Conmy
Lewis Smith
Nicolas Sonnerat
Vikrant Varma
János Kramár
Anca Dragan
Rohin Shah
Neel Nanda
121
128
0
09 Aug 2024
InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
Bo-Wen Zhang
Yan Yan
Lin Li
Guang Liu
ReLM
LRM
33
6
0
09 Aug 2024
1
2
3
4
5
6
Next