Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.14858
Cited By
v1
v2 (latest)
Solving Quantitative Reasoning Problems with Language Models
29 June 2022
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
V. Ramasesh
Ambrose Slone
Cem Anil
Imanol Schlag
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLM
ELM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Solving Quantitative Reasoning Problems with Language Models"
50 / 609 papers shown
Title
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
An Yang
Beichen Zhang
Binyuan Hui
Bofei Gao
Bowen Yu
...
Mingfeng Xue
Runji Lin
Tianyu Liu
Xingzhang Ren
Zhenru Zhang
OSLM
LRM
162
321
0
18 Sep 2024
Jailbreaking Large Language Models with Symbolic Mathematics
Emet Bethany
Mazal Bethany
Juan Arturo Nolazco Flores
S. Jha
Peyman Najafirad
AAML
60
6
0
17 Sep 2024
Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Kulin Shah
Nishanth Dikkala
Xin Wang
Rina Panigrahy
ELM
ReLM
LRM
115
14
0
16 Sep 2024
Synthetic continued pretraining
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
105
16
0
11 Sep 2024
Semi-Supervised Reward Modeling via Iterative Self-Training
Yifei He
Haoxiang Wang
Ziyan Jiang
Alexandros Papangelis
Han Zhao
OffRL
117
4
0
10 Sep 2024
E2CL: Exploration-based Error Correction Learning for Embodied Agents
Hanlin Wang
Chak Tou Leong
Jian Wang
Wenjie Li
131
4
0
05 Sep 2024
Reasoning Aware Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling
Guangya Wan
Yuqi Wu
Jie Chen
Sheng Li
LRM
114
8
0
30 Aug 2024
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Hritik Bansal
Arian Hosseini
Rishabh Agarwal
Vinh Q. Tran
Mehran Kazemi
SyDa
OffRL
LRM
124
49
0
29 Aug 2024
A Statistical Framework for Data-dependent Retrieval-Augmented Models
Soumya Basu
A. S. Rawat
Manzil Zaheer
RALM
88
0
0
27 Aug 2024
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler
Songlin Yang
Matthew Stallone
Mayank Mishra
Gaoyuan Zhang
Shawn Tan
Aditya Prasad
Adriana Meza Soria
David D. Cox
Yikang Shen
118
16
0
23 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
147
12
0
23 Aug 2024
Fine-tuning Smaller Language Models for Question Answering over Financial Documents
Karmvir Singh Phogat
Sai Akhil Puranam
Sridhar Dasaratha
Chetan Harsha
Shashishekar Ramakrishna
LRM
54
4
0
22 Aug 2024
SubgoalXL: Subgoal-based Expert Learning for Theorem Proving
Xueliang Zhao
Lin Zheng
Haige Bo
Changran Hu
Urmish Thakker
Lingpeng Kong
LRM
105
6
0
20 Aug 2024
Plan with Code: Comparing approaches for robust NL to DSL generation
Nastaran Bassamzadeh
Chhaya Methani
35
1
0
15 Aug 2024
CoverBench: A Challenging Benchmark for Complex Claim Verification
Alon Jacovi
Moran Ambar
Eyal Ben-David
Uri Shaham
Amir Feder
Mor Geva
Dror Marcus
Avi Caciularu
LMTD
100
4
0
06 Aug 2024
Apple Intelligence Foundation Language Models
Tom Gunter
Zirui Wang
Chong-Jun Wang
Ruoming Pang
Andy Narayanan
...
Xinwen Liu
Yang Zhao
Yin Xia
Zhile Ren
Zhongzheng Ren
148
40
0
29 Jul 2024
Questionable practices in machine learning
Gavin Leech
Juan J. Vazquez
Misha Yagudin
Niclas Kupper
Laurence Aitchison
110
6
0
17 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
78
7
0
12 Jul 2024
Retrieved In-Context Principles from Previous Mistakes
Hao Sun
Yong Jiang
Bo Wang
Yingyan Hou
Yan Zhang
Pengjun Xie
Fei Huang
96
1
0
08 Jul 2024
On Speeding Up Language Model Evaluation
Jin Peng Zhou
Christian K. Belardi
Ruihan Wu
Travis Zhang
Carla P. Gomes
Wen Sun
Kilian Q. Weinberger
168
2
0
08 Jul 2024
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?
Kuei-Chun Kao
Ruochen Wang
Cho-Jui Hsieh
ELM
LRM
82
4
0
06 Jul 2024
Universal Length Generalization with Turing Programs
Kaiying Hou
David Brandfonbrener
Sham Kakade
Samy Jelassi
Eran Malach
121
11
0
03 Jul 2024
A Comparative Study of DSL Code Generation: Fine-Tuning vs. Optimized Retrieval Augmentation
Nastaran Bassamzadeh
Chhaya Methani
3DV
71
7
0
03 Jul 2024
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Yiyuan Li
Shichao Sun
Pengfei Liu
LRM
144
0
0
01 Jul 2024
How to Leverage Digit Embeddings to Represent Numbers?
Jasivan Sivakumar
N. Moosavi
65
0
0
01 Jul 2024
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
Mingqian He
Yongliang Shen
Wenqi Zhang
Zeqi Tan
Weiming Lu
LRM
80
7
0
29 Jun 2024
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data
Meng Fang
Xiangpeng Wan
Fei Lu
Fei Xing
Kai Zou
74
28
0
26 Jun 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
161
77
0
24 Jun 2024
Anomaly Detection of Tabular Data Using LLMs
Aodong Li
Yunhan Zhao
Chen Qiu
Marius Kloft
Padhraic Smyth
Maja R. Rudolph
Stephan Mandt
118
9
0
24 Jun 2024
PORT: Preference Optimization on Reasoning Traces
Salem Lahlou
Abdalgader Abubaker
Hakim Hacid
LRM
124
5
0
23 Jun 2024
Proving Olympiad Algebraic Inequalities without Human Demonstrations
Chenrui Wei
Mengzhou Sun
Wei Wang
LRM
112
9
0
20 Jun 2024
What Are the Odds? Language Models Are Capable of Probabilistic Reasoning
Akshay Paruchuri
Jake Garrison
Shun Liao
John Hernandez
Jacob Sunshine
Tim Althoff
Xin Liu
Daniel J. McDuff
LRM
89
9
0
18 Jun 2024
Step-level Value Preference Optimization for Mathematical Reasoning
Guoxin Chen
Minpeng Liao
Chengxi Li
Kai Fan
LRM
99
42
0
16 Jun 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
Yu Zhang
Xiusi Chen
Bowen Jin
Sheng Wang
Shuiwang Ji
Wei Wang
Jiawei Han
142
43
0
16 Jun 2024
ReMI: A Dataset for Reasoning with Multiple Images
Mehran Kazemi
Nishanth Dikkala
Ankit Anand
Petar Dević
Ishita Dasgupta
...
Bahare Fatemi
Pranjal Awasthi
Dee Guo
Sreenivas Gollapudi
Ahmed Qureshi
LRM
VLM
112
17
0
13 Jun 2024
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Bahare Fatemi
Mehran Kazemi
Anton Tsitsulin
Karishma Malkan
Jinyeong Yim
John Palowitch
Sungyong Seo
Jonathan J. Halcrow
Bryan Perozzi
LRM
100
39
0
13 Jun 2024
TelecomRAG: Taming Telecom Standards with Retrieval Augmented Generation and LLMs
G. M. Yilma
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Costa-Perez
51
14
0
11 Jun 2024
Improving Autoformalization using Type Checking
Auguste Poiroux
Gail Weiss
Viktor Kunčak
Antoine Bosselut
123
4
0
11 Jun 2024
On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Denys Pushkin
Raphael Berthier
Emmanuel Abbe
65
0
0
10 Jun 2024
LoCoCo: Dropping In Convolutions for Long Context Compression
Ruisi Cai
Yuandong Tian
Zhangyang Wang
Beidi Chen
95
11
0
08 Jun 2024
UltraMedical: Building Specialized Generalists in Biomedicine
Kaiyan Zhang
Sihang Zeng
Ermo Hua
Ning Ding
Zhang-Ren Chen
...
Xuekai Zhu
Xingtai Lv
Hu Jinfang
Zhiyuan Liu
Bowen Zhou
LM&MA
115
33
0
06 Jun 2024
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems
Huaiyuan Ying
Zijian Wu
Yihan Geng
Zheng Yuan
Dahua Lin
Kai Chen
173
42
0
06 Jun 2024
Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
Haolong Li
Yu Ma
Yinqi Zhang
Chen Ye
Jie Chen
ReLM
LRM
64
4
0
04 Jun 2024
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
96
4
0
04 Jun 2024
Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Mahdi Sabbaghi
George Pappas
Hamed Hassani
Surbhi Goel
114
6
0
04 Jun 2024
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Xiaoyuan Li
Wenjie Wang
Moxin Li
Junrong Guo
Yang Zhang
Fuli Feng
ELM
LRM
94
20
0
02 Jun 2024
Stress-Testing Capability Elicitation With Password-Locked Models
Ryan Greenblatt
Fabien Roger
Dmitrii Krasheninnikov
David M. Krueger
93
19
0
29 May 2024
MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Zhenwen Liang
Dian Yu
Wenhao Yu
Wenlin Yao
Zhihan Zhang
Xiangliang Zhang
Dong Yu
LRM
113
13
0
29 May 2024
Towards Dialogues for Joint Human-AI Reasoning and Value Alignment
Elfia Bezou-Vrakatseli
O. Cocarascu
Sanjay Modgil
78
0
0
28 May 2024
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Jiaxiang Li
Siliang Zeng
Hoi-To Wai
Chenliang Li
Alfredo García
Mingyi Hong
135
18
0
28 May 2024
Previous
1
2
3
4
5
6
...
11
12
13
Next