Analysing Mathematical Reasoning Abilities of Neural Models

2 April 2019
D. Saxton
Edward Grefenstette
Felix Hill
Pushmeet Kohli
    LRM

Papers citing "Analysing Mathematical Reasoning Abilities of Neural Models"

50 / 286 papers shown
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
73
17
0
22 Jul 2024
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Guoli Yin
Haoping Bai
Shuang Ma
Feng Nan
Yanchao Sun
...
Xiaoming Wang
Jiulong Shan
Meng Cao
Ruoming Pang
Zirui Wang
LLMAG ELM
79
7
0
18 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
74
7
0
12 Jul 2024
AutoBencher: Towards Declarative Benchmark Construction
Xiang Lisa Li
Emmy Liu
Percy Liang
Tatsunori Hashimoto
88
9
0
11 Jul 2024
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?
Kuei-Chun Kao
Ruochen Wang
Cho-Jui Hsieh
ELM LRM
82
4
0
06 Jul 2024
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning
Chengpeng Li
Guanting Dong
Mingfeng Xue
Ru Peng
Xiang Wang
Dayiheng Liu
LRM ReLM
98
13
0
04 Jul 2024
How to Leverage Digit Embeddings to Represent Numbers?
Jasivan Sivakumar
N. Moosavi
65
0
0
01 Jul 2024
What Are the Odds? Language Models Are Capable of Probabilistic Reasoning
Akshay Paruchuri
Jake Garrison
Shun Liao
John Hernandez
Jacob Sunshine
Tim Althoff
Xin Liu
Daniel J. McDuff
LRM
84
9
0
18 Jun 2024
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
Yuxuan Tong
Xiwen Zhang
Rui Wang
R. Wu
Junxian He
AIMat LRM
85
43
0
18 Jun 2024
MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic
Yuyan Zhou
Liang Song
Bingning Wang
Weipeng Chen
MoMe
102
23
0
17 Jun 2024
GenQA: Generating Millions of Instructions from a Handful of Prompts
Jiuhai Chen
Rifaa Qadri
Yuxin Wen
Neel Jain
John Kirchenbauer
Dinesh Manocha
Tom Goldstein
ALM
154
24
0
14 Jun 2024
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Joongwon Kim
Bhargavi Paranjape
Tushar Khot
Hannaneh Hajishirzi
LM&Ro ELM LLMAG LRM
85
9
0
10 Jun 2024
On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Denys Pushkin
Raphael Berthier
Emmanuel Abbe
65
0
0
10 Jun 2024
SPOT: Text Source Prediction from Originality Score Thresholding
Edouard Yvinec
Gabriel Kasser
DeLMO
79
0
0
30 May 2024
Transformers Can Do Arithmetic with the Right Embeddings
Sean McLeish
Arpit Bansal
Alex Stein
Neel Jain
John Kirchenbauer
...
B. Kailkhura
A. Bhatele
Jonas Geiping
Avi Schwarzschild
Tom Goldstein
78
37
0
27 May 2024
A Survey of Multimodal Large Language Model from A Data-centric Perspective
Tianyi Bai
Hao Liang
Binwang Wan
Yanran Xu
Xi Li
...
Ping Huang
Jiulong Shan
Conghui He
Binhang Yuan
Wentao Zhang
137
45
0
26 May 2024
Cost-Effective Online Multi-LLM Selection with Versatile Reward Models
Xiangxiang Dai
Jin Li
Xutong Liu
Anqi Yu
J. C. Lui
99
13
0
26 May 2024
Disentangling and Integrating Relational and Sensory Information in Transformer Architectures
Awni Altabaa
John Lafferty
59
3
0
26 May 2024
Models That Prove Their Own Correctness
Noga Amit
S. Goldwasser
Orr Paradise
G. Rothblum
LRM
75
5
0
24 May 2024
GECKO: Generative Language Model for English, Code and Korean
Sungwoo Oh
Donggyu Kim
VLM
82
0
0
24 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
Lotem Elber-Dorozko
AI4CE
189
4
0
24 May 2024
LARS-VSA: A Vector Symbolic Architecture For Learning with Abstract Rules
Mohamed Mejri
C. Amarnath
Abhijit Chatterjee
114
1
0
23 May 2024
Investigating Symbolic Capabilities of Large Language Models
Neisarg Dave
Daniel Kifer
C. Lee Giles
A. Mali
ELM LRM
51
3
0
21 May 2024
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Hongwei Liu
Zilong Zheng
Yuxuan Qiao
Haodong Duan
Zhiwei Fei
Fengzhe Zhou
Wenwei Zhang
Songyang Zhang
Dahua Lin
Kai-xiang Chen
121
68
0
20 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
130
21
0
16 May 2024
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks
Xuanfan Ni
Piji Li
ELM LRM
65
8
0
16 May 2024
Thinking Tokens for Language Modeling
David Herel
Tomas Mikolov
LRM
94
3
0
14 May 2024
Quantifying the Capabilities of LLMs across Scale and Precision
Sher Badshah
Hassan Sajjad
74
14
0
06 May 2024
Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions
Jordan Meadows
Tamsin James
André Freitas
ReLM LRM AI4CE
70
1
0
29 Apr 2024
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Ulme Wennberg
G. Henter
MILM
71
1
0
25 Apr 2024
Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks
Avinash Anand
Mohit Gupta
Kritarth Prasad
Navya Singla
Sanjana Sanjeev
Jatin Kumar
A. Shivam
R. Shah
LRM
86
14
0
19 Apr 2024
Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory
Hung Le
D. Nguyen
Kien Do
Svetha Venkatesh
T. Tran
57
0
0
18 Apr 2024
Best Practices and Lessons Learned on Synthetic Data for Language Models
Ruibo Liu
Jerry W. Wei
Fangyu Liu
Chenglei Si
Yanzhe Zhang
...
Steven Zheng
Daiyi Peng
Diyi Yang
Denny Zhou
Andrew M. Dai
SyDa EgoV
126
96
0
11 Apr 2024
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Yifan Xu
Xiao Liu
Xinghan Liu
Zhenyu Hou
Yueyan Li
...
Aohan Zeng
Zhengxiao Du
Wenyi Zhao
Jie Tang
Yuxiao Dong
LRM
99
42
0
03 Apr 2024
Reasoning in Transformers -- Mitigating Spurious Correlations and Reasoning Shortcuts
Daniel Enström
Viktor Kjellberg
Moa Johansson
LRM
51
3
0
17 Mar 2024
Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning
Xiuqin Zhong
Shengyuan Yan
Gongqi Lin
Hongguang Fu
Liang Xu
Siwen Jiang
Lei Huang
Wei Fang
AIMat
54
0
0
14 Mar 2024
ResLoRA: Identity Residual Mapping in Low-Rank Adaption
Shuhua Shi
Shaohan Huang
Minghui Song
Zhoujun Li
Zihan Zhang
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
AI4CE
74
15
0
28 Feb 2024
Adversarial Math Word Problem Generation
Roy Xie
Chengxuan Huang
Junlin Wang
Bhuwan Dhingra
AAML
90
2
0
27 Feb 2024
Measuring Vision-Language STEM Skills of Neural Models
Jianhao Shen
Ye Yuan
Srbuhi Mirzoyan
Ming Zhang
Chenguang Wang
VLM
117
12
0
27 Feb 2024
How Do Humans Write Code? Large Models Do It the Same Way Too
Long Li
Xuzheng He
LRM
43
0
0
24 Feb 2024
Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models
Yanzheng Xiang
Hanqi Yan
Lin Gui
Yulan He
68
9
0
23 Feb 2024
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs
Aaditya K. Singh
DJ Strouse
116
61
0
22 Feb 2024
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning
Renqiu Xia
Bo Zhang
Hancheng Ye
Xiangchao Yan
Qi Liu
...
Min Dou
Botian Shi
Junchi Yan
Yu Qiao
LRM
182
68
0
19 Feb 2024
CliqueParcel: An Approach For Batching LLM Prompts That Jointly Optimizes Efficiency And Faithfulness
Jiayi Liu
Tinghan Yang
Jennifer Neville
56
11
0
17 Feb 2024
LogicPrpBank: A Corpus for Logical Implication and Equivalence
Zhexiong Liu
Jing Zhang
Jiaying Lu
Wenjing Ma
Joyce C. Ho
ReLM LRM
71
0
0
14 Feb 2024
Limits of Transformer Language Models on Learning to Compose Algorithms
Jonathan Thomm
Aleksandar Terzić
Giacomo Camposampiero
Michael Hersche
Bernhard Schölkopf
Abbas Rahimi
139
8
0
08 Feb 2024
Integration of cognitive tasks into artificial general intelligence test for large models
Youzhi Qu
Chen Wei
Penghui Du
Wenxin Che
Chi Zhang
...
Bin Hu
Kai Du
Haiyan Wu
Jia Liu
Quanying Liu
ELM
64
10
0
04 Feb 2024
Hashmarks: Privacy-Preserving Benchmarks for High-Stakes AI Evaluation
P. Bricman
55
0
0
01 Dec 2023
YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
Shaohua Wu
Xudong Zhao
Shenling Wang
Jiangang Luo
Lingjun Li
...
Wei Wang
Tong Yu
Rongguo Zhang
Jiahua Zhang
Chao Wang
OSLM
101
6
0
27 Nov 2023
Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Yuhui Zhang
Brandon McKinzie
Zhe Gan
Vaishaal Shankar
Alexander Toshev
38
3
0
27 Nov 2023