Analysing Mathematical Reasoning Abilities of Neural Models


2 April 2019
D. Saxton
Edward Grefenstette
Felix Hill
Pushmeet Kohli
    LRM

Papers citing "Analysing Mathematical Reasoning Abilities of Neural Models"

50 / 286 papers shown
Title
Feature emergence via margin maximization: case studies in algebraic tasks
Depen Morwani
Benjamin L. Edelman
Costin-Andrei Oncescu
Rosie Zhao
Sham Kakade
84
16
0
13 Nov 2023
Emergent Communication for Rules Reasoning
Yuxuan Guo
Yifan Hao
Rui Zhang
Enshuai Zhou
Zidong Du
...
Shaohui Peng
Di Huang
Rui Chen
Qi Guo
Yunji Chen
LLMAG LRM AI4CE
96
0
0
08 Nov 2023
Multi-Operational Mathematical Derivations in Latent Space
Marco Valentino
Jordan Meadows
Lan Zhang
André Freitas
82
5
0
02 Nov 2023
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
111
45
0
27 Oct 2023
SoK: Memorization in General-Purpose Large Language Models
Valentin Hartmann
Anshuman Suri
Vincent Bindschaedler
David Evans
Shruti Tople
Robert West
KELM LLMAG
92
24
0
24 Oct 2023
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
Rujie Wu
Xiaojian Ma
Zhenliang Zhang
Wei Wang
Qing Li
Song-Chun Zhu
Yizhou Wang
LRM VLM
153
9
0
16 Oct 2023
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Jing Xiong
Jianhao Shen
Ye Yuan
Haiming Wang
Yichun Yin
...
Yinya Huang
Chuanyang Zheng
Xiaodan Liang
Ming Zhang
Qun Liu
AIMat LRM
54
20
0
16 Oct 2023
Adaptivity and Modularity for Efficient Generalization Over Task Complexity
Samira Abnar
Omid Saremi
Laurent Dinh
Shantel Wilson
Miguel Angel Bautista
...
Vimal Thilak
Etai Littwin
Jiatao Gu
Josh Susskind
Samy Bengio
108
6
0
13 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
99
74
0
10 Oct 2023
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
Ke Wang
Houxing Ren
Aojun Zhou
Zimu Lu
Sichun Luo
Weikang Shi
Renrui Zhang
Linqi Song
Mingjie Zhan
Hongsheng Li
ReLM LRM SyDa
119
106
0
05 Oct 2023
SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training
Kazem Meidani
Parshin Shojaee
Chandan K. Reddy
A. Farimani
126
19
0
03 Oct 2023
Boolformer: Symbolic Regression of Logic Functions with Transformers
Stéphane d'Ascoli
Samy Bengio
Josh Susskind
Emmanuel Abbe
82
5
0
21 Sep 2023
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Neha Sengupta
Sunil Kumar Sahu
Bokang Jia
Satheesh Katipomu
Haonan Li
...
A. Jackson
Hector Xuguang Ren
Preslav Nakov
Timothy Baldwin
Eric P. Xing
LRM
101
41
0
30 Aug 2023
AskIt: Unified Programming Interface for Programming with Large Language Models
Katsumi Okuda
Saman P. Amarasinghe
ELM
40
4
0
29 Aug 2023
Learning the greatest common divisor: explaining transformer predictions
François Charton
99
18
0
29 Aug 2023
Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought
Bin Lei
Pei-Hung Lin
C. Liao
Caiwen Ding
ReLM ELM LRM AI4CE
78
40
0
16 Aug 2023
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Sewon Min
Suchin Gururangan
Eric Wallace
Hannaneh Hajishirzi
Noah A. Smith
Luke Zettlemoyer
AILaw
105
67
0
08 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRM ALM
127
205
0
03 Aug 2023
IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation
Thiviyan Thanapalasingam
Emile van Krieken
Peter Bloem
Paul T. Groth
62
1
0
13 Jul 2023
Towards Robust and Efficient Continual Language Learning
Adam Fisch
Amal Rannen-Triki
Razvan Pascanu
J. Bornschein
Angeliki Lazaridou
E. Gribovskaya
Marc'Aurelio Ranzato
CLL
59
1
0
11 Jul 2023
SALSA VERDE: a machine learning attack on Learning With Errors with sparse small secrets
Cathy Li
Emily Wenger
Zeyuan Allen-Zhu
François Charton
Kristin E. Lauter
AAML
65
11
0
20 Jun 2023
Graph Structure and Feature Extrapolation for Out-of-Distribution Generalization
Xiner Li
Shurui Gui
Youzhi Luo
Shuiwang Ji
OODD OOD
89
14
0
13 Jun 2023
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
Chris Cundy
Stefano Ermon
91
12
0
08 Jun 2023
PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts
Kaijie Zhu
Jindong Wang
Jiaheng Zhou
Zichen Wang
Hao Chen
...
Linyi Yang
Weirong Ye
Yue Zhang
Neil Zhenqiang Gong
Xingxu Xie
SILM
135
144
0
07 Jun 2023
Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization
Shurui Gui
Meng Liu
Xiner Li
Youzhi Luo
Shuiwang Ji
CML OOD
90
30
0
01 Jun 2023
The Impact of Positional Encoding on Length Generalization in Transformers
Amirhossein Kazemnejad
Inkit Padhi
Karthikeyan N. Ramamurthy
Payel Das
Siva Reddy
79
207
0
31 May 2023
Determinantal Point Process Attention Over Grid Cell Code Supports Out of Distribution Generalization
S. S. Mondal
Steven M. Frankland
Taylor Webb
Jonathan D. Cohen
77
1
0
28 May 2023
FERMAT: An Alternative to Accuracy for Numerical Reasoning
Jasivan Sivakumar
N. Moosavi
ReLM LRM
93
4
0
27 May 2023
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
Marek Kadlčík
Michal Štefánik
Ondřej Sotolář
Vlastimil Martinek
LRM
72
15
0
24 May 2023
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Ahmed Masry
P. Kavehzadeh
Do Xuan Long
Enamul Hoque
Shafiq Joty
LRM
95
113
0
24 May 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Ariel Ekgren
Amaru Cuba Gyllensten
Felix Stollenwerk
Joey Öhman
T. Isbister
Evangelia Gogoulou
F. Carlsson
Alice Heiman
Judit Casademont
Magnus Sahlgren
88
13
0
22 May 2023
A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers
Jordan Meadows
Marco Valentino
Damien Teney
André Freitas
120
8
0
21 May 2023
TheoremQA: A Theorem-driven Question Answering dataset
Wenhu Chen
Ming Yin
Max Ku
Pan Lu
Yixin Wan
Xueguang Ma
Jianyu Xu
Xinyi Wang
Tony Xia
AIMat
131
140
0
21 May 2023
GPT-3.5, GPT-4, or BARD? Evaluating LLMs Reasoning Ability in Zero-Shot Setting and Performance Boosting Through Prompts
Jessica Nayeli López Espejel
E. Ettifouri
Mahaman Sanoussi Yahaya Alassan
El Mehdi Chouham
Walid Dahhane
ELM LRM
73
90
0
21 May 2023
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
Dao Xuan-Quy
Le Ngoc-Bich
Vo The-Duy
Phan Xuan-Dung
Ngo Bac-Bien
Nguyen Van-Tien
Nguyen Thi-My-Thanh
Nguyen Hong-Phuoc
61
16
0
20 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
133
37
0
17 May 2023
Code Execution with Pre-trained Language Models
Chenxiao Liu
Shuai Lu
Weizhu Chen
Daxin Jiang
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
Nan Duan
ELM
112
27
0
08 May 2023
Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Zhanpeng Zeng
Cole Hawkins
Min-Fong Hong
Aston Zhang
Nikolaos Pappas
Vikas Singh
Shuai Zheng
72
8
0
07 May 2023
Approximating CKY with Transformers
Ghazal Khalighinejad
Ollie Liu
Sam Wiseman
109
2
0
03 May 2023
MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks
Lei Zhang
Yuge Zhang
Kan Ren
Dongsheng Li
Yuqing Yang
LLMAG
90
41
0
28 Apr 2023
Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition
Matteo Muffo
A. Cocco
Enrico Bertino
ReLM
75
25
0
21 Apr 2023
Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers
Awni Altabaa
Taylor Webb
Jonathan D. Cohen
John Lafferty
82
7
0
01 Apr 2023
The Nordic Pile: A 1.2TB Nordic Dataset for Language Modeling
Joey Öhman
S. Verlinden
Ariel Ekgren
Amaru Cuba Gyllensten
T. Isbister
Evangelia Gogoulou
F. Carlsson
Magnus Sahlgren
50
11
0
30 Mar 2023
Koala: An Index for Quantifying Overlaps with Pre-training Corpora
Thuy-Trang Vu
Xuanli He
Gholamreza Haffari
Ehsan Shareghi
CLL
73
15
0
26 Mar 2023
Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models
Alberto Testolin
AIMat
72
22
0
14 Mar 2023
SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Emmanuel Abbe
Enric Boix-Adserà
Theodor Misiakiewicz
FedML MLT
167
86
0
21 Feb 2023
Tree-Based Representation and Generation of Natural and Mathematical Language
Alexander Scarlatos
Andrew Lan
52
19
0
15 Feb 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Yejin Bang
Samuel Cahyawijaya
Nayeon Lee
Wenliang Dai
Dan Su
...
Tiezheng Yu
Willy Chung
Quyet V. Do
Yan Xu
Pascale Fung
ReLM LRM
166
1,400
0
08 Feb 2023
GLADIS: A General and Large Acronym Disambiguation Benchmark
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
ELM
98
4
0
03 Feb 2023
Recursive Neural Networks with Bottlenecks Diagnose (Non-)Compositionality
Verna Dankers
Ivan Titov
78
2
0
31 Jan 2023