ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.05660
  4. Cited By
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning
  Tasks

NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks

12 April 2022
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Peter Clark
Chitta Baral
Ashwin Kalyan
    AIMat
    ReLM
    ELM
    LRM
ArXivPDFHTML

Papers citing "NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks"

23 / 73 papers shown
Title
Teaching Probabilistic Logical Reasoning to Transformers
Teaching Probabilistic Logical Reasoning to Transformers
Aliakbar Nafar
K. Venable
Parisa Kordjamshidi
ReLM
LRM
24
3
0
22 May 2023
Can NLP Models Correctly Reason Over Contexts that Break the Common
  Assumptions?
Can NLP Models Correctly Reason Over Contexts that Break the Common Assumptions?
Neeraj Varshney
Mihir Parmar
Nisarg Patel
Divij Handa
Sayantan Sarkar
Man Luo
Chitta Baral
LRM
34
4
0
20 May 2023
Document Understanding Dataset and Evaluation (DUDE)
Document Understanding Dataset and Evaluation (DUDE)
Jordy Van Landeghem
Rubèn Pérez Tito
Łukasz Borchmann
Michal Pietruszka
Pawel Józiak
...
Bertrand Ackaert
Ernest Valveny
Matthew Blaschko
Sien Moens
Tomasz Stanislawek
VGen
24
53
0
15 May 2023
Comprehensive Solution Program Centric Pretraining for Table-and-Text
  Hybrid Numerical Reasoning
Comprehensive Solution Program Centric Pretraining for Table-and-Text Hybrid Numerical Reasoning
Qianying Liu
Dongsheng Yang
Wenjie Zhong
Fei Cheng
Sadao Kurohashi
AIMat
36
0
0
12 May 2023
How does GPT-2 compute greater-than?: Interpreting mathematical
  abilities in a pre-trained language model
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model
Michael Hanna
Ollie Liu
Alexandre Variengien
LRM
193
121
0
30 Apr 2023
Can neural networks do arithmetic? A survey on the elementary numerical
  skills of state-of-the-art deep learning models
Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models
Alberto Testolin
AIMat
35
20
0
14 Mar 2023
Augmented Language Models: a Survey
Augmented Language Models: a Survey
Grégoire Mialon
Roberto Dessì
Maria Lomeli
Christoforos Nalmpantis
Ramakanth Pasunuru
...
Jane Dwivedi-Yu
Asli Celikyilmaz
Edouard Grave
Yann LeCun
Thomas Scialom
LRM
KELM
47
368
0
15 Feb 2023
A Survey of Deep Learning for Mathematical Reasoning
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM
LRM
46
139
0
20 Dec 2022
Reasoning with Language Model Prompting: A Survey
Reasoning with Language Model Prompting: A Survey
Shuofei Qiao
Yixin Ou
Ningyu Zhang
Xiang Chen
Yunzhi Yao
Shumin Deng
Chuanqi Tan
Fei Huang
Huajun Chen
ReLM
ELM
LRM
71
311
0
19 Dec 2022
UniGeo: Unifying Geometry Logical Reasoning via Reformulating
  Mathematical Expression
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
Jiaqi Chen
Tong Li
Jinghui Qin
Pan Lu
Liang Lin
Chongyu Chen
Xiaodan Liang
AIMat
LRM
47
89
0
06 Dec 2022
Chaining Simultaneous Thoughts for Numerical Reasoning
Chaining Simultaneous Thoughts for Numerical Reasoning
Zhihong Shao
Fei Huang
Minlie Huang
AIMat
AI4CE
16
18
0
29 Nov 2022
Lila: A Unified Benchmark for Mathematical Reasoning
Lila: A Unified Benchmark for Mathematical Reasoning
Swaroop Mishra
Matthew Finlayson
Pan Lu
Leonard Tang
Sean Welleck
...
Tanmay Rajpurohit
Oyvind Tafjord
Ashish Sabharwal
Peter Clark
Ashwin Kalyan
ELM
AIMat
ReLM
LRM
33
0
0
31 Oct 2022
A Causal Framework to Quantify the Robustness of Mathematical Reasoning
  with Language Models
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
Alessandro Stolfo
Zhijing Jin
Kumar Shridhar
Bernhard Schölkopf
Mrinmaya Sachan
ELM
OOD
LRM
35
61
0
21 Oct 2022
A Survey of Parameters Associated with the Quality of Benchmarks in NLP
A Survey of Parameters Associated with the Quality of Benchmarks in NLP
Swaroop Mishra
Anjana Arunkumar
Chris Bryan
Chitta Baral
37
1
0
14 Oct 2022
"John is 50 years old, can his son be 65?" Evaluating NLP Models'
  Understanding of Feasibility
"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility
Himanshu Gupta
Neeraj Varshney
Swaroop Mishra
Kuntal Kumar Pal
Saurabh Arjun Sawant
Kevin Scaria
Siddharth Goyal
Chitta Baral
ELM
20
14
0
14 Oct 2022
FOLIO: Natural Language Reasoning with First-Order Logic
FOLIO: Natural Language Reasoning with First-Order Logic
Simeng Han
Hailey Schoelkopf
Yilun Zhao
Zhenting Qi
Martin Riddell
...
Yingbo Zhou
Caiming Xiong
Rex Ying
Arman Cohan
Dragomir R. Radev
ReLM
LRM
39
94
0
02 Sep 2022
Solving Quantitative Reasoning Problems with Language Models
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLM
ELM
LRM
61
755
0
29 Jun 2022
Neural Retriever and Go Beyond: A Thesis Proposal
Neural Retriever and Go Beyond: A Thesis Proposal
Man Luo
35
1
0
31 May 2022
Is a Question Decomposition Unit All We Need?
Is a Question Decomposition Unit All We Need?
Pruthvi H. Patel
Swaroop Mishra
Mihir Parmar
Chitta Baral
ReLM
158
51
0
25 May 2022
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard
  Contexts
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
ReLM
LRM
24
11
0
25 May 2022
Let the Model Decide its Curriculum for Multitask Learning
Let the Model Decide its Curriculum for Multitask Learning
Neeraj Varshney
Swaroop Mishra
Chitta Baral
25
8
0
19 May 2022
In-BoXBART: Get Instructions into Biomedical Multi-Task Learning
In-BoXBART: Get Instructions into Biomedical Multi-Task Learning
Mihir Parmar
Swaroop Mishra
Mirali Purohit
Man Luo
M. H. Murad
Chitta Baral
28
22
0
15 Apr 2022
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
Previous
12