Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.13867
Cited By
Mathematical Capabilities of ChatGPT
31 January 2023
Simon Frieder
Luca Pinchetti
Alexis Chevalier
Ryan-Rhys Griffiths
Tommaso Salvatori
Thomas Lukasiewicz
P. Petersen
Julius Berner
ELM
AI4MH
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mathematical Capabilities of ChatGPT"
50 / 200 papers shown
Title
Exploring the Limits of ChatGPT in Software Security Applications
Fangzhou Wu
Qingzhao Zhang
Ati Priya Bajaj
Tiffany Bao
Ning Zhang
Ruoyu Wang
Chaowei Xiao
ALM
SILM
ELM
25
8
0
08 Dec 2023
DeceptPrompt: Exploiting LLM-driven Code Generation via Adversarial Natural Language Instructions
Fangzhou Wu
Xiaogeng Liu
Chaowei Xiao
AAML
SILM
34
26
0
07 Dec 2023
Large Language Models for Mathematicians
Simon Frieder
Julius Berner
P. Petersen
Thomas Lukasiewicz
13
4
0
07 Dec 2023
InteraSSort: Interactive Assortment Planning Using Large Language Models
Saketh Reddy Karra
Theja Tulabandhula
37
3
0
20 Nov 2023
Exploring the Potential of Large Language Models in Computational Argumentation
Guizhen Chen
Liying Cheng
Anh Tuan Luu
Lidong Bing
LLMAG
LRM
29
23
0
15 Nov 2023
When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Hao Peng
Xiaozhi Wang
Jianhui Chen
Weikai Li
Y. Qi
...
Zhili Wu
Kaisheng Zeng
Bin Xu
Lei Hou
Juanzi Li
34
28
0
15 Nov 2023
Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game
Pengyu Cheng
Yifan Yang
Jian Li
Yong Dai
Tianhao Hu
Peixin Cao
Nan Du
Xiaolong Li
28
28
0
14 Nov 2023
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation
Ruomeng Ding
Chaoyun Zhang
Lu Wang
Yong Xu
Ming-Jie Ma
Wei Zhang
Si Qin
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
LRM
35
61
0
07 Nov 2023
An Interdisciplinary Outlook on Large Language Models for Scientific Research
James Boyko
Joseph Cohen
Nathan Fox
Maria Han Veiga
Jennifer I-Hsiu Li
...
Andreas H. Rauch
Kenneth N. Reid
Soumi Tribedi
Anastasia Visheratina
Xin Xie
38
18
0
03 Nov 2023
The Expressibility of Polynomial based Attention Scheme
Zhao Song
Guangyi Xu
Junze Yin
36
5
0
30 Oct 2023
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Christoph Leiter
Juri Opitz
Daniel Deutsch
Yang Gao
Rotem Dror
Steffen Eger
ALM
LRM
ELM
40
31
0
30 Oct 2023
Enhancing Chemistry Learning with ChatGPT, Bing Chat, Bard, and Claude as Agents-to-Think-With: A Comparative Case Study
Renato P. dos Santos
20
4
0
23 Oct 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Da Song
Xuan Xie
Jiayang Song
Derui Zhu
Yuheng Huang
Felix Juefei Xu
Lei Ma
ALM
40
3
0
22 Oct 2023
AI for Mathematics: A Cognitive Science Perspective
Cedegao E. Zhang
Katherine M. Collins
Adrian Weller
Joshua B. Tenenbaum
36
10
0
19 Oct 2023
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Shiyuan Huang
Siddarth Mamidanna
Shreedhar Jangam
Yilun Zhou
Leilani H. Gilpin
LRM
MILM
ELM
43
67
0
17 Oct 2023
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT
Xiaoshuai Song
Keqing He
Pei Wang
Guanting Dong
Yutao Mou
Jingang Wang
Yunsen Xian
Xunliang Cai
Weiran Xu
LRM
42
14
0
16 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLM
ELM
LRM
70
8
0
13 Oct 2023
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams
Ethan Callanan
A. Mbakwe
Antony Papadimitriou
Yulong Pei
Mathieu Sibue
Xiaodan Zhu
Zhiqiang Ma
Xiaomo Liu
Sameena Shah
ELM
38
14
0
12 Oct 2023
A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
Shiping Yang
Renliang Sun
Xiao-Yi Wan
HILM
40
41
0
10 Oct 2023
OptiMUS: Optimization Modeling Using MIP Solvers and large language models
Ali AhmadiTeshnizi
Wenzhi Gao
Madeleine Udell
LLMAG
16
23
0
09 Oct 2023
The potential of large language models for improving probability learning: A study on ChatGPT3.5 and first-year computer engineering students
Angel Udias
A. Alonso-Ayuso
Ignacio Sanchez
Sonia Hernandez
Maria Eugenia Castellanos
R. M. Diez
Emilio Lopez Cano
47
1
0
09 Oct 2023
FELM: Benchmarking Factuality Evaluation of Large Language Models
Shiqi Chen
Yiran Zhao
Jinghan Zhang
Ethan Chern
Siyang Gao
Pengfei Liu
Junxian He
HILM
41
33
0
01 Oct 2023
Language Models as a Service: Overview of a New Paradigm and its Challenges
Emanuele La Malfa
Aleksandar Petrov
Simon Frieder
Christoph Weinhuber
Ryan Burnell
Raza Nazar
Anthony Cohn
Nigel Shadbolt
Michael Wooldridge
ALM
ELM
35
3
0
28 Sep 2023
ChatGPT & Mechanical Engineering: Examining performance on the FE Mechanical Engineering and Undergraduate Exams
Matthew Frenkel
Hebah Emara
34
2
0
26 Sep 2023
What does ChatGPT know about natural science and engineering?
Lukas Schulze Balhorn
Jana M. Weber
Stefan Buijsman
J. Hildebrandt
Martina Ziefle
Artur M. Schweidtmann
AI4MH
AI4CE
ELM
14
4
0
18 Sep 2023
How much can ChatGPT really help Computational Biologists in Programming?
C. R. Rahman
Limsoon Wong
AI4CE
16
2
0
17 Sep 2023
ChatGPT-4 with Code Interpreter can be used to solve introductory college-level vector calculus and electromagnetism problems
Tanuj Kumar
M. Kats
21
9
0
16 Sep 2023
TrafficGPT: Viewing, Processing and Interacting with Traffic Foundation Models
Siyao Zhang
Daocheng Fu
Zhao Zhang
Bin Yu
Pinlong Cai
19
48
0
13 Sep 2023
Towards LLM-based Autograding for Short Textual Answers
Johannes Schneider
Bernd Schenk
Christina Niklaus
AI4Ed
22
32
0
09 Sep 2023
Beyond Static Datasets: A Deep Interaction Approach to LLM Evaluation
Jiatong Li
Rui Li
Qi Liu
34
15
0
08 Sep 2023
LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection
Jiaxing Qi
Shaohan Huang
Zhongzhi Luan
Carol J. Fung
Hailong Yang
D. Qian
22
26
0
03 Sep 2023
No Train Still Gain. Unleash Mathematical Reasoning of Large Language Models with Monte Carlo Tree Search Guided by Energy Function
Haotian Xu
LRM
38
12
0
01 Sep 2023
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
Rui Mao
Guanyi Chen
Xulang Zhang
Frank Guerin
Min Zhang
ELM
LM&MA
38
101
0
24 Aug 2023
Are ChatGPT and GPT-4 Good Poker Players? -- A Pre-Flop Analysis
Akshat Gupta
LLMAG
AI4MH
26
10
0
23 Aug 2023
Diversity Measures: Domain-Independent Proxies for Failure in Language Model Queries
Noel Ngu
Nathaniel Lee
Paulo Shakarian
35
4
0
22 Aug 2023
A criterion for Artificial General Intelligence: hypothetic-deductive reasoning, tested on ChatGPT
L. Vervoort
Vitaliy Mizyakov
Anastasia V. Ugleva
ReLM
ELM
LRM
24
1
0
05 Aug 2023
Does Correction Remain A Problem For Large Language Models?
Xiaowu Zhang
Xiaotian Zhang
Cheng Yang
Hang Yan
Xipeng Qiu
LRM
KELM
22
5
0
03 Aug 2023
What Is the Difference Between a Mountain and a Molehill? Quantifying Semantic Labeling of Visual Features in Line Charts
Dennis Bromley
V. Setlur
31
10
0
02 Aug 2023
Olio: A Semantic Search Interface for Data Repositories
V. Setlur
Andriy Kanyuka
Arjun Srinivasan
17
10
0
31 Jul 2023
How to Design and Deliver Courses for Higher Education in the AI Era: Insights from Exam Data Analysis
A. Wazan
I. Taj
Abdulhadi Shoufan
R. Laborde
Rémi Venant
ELM
32
1
0
22 Jul 2023
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
Xiaoxuan Wang
Ziniu Hu
Pan Lu
Yanqiao Zhu
Jieyu Zhang
Satyen Subramaniam
Arjun R. Loomba
Shichang Zhang
Yizhou Sun
Wei Wang
ELM
LRM
30
87
0
20 Jul 2023
PharmacyGPT: The AI Pharmacist
Zheng Liu
Zihao Wu
Mengxuan Hu
Bokai Zhao
Lin Zhao
...
Ye Shen
Sheng Li
Brian Murray
Tianming Liu
Andrea Sikora
LM&MA
AI4MH
45
0
0
19 Jul 2023
Unmasking the giant: A comprehensive evaluation of ChatGPT's proficiency in coding algorithms and data structures
Sayed Erfan Arefin
T. Ashrafi
H. Al-Qudah
Ynes Ineza
Abdul Serwadda
ELM
36
6
0
10 Jul 2023
Can LLMs be Good Financial Advisors?: An Initial Study in Personal Decision Making for Optimized Outcomes
Kausik Lakkaraju
Sai Krishna Revanth Vuruma
Vishal Pallagani
Bharath Muppasani
Biplav Srivastava
29
15
0
08 Jul 2023
A Survey on Evaluation of Large Language Models
Yu-Chu Chang
Xu Wang
Jindong Wang
Yuanyi Wu
Linyi Yang
...
Yue Zhang
Yi-Ju Chang
Philip S. Yu
Qian Yang
Xingxu Xie
ELM
LM&MA
ALM
75
1,529
0
06 Jul 2023
Evaluating the Effectiveness of Large Language Models in Representing Textual Descriptions of Geometry and Spatial Relations
Yu Ji
Song Gao
41
16
0
05 Jul 2023
Evaluating ChatGPT's Decimal Skills and Feedback Generation in a Digital Learning Game
H. Nguyen
Hayden Stec
Xinying Hou
Sarah Di
B. McLaren
LRM
17
30
0
29 Jun 2023
Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models
Zaid Alyafeai
Maged S. Alshaibani
Badr AlKhamissi
H. Luqman
Ebrahim Alareqi
A. Fadel
ELM
LM&MA
AI4MH
22
17
0
28 Jun 2023
MyCrunchGPT: A chatGPT assisted framework for scientific machine learning
Varun V. Kumar
Leonard Gleyzer
Adar Kahana
K. Shukla
George Karniadakis
AI4CE
39
11
0
27 Jun 2023
Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination
Xuan-Quy Dao
Ngoc-Bich Le
21
30
0
10 Jun 2023
Previous
1
2
3
4
Next