arXiv: 2303.10420
A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models
18 March 2023
Junjie Ye
Xuanting Chen
Nuo Xu
Can Zu
Zekai Shao
Shichun Liu
Yuhan Cui
Zeyang Zhou
Chao Gong
Yang Shen
Jie Zhou
Siming Chen
Tao Gui
Qi Zhang
Xuanjing Huang
ELM
Papers citing
"A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models"
50 / 167 papers shown
Title
Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow
Ziyu Zhou
Yihang Wu
J. Yang
Zhan Xiao
Rongjun Li
LRM
29
0
0
13 May 2025
ICE-Pruning: An Iterative Cost-Efficient Pruning Pipeline for Deep Neural Networks
Wenhao Hu
Paul Henderson
José Cano
32
0
0
12 May 2025
A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
Junjie Ye
Caishuang Huang
Zhe Chen
Wenjie Fu
Chenyuan Yang
...
Tao Gui
Qi Zhang
Zhongchao Shi
Jianping Fan
Xuanjing Huang
ALM
43
0
0
12 May 2025
Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans?
Valeria Pastorino
N. Moosavi
43
0
0
08 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
33
0
0
08 May 2025
Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization
Ajwad Abrar
Farzana Tabassum
Sabbir Ahmed
LM&MA
ELM
AI4MH
43
0
0
08 May 2025
CHORUS: Zero-shot Hierarchical Retrieval and Orchestration for Generating Linear Programming Code
Tasnim Ahmed
Salimur Choudhury
29
0
0
02 May 2025
SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning
Yiting Wang
Wanghao Ye
Ping Guo
Yexiao He
Zhilin Wang
...
Sihan Chen
Ankur Srivastava
Qingfu Zhang
Gang Qu
Ang Li
43
0
0
14 Apr 2025
StruPhantom: Evolutionary Injection Attacks on Black-Box Tabular Agents Powered by Large Language Models
Yang Feng
Xudong Pan
AAML
36
0
0
14 Apr 2025
Iterative Self-Training for Code Generation via Reinforced Re-Ranking
Nikita Sorokin
I. Sedykh
Valentin Malykh
31
0
0
13 Apr 2025
Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation
Toqeer Ehsan
Thamar Solorio
140
0
0
07 Apr 2025
Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering
Hamed Jelodar
Samita Bai
Parisa Hamedi
Hesamodin Mohammadian
R. Razavi-Far
Ali Ghorbani
39
1
0
07 Apr 2025
From Tokens to Lattices: Emergent Lattice Structures in Language Models
Bo Xiong
Steffen Staab
LRM
24
0
0
04 Apr 2025
GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction
Tung Sum Thomas Kwok
Chi-Hua Wang
Guang Cheng
LMTD
71
1
0
19 Mar 2025
Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment
Wei Chen
Han Ding
Meng Yuan
Zhao Zhang
Deqing Wang
Fuzhen Zhuang
155
0
0
18 Mar 2025
Breaking Free from MMI: A New Frontier in Rationalization by Probing Input Utilization
Wei Liu
Zhiying Deng
Zhongyu Niu
Jun Wang
Yining Qi
Zhigang Zeng
Ruixuan Li
47
2
0
08 Mar 2025
Are LLMs Ready for Practical Adoption for Assertion Generation?
Vaishnavi Pulavarthi
Deeksha Nandal
Soham Dan
Debjit Pal
56
2
0
28 Feb 2025
NutriGen: Personalized Meal Plan Generator Leveraging Large Language Models to Enhance Dietary and Nutritional Adherence
Saman Khamesian
Asiful Arefeen
Stephanie M. Carpenter
Hassan Ghasemzadeh
60
0
0
28 Feb 2025
Pastiche Novel Generation Creating: Fan Fiction You Love in Your Favorite Author's Style
Xueran Han
Yuhan Liu
Mingzhe Li
Wei Liu
Sen Hu
Rui Yan
Zhiqiang Xu
Xiuying Chen
69
0
0
24 Feb 2025
Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning
Xuetao Ma
Wenbin Jiang
Hua Huang
LRM
68
1
0
24 Feb 2025
Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances
Yaozu Wu
Dongyuan Li
Yankai Chen
Renhe Jiang
Henry Peng Zou
Liancheng Fang
Zhen Wang
Philip S. Yu
LLMAG
70
2
0
24 Feb 2025
PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent
Jiateng Liu
Lin Ai
Zizhou Liu
Payam Karisani
Zheng Hui
May Fung
Preslav Nakov
Julia Hirschberg
Heng Ji
DiffM
90
4
0
17 Feb 2025
SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models
Diyana Muhammed
Gollam Rabby
Sören Auer
LLMAG
HILM
81
0
0
03 Feb 2025
How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs
Jialun Cao
Yuk-Kit Chan
Zixuan Ling
Wenxuan Wang
Shuqing Li
...
Pinjia He
Shuai Wang
Zibin Zheng
Michael R. Lyu
Shing-Chi Cheung
ALM
71
2
0
18 Jan 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Junjie Ye
Zhengyin Du
Xuesong Yao
Weijian Lin
Yufei Xu
...
Siyu Yuan
Tao Gui
Qi Zhang
Xuanjing Huang
Jiecao Chen
56
0
0
05 Jan 2025
Using Large Language Models for Automated Grading of Student Writing about Science
Chris Impey
Matthew Wenger
Nikhil Garuda
Shahriar Golchin
Sarah Stamer
ELM
AI4Ed
44
3
0
25 Dec 2024
LLM4AD: A Platform for Algorithm Design with Large Language Model
Fei Liu
Rui-Xun Zhang
Zhuoliang Xie
Rui Sun
Kai Li
Xi Lin
Zhenkun Wang
Zhichao Lu
Qingfu Zhang
50
3
0
23 Dec 2024
Cognition Chain for Explainable Psychological Stress Detection on Social Media
Xin Wang
Boyan Gao
Yi Dai
Lei Cao
Liang Zhao
Yuqing Yang
David A. Clifton
68
0
0
18 Dec 2024
A Survey of Calibration Process for Black-Box LLMs
Liangru Xie
Hui Liu
Jingying Zeng
Xianfeng Tang
Yan Han
Chen Luo
Jing Huang
Zhen Li
Suhang Wang
Qi He
74
1
0
17 Dec 2024
AIDBench: A benchmark for evaluating the authorship identification capability of large language models
Zichen Wen
Dadi Guo
Huishuai Zhang
77
0
0
20 Nov 2024
Understanding Student Sentiment on Mental Health Support in Colleges Using Large Language Models
Palak Sood
Chengyang He
Divyanshu Gupta
Yue Ning
Ping Wang
AI4MH
75
0
0
18 Nov 2024
A Hierarchical Language Model For Interpretable Graph Reasoning
Sambhav Khurana
Xiner Li
Shurui Gui
Shuiwang Ji
LRM
34
0
0
29 Oct 2024
Is GPT-4 Less Politically Biased than GPT-3.5? A Renewed Investigation of ChatGPT's Political Biases
Erik Weber
Jérôme Rutinowski
Niklas Jost
Markus Pauly
25
0
0
28 Oct 2024
AskBeacon -- Performing genomic data exchange and analytics with natural language
Anuradha Wickramarachchi
Shakila Tonni
Sonali Majumdar
Sarvnaz Karimi
Sulev Kõks
Brendan Hosking
Jordi Rambla
Natalie A. Twine
Yatish Jain
Denis C. Bauer
LM&MA
16
0
0
22 Oct 2024
FLARE: Faithful Logic-Aided Reasoning and Exploration
Erik Arakelyan
Pasquale Minervini
Pat Verga
Patrick Lewis
Isabelle Augenstein
ReLM
LRM
69
2
0
14 Oct 2024
Accurate and Regret-aware Numerical Problem Solver for Tabular Question Answering
Yuxiang Wang
Jianzhong Qi
Junhao Gan
LMTD
53
2
0
10 Oct 2024
Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Zaiquan Yang
Yuhao Liu
Jiaying Lin
Gerhard Hancke
Rynson W. H. Lau
31
1
0
02 Oct 2024
Decoding Hate: Exploring Language Models' Reactions to Hate Speech
Paloma Piot
Javier Parapar
43
1
0
01 Oct 2024
Ethical and Scalable Automation: A Governance and Compliance Framework for Business Applications
Haocheng Lin
34
2
0
25 Sep 2024
Konstruktor: A Strong Baseline for Simple Knowledge Graph Question Answering
M. Lysyuk
Mikhail Salnikov
Pavel Braslavski
Alexander Panchenko
36
1
0
24 Sep 2024
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Sihui Yang
Keping Bi
Wanqing Cui
Jiafeng Guo
Xueqi Cheng
23
2
0
23 Sep 2024
Interactive Machine Teaching by Labeling Rules and Instances
Giannis Karamanolakis
Daniel J. Hsu
Luis Gravano
35
0
0
08 Sep 2024
Impact of ChatGPT on the writing style of condensed matter physicists
Shaojun Xu
Xiaohui Ye
Mengqi Zhang
Pei Wang
23
0
0
30 Aug 2024
CTP-LLM: Clinical Trial Phase Transition Prediction Using Large Language Models
Michael Reinisch
Jianfeng He
C. Liao
Sauleh Ahmad Siddiqui
Bei Xiao
33
1
0
20 Aug 2024
DAC: Decomposed Automation Correction for Text-to-SQL
Dingzirui Wang
Longxu Dou
Xuanliang Zhang
Qingfu Zhu
Wanxiang Che
54
1
0
16 Aug 2024
Integrating Large Language Models and Knowledge Graphs for Extraction and Validation of Textual Test Data
Zili Wang
Marco Balduini
Federico De Santis
Andrea Proia
Arsenio Leo
Marco Brambilla
Shiming Xiang
29
2
0
03 Aug 2024
Motamot: A Dataset for Revealing the Supremacy of Large Language Models over Transformer Models in Bengali Political Sentiment Analysis
Fatema Tuj Johora Faria
Mukaffi Bin Moin
Rabeya Islam Mumu
Md Mahabubul Alam Abir
Abrar Nawar Alfy
Mohammad Shafiul Alam
27
0
0
28 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
55
10
0
26 Jul 2024
Prompt Selection Matters: Enhancing Text Annotations for Social Sciences with Large Language Models
Louis Abraham
Charles Arnal
Antoine Marie
65
1
0
15 Jul 2024
Self-Evolving GPT: A Lifelong Autonomous Experiential Learner
Jinglong Gao
Xiao Ding
Yiming Cui
Jianbai Zhao
Hepeng Wang
Ting Liu
Bing Qin
KELM
CLL
41
3
0
12 Jul 2024