Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.11903
Cited By
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
28 January 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models"
50 / 1,334 papers shown
Title
Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment
Paul Tschisgale
Holger Maus
Fabian Kieser
Ben Kroehs
Stefan Petersen
Peter Wulff
ELM
LRM
28
0
0
14 May 2025
Guiding LLM-based Smart Contract Generation with Finite State Machine
Hao Luo
Yuhao Lin
Xiao Yan
Xintong Hu
Y. Wang
Qiming Zeng
Hao Wang
Jiawei Jiang
31
0
0
13 May 2025
CodePDE: An Inference Framework for LLM-driven PDE Solver Generation
Shanda Li
Tanya Marwah
Junhong Shen
W. Sun
Andrej Risteski
Yiming Yang
Ameet Talwalkar
AI4CE
34
0
0
13 May 2025
TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers
Aiyao He
Sijia Cui
Shuai Xu
Yanna Wang
Bo Xu
34
0
0
13 May 2025
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Zhaochen Su
Linjie Li
Mingyang Song
Yunzhuo Hao
Zhengyuan Yang
...
Guanjie Chen
Jiawei Gu
Juntao Li
Xiaoye Qu
Yu Cheng
OffRL
LRM
25
0
0
13 May 2025
LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs
K. M. S. Islam
A. S. Nipu
Jiawei Wu
Praveen Madiraju
36
0
0
13 May 2025
Internet of Agents: Fundamentals, Applications, and Challenges
Yuntao Wang
Shaolong Guo
Yanghe Pan
Zhou Su
Fahao Chen
Tom H. Luan
Peng Li
Jiawen Kang
Dusit Niyato
LLMAG
LM&Ro
AI4CE
55
0
0
12 May 2025
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning
Zhehao Zhang
Weijie Xu
Fanyou Wu
Chandan K. Reddy
29
0
0
12 May 2025
Assessing the Chemical Intelligence of Large Language Models
Nicholas T. Runcie
Charlotte M. Deane
Fergus Imrie
ELM
LRM
28
0
0
12 May 2025
LLM-Augmented Chemical Synthesis and Design Decision Programs
Haorui Wang
Jeff Guo
Lingkai Kong
R. Ramprasad
Philippe Schwaller
Yuanqi Du
Chao Zhang
26
0
0
11 May 2025
Towards Human-Centric Autonomous Driving: A Fast-Slow Architecture Integrating Large Language Model Guidance with Reinforcement Learning
Chengkai Xu
Jiaqi Liu
Yicheng Guo
Y. Zhang
Peng Hang
Jian-jun Sun
26
0
0
11 May 2025
Evolutionary thoughts: integration of large language models and evolutionary algorithms
Antonio Jimeno Yepes
Pieter Barnard
29
0
0
09 May 2025
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
Qingwen Bu
Y. Yang
Jisong Cai
Shenyuan Gao
Guanghui Ren
Maoqing Yao
Ping Luo
Hongyang Li
84
0
0
09 May 2025
Harnessing LLMs Explanations to Boost Surrogate Models in Tabular Data Classification
Ruxue Shi
Hengrui Gu
Xu Shen
Xin Wang
LMTD
132
0
0
09 May 2025
Large Language Model-driven Security Assistant for Internet of Things via Chain-of-Thought
Mingfei Zeng
Ming Xie
Xixi Zheng
Chunhai Li
Chuan Zhang
Liehuang Zhu
29
0
0
08 May 2025
Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models
John Hawkins
ReLM
LRM
48
0
0
08 May 2025
Crosslingual Reasoning through Test-Time Scaling
Zheng-Xin Yong
Muhammad Farid Adilazuarda
Jonibek Mansurov
Ruochen Zhang
Niklas Muennighoff
Carsten Eickhoff
Genta Indra Winata
Julia Kreutzer
Stephen H. Bach
Alham Fikri Aji
LRM
ELM
116
0
0
08 May 2025
Rethinking Invariance in In-context Learning
Lizhe Fang
Yifei Wang
Khashayar Gatmiry
Lei Fang
Y. Wang
46
2
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
46
0
0
08 May 2025
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu
Hanze Dong
Lei Wang
Doyen Sahoo
Junnan Li
Caiming Xiong
OffRL
LRM
49
1
0
08 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
46
0
0
07 May 2025
Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
Chetan Pathade
AAML
SILM
54
0
0
07 May 2025
Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving
Qi Liu
Xinhao Zheng
Renqiu Xia
Xingzhi Qi
Qinxiang Cao
Junchi Yan
AIMat
45
0
0
07 May 2025
Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters
David Exler
Mark Schutera
Markus Reischl
Luca Rettenberger
54
0
0
07 May 2025
Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models
Vihaan Miriyala
Smrithi Bukkapatnam
Lavanya Prahallad
LRM
26
0
0
07 May 2025
MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection
Zhihao Zhang
Abhinav Kumar
Girish Chandar Ganesan
Xiaoming Liu
122
0
0
07 May 2025
An LLM-based Self-Evolving Security Framework for 6G Space-Air-Ground Integrated Networks
Qi Qin
Xinye Cao
Guoshun Nan
Sihan Chen
Rushan Li
...
Haitao Du
Qimei Cui
Pengxuan Mao
Xiaofeng Tao
Tony Q. S. Quek
27
0
0
06 May 2025
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
Jake Grigsby
Yuke Zhu
Michael S Ryoo
Juan Carlos Niebles
OffRL
VLM
36
0
0
06 May 2025
Graph Drawing for LLMs: An Empirical Evaluation
Walter Didimo
Fabrizio Montecchiani
Tommaso Piselli
42
0
0
06 May 2025
Interpretable Zero-shot Learning with Infinite Class Concepts
Zihan Ye
Shreyank N Gowda
Shiming Chen
Yaochu Jin
Kaizhu Huang
Xiaobo Jin
VLM
35
0
0
06 May 2025
Soft Best-of-n Sampling for Model Alignment
C. M. Verdun
Alex Oesterling
Himabindu Lakkaraju
Flavio du Pin Calmon
BDL
121
0
0
06 May 2025
BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models
Z. Wang
Hongwei Li
Rui Zhang
Wenbo Jiang
Kangjie Chen
Tianwei Zhang
Qingchuan Zhao
Guowen Xu
AAML
41
0
0
06 May 2025
LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Xinyuan Zhang
Yonglin Tian
Fei Lin
Yue Liu
Jing Ma
Kornélia Sára Szatmáry
Fei Wang
45
0
0
06 May 2025
SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation
Yu-Ren Guo
Wen-Kai Tai
45
0
0
06 May 2025
DYSTIL: Dynamic Strategy Induction with Large Language Models for Reinforcement Learning
Borui Wang
Kathleen McKeown
Rex Ying
OffRL
34
0
0
06 May 2025
LogiDebrief: A Signal-Temporal Logic based Automated Debriefing Approach with Large Language Models Integration
Zirong Chen
Ziyan An
Jennifer Reynolds
Kristin Mullen
Stephen Martini
Meiyi Ma
29
0
0
06 May 2025
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
Runquan Gui
Z. Wang
J. Wang
Chi Ma
Huiling Zhen
M. Yuan
Jianye Hao
Defu Lian
Enhong Chen
Feng Wu
LRM
104
0
0
05 May 2025
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models
Zhouliang Yu
Ruotian Peng
Keyi Ding
Y. K. Li
Zhongyuan Peng
...
Huajian Xin
W. R. Huang
Yandong Wen
Ge Zhang
Weiyang Liu
LRM
96
0
0
05 May 2025
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
Tianjian Li
Daniel Khashabi
55
0
0
05 May 2025
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Yemin Shi
Yu Shu
Siwei Dong
Guangyi Liu
Jaward Sesay
Jingwen Li
Zhiting Hu
AuLLM
VLM
45
0
0
05 May 2025
Recursive Decomposition with Dependencies for Generic Divide-and-Conquer Reasoning
Sergio Hernández-Gutiérrez
Minttu Alakuijala
Alexander Nikitin
Pekka Marttinen
LRM
52
2
0
05 May 2025
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Jiarui Yao
Yifan Hao
Hanning Zhang
Hanze Dong
Wei Xiong
Nan Jiang
Tong Zhang
LRM
57
0
0
05 May 2025
Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning
Changxin Huang
Junyang Liang
Yanbin Chang
Jingzhao Xu
Jianqiang Li
32
0
0
05 May 2025
34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery
Yoel Zimmermann
Adib Bazgir
Alexander H Al-Feghali
Mehrad Ansari
L. C. Brinson
...
Shang Zhu
Jan Janssen
Calvin Li
Ian T. Foster
B. Blaiszik
45
0
0
05 May 2025
BLAB: Brutally Long Audio Bench
Orevaoghene Ahia
Martijn Bartelds
Kabir Ahuja
Hila Gonen
Valentin Hofmann
...
Noah Bennett
Shinji Watanabe
Noah A. Smith
Yulia Tsvetkov
Sachin Kumar
AuLLM
LM&MA
VLM
53
0
0
05 May 2025
Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs
Sai Krishna Mendu
Harish Yenala
Aditi Gulati
Shanu Kumar
Parag Agrawal
29
0
0
04 May 2025
R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation
Meng-Hao Guo
Jiajun Xu
Yi Zhang
Jiaxi Song
Haoyang Peng
...
Yongming Rao
Houwen Peng
Han Hu
Gordon Wetzstein
Shi-Min Hu
ELM
LRM
57
0
0
04 May 2025
Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents
Christian Schroeder de Witt
AAML
AI4CE
120
0
0
04 May 2025
LLM-OptiRA: LLM-Driven Optimization of Resource Allocation for Non-Convex Problems in Wireless Communications
Xinyue Peng
Yanming Liu
Yihan Cang
Chaoqun Cao
Ming Chen
47
0
0
04 May 2025
Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use
Justin Ho
Alexandra Colby
William Fisher
AILaw
39
0
0
04 May 2025
1
2
3
4
...
25
26
27
Next