Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.11903
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
28 January 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models"
50 / 6,022 papers shown
Title
Beyond path selection: Better LLMs for Scientific Information Extraction with MimicSFT and Relevance and Rule-induced(R
2
^2
2
)GRPO
Ran Li
Shimin Di
Yuchen Liu
Chen Jing
Yu Qiu
Lei Chen
LRM
56
0
0
28 May 2025
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective
Qingchuan Ma
Yuhang Wu
Xiawu Zheng
Rongrong Ji
29
0
0
28 May 2025
From Motion to Behavior: Hierarchical Modeling of Humanoid Generative Behavior Control
Jusheng Zhang
Jinzhou Tang
Sidi Liu
Mingyan Li
Sheng Zhang
Jian Wang
Keze Wang
15
0
0
28 May 2025
UI-Evol: Automatic Knowledge Evolving for Computer Use Agents
Ziyun Zhang
Xinyi Liu
Xiaoyi Zhang
Jun Wang
Gang Chen
Yan Lu
LLMAG
110
0
0
28 May 2025
Visual Large Language Models Exhibit Human-Level Cognitive Flexibility in the Wisconsin Card Sorting Test
Guangfu Hao
Frederic Alexandre
S. Yu
LRM
23
0
0
28 May 2025
Rethinking the Unsolvable: When In-Context Search Meets Test-Time Scaling
Fanzeng Xia
Yidong Luo
Tinko Sebastian Bartels
Yaqi Xu
Tongxin Li
ReLM
LRM
84
0
0
28 May 2025
From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications
Feibo Jiang
Cunhua Pan
Li Dong
Kezhi Wang
O. Dobre
Mérouane Debbah
LLMAG
AI4TS
172
1
0
28 May 2025
Advancing Expert Specialization for Better MoE
Hongcan Guo
Haolang Lu
Guoshun Nan
Bolun Chu
Jialin Zhuang
Yuan Yang
Wenhao Che
Sicong Leng
Qimei Cui
Xudong Jiang
MoE
MoMe
81
0
0
28 May 2025
Sherlock: Self-Correcting Reasoning in Vision-Language Models
Yi Ding
Ruqi Zhang
ReLM
LRM
VLM
104
0
0
28 May 2025
Scaling Reasoning without Attention
Xueliang Zhao
Wei Wu
Lingpeng Kong
OffRL
ReLM
LRM
VLM
67
0
0
28 May 2025
Scaling Offline RL via Efficient and Expressive Shortcut Models
Nicolas Espinosa-Dice
Yiyi Zhang
Yiding Chen
Bradley Guo
Owen Oertell
Gokul Swamy
Kianté Brantley
Wen Sun
OffRL
LRM
67
0
0
28 May 2025
On Learning Verifiers for Chain-of-Thought Reasoning
Maria-Florina Balcan
Avrim Blum
Zhiyuan Li
Dravyansh Sharma
LRM
37
0
0
28 May 2025
Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Hanting Chen
Yasheng Wang
Kai Han
Dong Li
Lin Li
...
Hailin Hu
Yehui Tang
Dacheng Tao
Xinghao Chen
Yunhe Wang
LRM
91
0
0
28 May 2025
Pretraining Language Models to Ponder in Continuous Space
Boyi Zeng
Shixiang Song
Siyuan Huang
Yixuan Wang
He Li
Ziwei He
Xinbing Wang
Zhiyu Li
Zhouhan Lin
LRM
81
0
0
27 May 2025
AITEE -- Agentic Tutor for Electrical Engineering
Christopher Knievel
Alexander Bernhardt
Christian Bernhardt
18
0
0
27 May 2025
Born a Transformer -- Always a Transformer?
Yana Veitsman
Mayank Jobanputra
Yash Sarrof
Aleksandra Bakalova
Vera Demberg
Ellie Pavlick
Michael Hahn
47
0
0
27 May 2025
Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models
Sohyun An
Ruochen Wang
Tianyi Zhou
Cho-Jui Hsieh
KELM
LRM
94
1
0
27 May 2025
ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools
Zhucong Li
Bowei Zhang
Jin Xiao
Zhijian Zhou
Fenglei Cao
Jiaqing Liang
Yuan Qi
35
0
0
27 May 2025
Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Mingyang Song
Mao Zheng
OffRL
LRM
89
1
0
27 May 2025
Generalizable Heuristic Generation Through Large Language Models with Meta-Optimization
Yiding Shi
Jianan Zhou
Wen Song
Jieyi Bi
Yaoxin Wu
Jie Zhang
68
0
0
27 May 2025
Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Shintaro Ozaki
Tatsuya Hiraoka
Hiroto Otake
Hiroki Ouchi
Masaru Isonuma
...
Kentaro Inui
Taro Watanabe
Yusuke Miyao
Yohei Oseki
Yu Takagi
LRM
52
0
0
27 May 2025
Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations
Hao Li
He Cao
Bin Feng
Yanjun Shao
Xiangru Tang
Zhiyuan Yan
Li Yuan
Yonghong Tian
Yu-Feng Li
LRM
ELM
63
0
0
27 May 2025
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLM
OffRL
LRM
68
2
0
27 May 2025
Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence
Mehdi Bennis
Salem Lahlou
61
1
0
27 May 2025
Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models
Yufei Zhan
Hongyin Zhao
Yousong Zhu
Shurong Zheng
Fan Yang
Ming Tang
Jinqiao Wang
VLM
LRM
51
0
0
27 May 2025
Large Language Model-enhanced Reinforcement Learning for Low-Altitude Economy Networking
Lingyi Cai
Ruichen Zhang
Changyuan Zhao
Yu Zhang
Jiawen Kang
Dusit Niyato
Tao Jiang
X. Shen
OffRL
52
1
0
27 May 2025
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Sibo Xiao
Zixin Lin
Wenyang Gao
Yue Zhang
LLMAG
55
0
0
27 May 2025
AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs
Xuanwen Ding
Chengjun Pan
Zejun Li
Jiwen Zhang
Siyuan Wang
Zhongyu Wei
54
0
0
27 May 2025
RRO: LLM Agent Optimization Through Rising Reward Trajectories
Zilong Wang
Jingfeng Yang
Sreyashi Nag
Samarth Varshney
Xianfeng Tang
Haoming Jiang
Jingbo Shang
Sheikh Sarwar
LRM
37
0
0
27 May 2025
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
Yuhao Wang
Ruiyang Ren
Yucheng Wang
Wayne Xin Zhao
Jing Liu
Hua Wu
Haifeng Wang
RALM
OffRL
82
0
0
27 May 2025
LLMs Think, But Not In Your Flow: Reasoning-Level Personalization for Black-Box Large Language Models
Jieyong Kim
Tongyoung Kim
Soojin Yoon
Jaehyung Kim
Dongha Lee
LRM
57
0
0
27 May 2025
RefTool: Enhancing Model Reasoning with Reference-Guided Tool Creation
Xiao-Yang Liu
Da Yin
Zirui Wu
Yansong Feng
KELM
LRM
60
0
0
27 May 2025
Simulating the Unseen: Crash Prediction Must Learn from What Did Not Happen
Zihao Li
Xinyuan Cao
Xiangbo Gao
Kexin Tian
Keshu Wu
...
Yunlong Zhang
Tianbao Yang
Dominique Lord
Zhengzhong Tu
Yang Zhou
70
0
0
27 May 2025
CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models
Yi Zhan
Qi Liu
Weibo Gao
Zheng Zhang
Tianfu Wang
Shuanghong Shen
Junyu Lu
Zhenya Huang
LLMAG
AI4CE
92
0
0
27 May 2025
Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies
Félix Chalumeau
Daniel Rajaonarivonivelomanantsoa
Ruan de Kock
Claude Formanek
Sasha Abramowitz
...
Refiloe Shabe
Arnol Fokam
Siddarth S. Singh
Ulrich A. Mbou Sob
Arnu Pretorius
52
0
0
27 May 2025
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
55
0
0
27 May 2025
FCKT: Fine-Grained Cross-Task Knowledge Transfer with Semantic Contrastive Learning for Targeted Sentiment Analysis
Wei Chen
Zhao Zhang
Meng Yuan
K. Xu
Fuzhen Zhuang
237
0
0
27 May 2025
Pause Tokens Strictly Increase the Expressivity of Constant-Depth Transformers
Charles London
Varun Kanade
48
0
0
27 May 2025
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
Kianté Brantley
Mingyu Chen
Zhaolin Gao
Jason D. Lee
Wen Sun
Wenhao Zhan
Xuezhou Zhang
OffRL
LRM
75
1
0
27 May 2025
Beyond Templates: Dynamic Adaptation of Reasoning Demonstrations via Feasibility-Aware Exploration
Yong Wu
Weihang Pan
Ke Li
Chen Binhui
Ping Li
Binbin Lin
LRM
68
0
0
27 May 2025
Diagnosing and Resolving Cloud Platform Instability with Multi-modal RAG LLMs
Yifan Wang
Kenneth P. Birman
92
0
0
27 May 2025
Counterfactual Simulatability of LLM Explanations for Generation Tasks
Marvin Limpijankit
Yanda Chen
Melanie Subbiah
Nicholas Deas
Kathleen McKeown
LRM
9
0
0
27 May 2025
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
Zikang Guo
Benfeng Xu
Xiaorui Wang
Zhendong Mao
65
0
0
27 May 2025
System Prompt Extraction Attacks and Defenses in Large Language Models
B. Das
M. H. Amini
Yanzhao Wu
AAML
17
0
0
27 May 2025
FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement
Bingguang Hao
Maolin Wang
Zengzhuang Xu
Cunyin Peng
Yicheng Chen
Xiangyu Zhao
Jinjie Gu
Chenyi Zhuang
ReLM
LRM
104
0
0
26 May 2025
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects
Yixin Cui
Haotian Lin
Shuo Yang
Yixiao Wang
Yanjun Huang
Hong Chen
LM&Ro
LRM
ELM
108
0
0
26 May 2025
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Peijie Dong
Zhenheng Tang
Xiang Liu
Lujun Li
Xiaowen Chu
Bo Li
103
0
0
26 May 2025
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
Yandong Guan
Xilin Wang
Xingxi Ming
Jing Zhang
Dong Xu
Qian Yu
3DV
LRM
22
0
0
26 May 2025
The Coverage Principle: A Framework for Understanding Compositional Generalization
Hoyeon Chang
Jinho Park
Hanseul Cho
Sohee Yang
Miyoung Ko
Hyeonbin Hwang
Seungpil Won
Dohaeng Lee
Youbin Ahn
Minjoon Seo
51
0
0
26 May 2025
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving
Xueyi Liu
Zuodong Zhong
Yuxin Guo
Yun-Fu Liu
Zhiguo Su
...
Yinfeng Gao
Yupeng Zheng
Qiao Lin
Huiyong Chen
Dongbin Zhao
LRM
53
0
0
26 May 2025
Previous
1
2
3
...
6
7
8
...
119
120
121
Next