Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.11903
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
28 January 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models"
50 / 6,020 papers shown
Title
Vision-EKIPL: External Knowledge-Infused Policy Learning for Visual Reasoning
Chaoyang Wang
Zeyu Zhang
Haiyun Jiang
OffRL
LRM
13
0
0
07 Jun 2025
RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints
Tan-Hanh Pham
Chris Ngo
OffRL
LRM
9
0
0
07 Jun 2025
Detecting Voice Phishing with Precision: Fine-Tuning Small Language Models
Ju Yong Sim
Seong Hwan Kim
40
0
0
06 Jun 2025
Enhancing Robot Safety via MLLM-Based Semantic Interpretation of Failure Data
Aryaman Gupta
Yusuf Umut Ciftci
Somil Bansal
21
0
0
06 Jun 2025
BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions
Saptarshi Sengupta
Shuhua Yang
Paul Kwong Yu
Fali Wang
Suhang Wang
32
0
0
06 Jun 2025
PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts
Hengzhi Li
Brendon Jiang
Alexander Naehu
Regan Song
Justin Zhang
...
Steven-Shine Chen
Adithya Balachandran
Wei Dai
Rebecca Chang
Paul Pu Liang
ReLM
LRM
54
0
0
06 Jun 2025
Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router
Chenyang Shao
Xinyang Liu
Yutang Lin
Fengli Xu
Yong Li
MoE
LRM
61
0
0
06 Jun 2025
PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems
Yi Huang
Wajih UI Hassan
Yao Guo
Xiangqun Chen
Ding Li
47
0
0
06 Jun 2025
HMVLM: Multistage Reasoning-Enhanced Vision-Language Model for Long-Tailed Driving Scenarios
Daming Wang
Yuhao Song
Zijian He
Kangliang Chen
Xing Pan
Lu Deng
Weihao Gu
VLM
LRM
85
0
0
06 Jun 2025
Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models
Peijie Liu
Fengli Xu
Yong Li
LRM
43
0
0
06 Jun 2025
ProRefine: Inference-time Prompt Refinement with Textual Feedback
Deepak Pandita
Tharindu Cyril Weerasooriya
A. Shah
Christopher Homan
Wei Wei
LLMAG
ReLM
LRM
134
0
0
05 Jun 2025
Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study
Yujun Zhou
Jiayi Ye
Zipeng Ling
Yufei Han
Yue Huang
...
Zhenwen Liang
Kehan Guo
Taicheng Guo
Xiangqi Wang
Xiangliang Zhang
ReLM
LRM
105
1
0
05 Jun 2025
ScaleRTL: Scaling LLMs with Reasoning Data and Test-Time Compute for Accurate RTL Code Generation
Chenhui Deng
Yun-Da Tsai
Guan-Ting Liu
Zhongzhi Yu
Haoxing Ren
LLMAG
LRM
28
1
0
05 Jun 2025
LLM-First Search: Self-Guided Exploration of the Solution Space
Nathan Herr
Tim Rocktaschel
Roberta Raileanu
LRM
139
0
0
05 Jun 2025
TreeRPO: Tree Relative Policy Optimization
Zhicheng YANG
Zhijiang Guo
Yinya Huang
Xiaodan Liang
Yiwei Wang
Jing Tang
LRM
77
0
0
05 Jun 2025
A Reasoning-Based Approach to Cryptic Crossword Clue Solving
Martin Andrews
Sam Witteveen
ReLM
ELM
LRM
94
0
0
05 Jun 2025
On the Mechanism of Reasoning Pattern Selection in Reinforcement Learning for Language Models
Xingwu Chen
Tianle Li
Difan Zou
LRM
88
0
0
05 Jun 2025
Kinetics: Rethinking Test-Time Scaling Laws
Ranajoy Sadhukhan
Zhuoming Chen
Haizhong Zheng
Yang Zhou
Emma Strubell
Beidi Chen
101
0
0
05 Jun 2025
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
Lidong Lu
Guo Chen
Z. Li
Yicheng Liu
Tong Lu
VLM
LRM
93
0
0
05 Jun 2025
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning
Tanmay Parekh
Kartik Mehta
Ninareh Mehrabi
Kai-Wei Chang
Nanyun Peng
84
0
0
05 Jun 2025
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Baihe Huang
Shanda Li
Tianhao Wu
Yiming Yang
Ameet Talwalkar
Kannan Ramchandran
Michael I. Jordan
Jiantao Jiao
LRM
95
0
0
05 Jun 2025
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Zesheng Ye
C. Cai
Ruijiang Dong
Jianzhong Qi
Lei Feng
Pin-Yu Chen
Feng Liu
195
0
0
05 Jun 2025
macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Pei Yang
Hai Ci
Mike Zheng Shou
LLMAG
49
0
0
04 Jun 2025
Exchange of Perspective Prompting Enhances Reasoning in Large Language Models
Lin Sun
Can Zhang
LRM
56
0
0
04 Jun 2025
Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences
Hadi Hosseini
Samarth Khanna
Ronak Singh
LRM
42
0
0
04 Jun 2025
A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
Zhiyu Zhang
Wei Chen
Youfang Lin
Huaiyu Wan
OffRL
CLL
111
0
0
04 Jun 2025
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
75
0
0
04 Jun 2025
AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents
Akshat Naik
Patrick Quinn
Guillermo Bosch
Emma Gouné
Francisco Javier Campos Zabala
Jason Ross Brown
Edward James Young
59
0
0
04 Jun 2025
Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models
Soumya Suvra Ghosal
Souradip Chakraborty
Avinash Reddy
Yifu Lu
Mengdi Wang
Dinesh Manocha
Furong Huang
Mohammad Ghavamzadeh
Amrit Singh Bedi
ReLM
LRM
82
0
0
04 Jun 2025
A Statistical Physics of Language Model Reasoning
Jack David Carson
Amir Reisizadeh
LRM
AI4CE
73
0
0
04 Jun 2025
Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models
Ruiqi Zhang
Changyi Xiao
Yixin Cao
LRM
86
0
0
04 Jun 2025
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience
Runxiang Wang
Boxiao Wang
Kai Li
Yifan Zhang
Jian Cheng
20
0
0
04 Jun 2025
The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective
Jiin Kim
Byeongjun Shin
Jinha Chung
Minsoo Rhu
LLMAG
LRM
25
1
0
04 Jun 2025
Learning-at-Criticality in Large Language Models for Quantum Field Theory and Beyond
X-D Cai
Sihan Hu
Tao Wang
Yuan Huang
Pan Zhang
Youjin Deng
Kun Chen
LRM
63
0
0
04 Jun 2025
MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
Kejian Zhu
Zhuoran Jin
Hongbang Yuan
Jiachun Li
Shangqing Tu
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
VLM
LRM
72
0
0
04 Jun 2025
BEAR: BGP Event Analysis and Reporting
Hanqing Li
Melania Fedeli
Vinay Kolar
Diego Klabjan
64
0
0
04 Jun 2025
SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL
Yue Gong
Chuan Lei
X. Qin
Kapil Vaidya
Balakrishnan Narayanaswamy
Tim Kraska
19
0
0
04 Jun 2025
From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context
Yuanchen Bai
Zijian Ding
Angelique Taylor
56
0
0
04 Jun 2025
ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations
Quang Hieu Pham
T. Nguyen
Tung Pham
Anh Tuan Luu
Dat Quoc Nguyen
ReLM
LRM
131
0
0
04 Jun 2025
EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation
Jinghan Jia
Hadi Reisizadeh
Chongyu Fan
Nathalie Baracaldo
Mingyi Hong
Sijia Liu
LRM
126
0
0
04 Jun 2025
Enhancing Decision-Making of Large Language Models via Actor-Critic
Heng Dong
Kefei Duan
Chongjie Zhang
LLMAG
16
0
0
04 Jun 2025
SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling
Anhao Zhao
Fanghua Ye
Yingqi Fan
Junlong Tong
Zhiwei Fei
Hui Su
Xiaoyu Shen
61
0
0
04 Jun 2025
Multimodal Tabular Reasoning with Privileged Structured Information
Jun-Peng Jiang
Yu Xia
Hai-Long Sun
Shiyin Lu
Qing-Guo Chen
Weihua Luo
Kaifu Zhang
De-Chuan Zhan
Han-Jia Ye
LMTD
LRM
84
0
0
04 Jun 2025
Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions
Changmin Park
Beomjoon Lee
Haechan Jung
Haejin Jung
Changjoo Nam
LM&Ro
90
0
0
04 Jun 2025
TracLLM: A Generic Framework for Attributing Long Context LLMs
Yanting Wang
Wei Zou
Runpeng Geng
Jinyuan Jia
LLMAG
117
0
0
04 Jun 2025
TriPSS: A Tri-Modal Keyframe Extraction Framework Using Perceptual, Structural, and Semantic Representations
Mert Can Cakmak
Nitin Agarwal
Diwash Poudel
20
0
0
03 Jun 2025
Beyond Invisibility: Learning Robust Visible Watermarks for Stronger Copyright Protection
Tianci Liu
Tong Yang
Quan Zhang
Qi Lei
WIGM
AAML
37
0
0
03 Jun 2025
Comba: Improving Bilinear RNNs with Closed-loop Control
Jiaxi Hu
Yongqi Pan
Jusen Du
Disen Lan
Xiaqiang Tang
Qingsong Wen
Yuxuan Liang
Weigao Sun
63
0
0
03 Jun 2025
Why do AI agents communicate in human language?
Pengcheng Zhou
Yinglun Feng
Halimulati Julaiti
Zhongliang Yang
LLMAG
31
0
0
03 Jun 2025
ORPP: Self-Optimizing Role-playing Prompts to Enhance Language Model Capabilities
Yifan Duan
Yihong Tang
Kehai Chen
Liqiang Nie
Min Zhang
58
0
0
03 Jun 2025
Previous
1
2
3
4
5
...
119
120
121
Next