ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.15408
  4. Cited By
Towards Revealing the Mystery behind Chain of Thought: A Theoretical
  Perspective

Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective

24 May 2023
Guhao Feng
Bohang Zhang
Yuntian Gu
Haotian Ye
Di He
Liwei Wang
    LRM
ArXivPDFHTML

Papers citing "Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective"

50 / 166 papers shown
Title
Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures
Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures
Fu-Chieh Chang
Pei-Yuan Wu
Pei-Yuan Wu
LRM
112
1
0
25 Nov 2024
Reducing Reasoning Costs: The Path of Optimization for Chain of Thought via Sparse Attention Mechanism
Reducing Reasoning Costs: The Path of Optimization for Chain of Thought via Sparse Attention Mechanism
Libo Wang
LRM
AI4CE
48
3
0
14 Nov 2024
Quantifying artificial intelligence through algebraic generalization
Quantifying artificial intelligence through algebraic generalization
Takuya Ito
Murray Campbell
L. Horesh
Tim Klinger
Parikshit Ram
ELM
53
0
0
08 Nov 2024
Number Cookbook: Number Understanding of Language Models and How to Improve It
Number Cookbook: Number Understanding of Language Models and How to Improve It
Haotong Yang
Yi Hu
Shijia Kang
Zhouchen Lin
Muhan Zhang
LRM
46
2
0
06 Nov 2024
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Md Rifat Arefin
G. Subbaraj
Nicolas Angelard-Gontier
Yann LeCun
Irina Rish
Ravid Shwartz-Ziv
C. Pal
LRM
181
0
0
04 Nov 2024
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
Fu-Chieh Chang
Yu-Ting Lee
Hui-Ying Shih
Pei-Yuan Wu
Pei-Yuan Wu
OffRL
LRM
189
0
0
31 Oct 2024
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and
  Prompt Types
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
Yutao Mou
Shikun Zhang
Wei Ye
ELM
40
9
0
29 Oct 2024
Counting Ability of Large Language Models and Impact of Tokenization
Counting Ability of Large Language Models and Impact of Tokenization
Xiang Zhang
Juntai Cao
Chenyu You
LRM
40
5
0
25 Oct 2024
Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via
  Plan Augmentation
Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Yuli Qiu
Jiashu Yao
Heyan Huang
Yuhang Guo
LRM
29
0
0
22 Oct 2024
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and
  Error-Aware Demonstration
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
Yingqian Cui
Pengfei He
Xianfeng Tang
Qi He
Chen Luo
Jiliang Tang
Yue Xing
LRM
39
6
0
21 Oct 2024
Supervised Chain of Thought
Supervised Chain of Thought
Xiang Zhang
Dujian Ding
LRM
AI4CE
31
1
0
18 Oct 2024
How Numerical Precision Affects Mathematical Reasoning Capabilities of
  LLMs
How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs
Guhao Feng
Kai-Bo Yang
Yuntian Gu
Xinyue Ai
Shengjie Luo
Jiacheng Sun
Di He
Zechao Li
Liwei Wang
LRM
37
6
0
17 Oct 2024
A Theoretical Survey on Foundation Models
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
28
0
0
15 Oct 2024
Learning Linear Attention in Polynomial Time
Learning Linear Attention in Polynomial Time
Morris Yau
Ekin Akyürek
Jiayuan Mao
Joshua B. Tenenbaum
Stefanie Jegelka
Jacob Andreas
19
2
0
14 Oct 2024
Low-Dimension-to-High-Dimension Generalization And Its Implications for
  Length Generalization
Low-Dimension-to-High-Dimension Generalization And Its Implications for Length Generalization
Yang Chen
Yitao Liang
Zhouchen Lin
40
1
0
11 Oct 2024
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via
  Large Language Model Prompting
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting
Xukai Liu
Ye Liu
Kai Zhang
Kehang Wang
Qi Liu
Enhong Chen
45
2
0
10 Oct 2024
Can Transformers Reason Logically? A Study in SAT Solving
Can Transformers Reason Logically? A Study in SAT Solving
Leyan Pan
Vijay Ganesh
Jacob Abernethy
Chris Esposo
Wenke Lee
ReLM
LRM
33
0
0
09 Oct 2024
O1 Replication Journey: A Strategic Progress Report -- Part 1
O1 Replication Journey: A Strategic Progress Report -- Part 1
Yiwei Qin
Xuefeng Li
Haoyang Zou
Yixiu Liu
Shijie Xia
...
Yixin Ye
Weizhe Yuan
Hector Liu
Yuan Li
Pengfei Liu
VLM
48
71
0
08 Oct 2024
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen
Huaqing Zhang
Hongzhou Lin
Jingzhao Zhang
MoE
LRM
61
2
0
07 Oct 2024
Multi-Step Time Series Inference Agent for Reasoning and Automated Task Execution
Multi-Step Time Series Inference Agent for Reasoning and Automated Task Execution
Wen Ye
Yizhou Zhang
Wei Yang
Lumingyuan Tang
Defu Cao
Jie Cai
Yan Liu
BDL
CoGe
AI4TS
41
2
0
05 Oct 2024
Can Mamba Always Enjoy the "Free Lunch"?
Can Mamba Always Enjoy the "Free Lunch"?
Ruifeng Ren
Zhicong Li
Yong Liu
44
1
0
04 Oct 2024
SELU: Self-Learning Embodied MLLMs in Unknown Environments
SELU: Self-Learning Embodied MLLMs in Unknown Environments
Boyu Li
Haobin Jiang
Ziluo Ding
Xinrun Xu
Haoran Li
Dongbin Zhao
Zongqing Lu
LRM
55
2
0
04 Oct 2024
Autoregressive Large Language Models are Computationally Universal
Autoregressive Large Language Models are Computationally Universal
Dale Schuurmans
Hanjun Dai
Francesco Zanini
38
2
0
04 Oct 2024
How Much Can RAG Help the Reasoning of LLM?
How Much Can RAG Help the Reasoning of LLM?
Jingyu Liu
Jiaen Lin
Yong Liu
LRM
36
9
0
03 Oct 2024
Training Nonlinear Transformers for Chain-of-Thought Inference: A
  Theoretical Generalization Analysis
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
Hongkang Li
Meng Wang
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
LRM
35
5
0
03 Oct 2024
On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding
On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding
Kevin Xu
Issei Sato
39
3
0
02 Oct 2024
Investigating the Impact of Model Complexity in Large Language Models
Investigating the Impact of Model Complexity in Large Language Models
Jing Luo
Huiyuan Wang
Weiran Huang
39
0
0
01 Oct 2024
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Xiaosong Yuan
Chen Shen
Shaotian Yan
Xiaofeng Zhang
Liang Xie
Wenxiao Wang
Renchu Guan
Ying Wang
Jieping Ye
ReLM
LRM
57
4
0
30 Sep 2024
In-Situ Mode: Generative AI-Driven Characters Transforming Art
  Engagement Through Anthropomorphic Narratives
In-Situ Mode: Generative AI-Driven Characters Transforming Art Engagement Through Anthropomorphic Narratives
Yongming Li
Hao Zhang
Andrea Yaoyun Cui
Z. Ma
Yunpeng Song
Zhongmin Cai
Yun Huang
31
0
0
24 Sep 2024
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
Yunfei Xie
Juncheng Wu
Haoqin Tu
Siwei Yang
Bingchen Zhao
Yongshuo Zong
Qiao Jin
Cihang Xie
Yuyin Zhou
LM&MA
ELM
LRM
49
19
0
23 Sep 2024
System 2 thinking in OpenAI's o1-preview model: Near-perfect performance
  on a mathematics exam
System 2 thinking in OpenAI's o1-preview model: Near-perfect performance on a mathematics exam
J. D. Winter
Dimitra Dodou
Y. B. Eisma
VLM
ELM
LRM
ReLM
27
11
0
19 Sep 2024
Watch Your Steps: Observable and Modular Chains of Thought
Watch Your Steps: Observable and Modular Chains of Thought
Cassandra A. Cohen
William W. Cohen
LRM
36
1
0
17 Sep 2024
Enhancing Sequential Recommendations through Multi-Perspective
  Reflections and Iteration
Enhancing Sequential Recommendations through Multi-Perspective Reflections and Iteration
Weicong Qin
Yi Xu
Weijie Yu
Chenglei Shen
Xiao Zhang
Ming He
Jianping Fan
Jun Xu
16
1
0
10 Sep 2024
DiPT: Enhancing LLM reasoning through diversified perspective-taking
DiPT: Enhancing LLM reasoning through diversified perspective-taking
H. Just
Mahavir Dabas
Lifu Huang
Ming Jin
Ruoxi Jia
LRM
45
1
0
10 Sep 2024
Conversational Complexity for Assessing Risk in Large Language Models
Conversational Complexity for Assessing Risk in Large Language Models
John Burden
Manuel Cebrian
José Hernández-Orallo
48
0
0
02 Sep 2024
Critic-CoT: Boosting the reasoning abilities of large language model via
  Chain-of-thoughts Critic
Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic
Xin Zheng
Jie Lou
Boxi Cao
Xueru Wen
Yuqiu Ji
Hongyu Lin
Yunfan LU
Xianpei Han
Debing Zhang
Le Sun
LLMAG
OffRL
LRM
ReLM
KELM
49
13
1
29 Aug 2024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for
  Reinforcement Learning and Monte-Carlo Tree Search
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Huajian Xin
Z. Z. Ren
Junxiao Song
Zhihong Shao
Wanjia Zhao
...
Dejian Yang
Zhibin Gou
Z. F. Wu
Fuli Luo
Chong Ruan
AIMat
LRM
50
54
0
15 Aug 2024
Unveiling Factual Recall Behaviors of Large Language Models through
  Knowledge Neurons
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons
Yifei Wang
Yuheng Chen
Wanting Wen
Yu Sheng
Linjing Li
D. Zeng
KELM
47
5
0
06 Aug 2024
When Can Transformers Count to n?
When Can Transformers Count to n?
Gilad Yehudai
Haim Kaplan
Asma Ghandeharioun
Mor Geva
Amir Globerson
39
12
0
21 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
62
12
0
17 Jul 2024
Representing Rule-based Chatbots with Transformers
Representing Rule-based Chatbots with Transformers
Dan Friedman
Abhishek Panigrahi
Danqi Chen
71
1
0
15 Jul 2024
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought:
  Probability, Memorization, and Noisy Reasoning
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
Akshara Prabhakar
Thomas L. Griffiths
R. Thomas McCoy
LRM
44
17
0
01 Jul 2024
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
Jianheng Tang
Qifan Zhang
Yuhan Li
Nuo Chen
Jia Li
21
1
0
29 Jun 2024
Large Language Models have Intrinsic Self-Correction Ability
Large Language Models have Intrinsic Self-Correction Ability
Dancheng Liu
Amir Nassereldine
Ziming Yang
Chenhui Xu
Yuting Hu
Jiajie Li
Utkarsh Kumar
Changjae Lee
Jinjun Xiong
KELM
ReLM
LRM
36
11
0
21 Jun 2024
Cognitive Map for Language Models: Optimal Planning via Verbally
  Representing the World Model
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model
Doyoung Kim
Jongwon Lee
Jinho Park
Minjoon Seo
LM&Ro
44
0
0
21 Jun 2024
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference
Anton Xue
Avishree Khare
Rajeev Alur
Surbhi Goel
Eric Wong
61
2
0
21 Jun 2024
On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning
On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning
Franz Nowak
Anej Svete
Alexandra Butoi
Ryan Cotterell
ReLM
LRM
54
13
0
20 Jun 2024
AutoCAP: Towards Automatic Cross-lingual Alignment Planning for
  Zero-shot Chain-of-Thought
AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought
Yongheng Zhang
Qiguang Chen
Min Li
Wanxiang Che
Libo Qin
LRM
46
5
0
20 Jun 2024
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages
Nadav Borenstein
Anej Svete
R. Chan
Josef Valvoda
Franz Nowak
Isabelle Augenstein
Eleanor Chodroff
Ryan Cotterell
42
12
0
06 Jun 2024
On Limitation of Transformer for Learning HMMs
On Limitation of Transformer for Learning HMMs
Jiachen Hu
Qinghua Liu
Chi Jin
50
3
0
06 Jun 2024
Previous
1234
Next