ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.02167
  4. Cited By
Training Nonlinear Transformers for Chain-of-Thought Inference: A
  Theoretical Generalization Analysis

Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis

3 October 2024
Hongkang Li
Meng Wang
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
    LRM
ArXivPDFHTML

Papers citing "Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis"

3 / 3 papers shown
Title
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
Ruiquan Huang
Yingbin Liang
Jing Yang
46
0
0
02 May 2025
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
Hongkang Li
Yihua Zhang
Shuai Zhang
Hao Wu
Sijia Liu
Pin-Yu Chen
MoMe
69
3
0
15 Apr 2025
Enhancing Graph Transformers with Hierarchical Distance Structural
  Encoding
Enhancing Graph Transformers with Hierarchical Distance Structural Encoding
Yuan Luo
Hongkang Li
Lei Shi
Xiao-Ming Wu
28
7
0
22 Aug 2023
1