ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.18624
  4. Cited By
How to explain grokking
v1v2v3 (latest)

How to explain grokking

17 December 2024
S. V. Kozyrev
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "How to explain grokking"

5 / 5 papers shown
Title
Progress measures for grokking via mechanistic interpretability
Progress measures for grokking via mechanistic interpretability
Neel Nanda
Lawrence Chan
Tom Lieberum
Jess Smith
Jacob Steinhardt
117
451
0
12 Jan 2023
Towards Understanding Grokking: An Effective Theory of Representation
  Learning
Towards Understanding Grokking: An Effective Theory of Representation Learning
Ziming Liu
O. Kitouni
Niklas Nolte
Eric J. Michaud
Max Tegmark
Mike Williams
AI4CE
112
154
0
20 May 2022
Grokking: Generalization Beyond Overfitting on Small Algorithmic
  Datasets
Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets
Alethea Power
Yuri Burda
Harrison Edwards
Igor Babuschkin
Vedant Misra
130
366
0
06 Jan 2022
Loss landscapes and optimization in over-parameterized non-linear
  systems and neural networks
Loss landscapes and optimization in over-parameterized non-linear systems and neural networks
Chaoyue Liu
Libin Zhu
M. Belkin
ODL
142
266
0
29 Feb 2020
Almost-everywhere algorithmic stability and generalization error
Almost-everywhere algorithmic stability and generalization error
S. Kutin
P. Niyogi
114
173
0
12 Dec 2012
1