2412.18624
Cited By
v3 (latest)
How to explain grokking
17 December 2024
S. V. Kozyrev
AI4CE
Papers citing "How to explain grokking"
5 / 5 papers shown
Title
Progress measures for grokking via mechanistic interpretability
Neel Nanda
Lawrence Chan
Tom Lieberum
Jess Smith
Jacob Steinhardt
117
451
0
12 Jan 2023
Towards Understanding Grokking: An Effective Theory of Representation Learning
Ziming Liu
O. Kitouni
Niklas Nolte
Eric J. Michaud
Max Tegmark
Mike Williams
AI4CE
112
154
0
20 May 2022
Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets
Alethea Power
Yuri Burda
Harrison Edwards
Igor Babuschkin
Vedant Misra
130
366
0
06 Jan 2022
Loss landscapes and optimization in over-parameterized non-linear systems and neural networks
Chaoyue Liu
Libin Zhu
M. Belkin
ODL
142
266
0
29 Feb 2020
Almost-everywhere algorithmic stability and generalization error
S. Kutin
P. Niyogi
114
173
0
12 Dec 2012