How to explain grokking

v1v2v3 (latest)

How to explain grokking

17 December 2024

ArXiv (abs)PDF HTML

Papers citing "How to explain grokking"

5 / 5 papers shown

Title
Progress measures for grokking via mechanistic interpretability Neel Nanda Lawrence Chan Tom Lieberum Jess Smith Jacob Steinhardt 117 451 0 12 Jan 2023
Towards Understanding Grokking: An Effective Theory of Representation Learning Ziming Liu O. Kitouni Niklas Nolte Eric J. Michaud Max Tegmark Mike Williams AI4CE 112 154 0 20 May 2022
Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets Alethea Power Yuri Burda Harrison Edwards Igor Babuschkin Vedant Misra 130 366 0 06 Jan 2022
Loss landscapes and optimization in over-parameterized non-linear systems and neural networks Chaoyue Liu Libin Zhu M. Belkin ODL 142 266 0 29 Feb 2020
Almost-everywhere algorithmic stability and generalization error S. Kutin P. Niyogi 114 173 0 12 Dec 2012