21
0

Geometry of Learning -- L2 Phase Transitions in Deep and Shallow Neural Networks

Abstract

When neural networks (NNs) are subject to L2 regularization, increasing the regularization strength beyond a certain threshold pushes the model into an under-parameterization regime. This transition manifests as a first-order phase transition in single-hidden-layer NNs and a second-order phase transition in NNs with two or more hidden layers. This paper establishes a unified framework for such transitions by integrating the Ricci curvature of the loss landscape with regularizer-driven deep learning. First, we show that a curvature change-point separates the model-accuracy regimes in the onset of learning and that it is identical to the critical point of the phase transition driven by regularization. Second, we show that for more complex data sets additional phase transitions exist between model accuracies, and that they are again identical to curvature change points in the error landscape. Third, by studying the MNIST data set using a Variational Autoencoder, we demonstrate that the curvature change points identify phase transitions in model accuracy outside the L2 setting. Our framework also offers practical insights for optimizing model performance across various architectures and datasets. By linking geometric features of the error landscape to observable phase transitions, our work paves the way for more informed regularization strategies and potentially new methods for probing the intrinsic structure of neural networks beyond the L2 context.

View on arXiv
@article{ersoy2025_2505.06597,
  title={ Geometry of Learning -- L2 Phase Transitions in Deep and Shallow Neural Networks },
  author={ Ibrahim Talha Ersoy and Karoline Wiesner },
  journal={arXiv preprint arXiv:2505.06597},
  year={ 2025 }
}
Comments on this paper