
Width Provably Matters in Optimization for Deep Linear Neural Networks
Papers citing "Width Provably Matters in Optimization for Deep Linear Neural Networks"
Title |
---|
Deep Linear Networks can Benignly Overfit when Shallow Ones Do. Niladri S. Chatterji, Philip M. Long |