On the Generalization Mystery in Deep Learning

18 March 2022

Papers citing "On the Generalization Mystery in Deep Learning"

10 / 10 papers shown

Title
Information-Theoretic Generalization Bounds for Deep Neural Networks Haiyun He Christina Lee Yu 35 4 0 04 Apr 2024
Astroconformer: The Prospects of Analyzing Stellar Light Curves with Transformer-Based Deep Learning Models Kishankumar Bhimani Yuan-Sen Ting Jie Yu 16 4 0 28 Sep 2023
Token-Level Fitting Issues of Seq2seq Models Guangsheng Bao Zhiyang Teng Yue Zhang 24 0 0 08 May 2023
On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity Vincent Szolnoky Viktor Andersson Balázs Kulcsár Rebecka Jörnsten 42 5 0 25 May 2022
Exploring the Learning Difficulty of Data Theory and Measure Weiyao Zhu Ou Wu Fengguang Su Yingjun Deng 35 5 0 16 May 2022
Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD Konstantinos E. Nikolakakis Farzin Haddadpour Amin Karbasi Dionysios S. Kalogerias 43 17 0 26 Apr 2022
Learning in High Dimension Always Amounts to Extrapolation Randall Balestriero J. Pesenti Yann LeCun 41 103 0 18 Oct 2021
Enabling Binary Neural Network Training on the Edge Erwei Wang James J. Davis Daniele Moro Piotr Zielinski Jia Jie Lim C. Coelho S. Chatterjee P. Cheung George A. Constantinides MQ 20 24 0 08 Feb 2021
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization Stanislaw Jastrzebski Devansh Arpit Oliver Åstrand Giancarlo Kerg Huan Wang Caiming Xiong R. Socher Kyunghyun Cho Krzysztof J. Geras AI4CE 184 65 0 28 Dec 2020
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima N. Keskar Dheevatsa Mudigere J. Nocedal M. Smelyanskiy P. T. P. Tang ODL 308 2,890 0 15 Sep 2016