ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.10036
  4. Cited By
On the Generalization Mystery in Deep Learning

On the Generalization Mystery in Deep Learning

18 March 2022
S. Chatterjee
Piotr Zielinski
    OOD
ArXivPDFHTML

Papers citing "On the Generalization Mystery in Deep Learning"

10 / 10 papers shown
Title
Information-Theoretic Generalization Bounds for Deep Neural Networks
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
35
4
0
04 Apr 2024
Astroconformer: The Prospects of Analyzing Stellar Light Curves with
  Transformer-Based Deep Learning Models
Astroconformer: The Prospects of Analyzing Stellar Light Curves with Transformer-Based Deep Learning Models
Kishankumar Bhimani
Yuan-Sen Ting
Jie Yu
16
4
0
28 Sep 2023
Token-Level Fitting Issues of Seq2seq Models
Token-Level Fitting Issues of Seq2seq Models
Guangsheng Bao
Zhiyang Teng
Yue Zhang
24
0
0
08 May 2023
On the Interpretability of Regularisation for Neural Networks Through
  Model Gradient Similarity
On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity
Vincent Szolnoky
Viktor Andersson
Balázs Kulcsár
Rebecka Jörnsten
42
5
0
25 May 2022
Exploring the Learning Difficulty of Data Theory and Measure
Exploring the Learning Difficulty of Data Theory and Measure
Weiyao Zhu
Ou Wu
Fengguang Su
Yingjun Deng
35
5
0
16 May 2022
Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for
  Full-Batch GD
Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD
Konstantinos E. Nikolakakis
Farzin Haddadpour
Amin Karbasi
Dionysios S. Kalogerias
43
17
0
26 Apr 2022
Learning in High Dimension Always Amounts to Extrapolation
Learning in High Dimension Always Amounts to Extrapolation
Randall Balestriero
J. Pesenti
Yann LeCun
41
103
0
18 Oct 2021
Enabling Binary Neural Network Training on the Edge
Enabling Binary Neural Network Training on the Edge
Erwei Wang
James J. Davis
Daniele Moro
Piotr Zielinski
Jia Jie Lim
C. Coelho
S. Chatterjee
P. Cheung
George A. Constantinides
MQ
20
24
0
08 Feb 2021
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts
  Generalization
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
Stanislaw Jastrzebski
Devansh Arpit
Oliver Åstrand
Giancarlo Kerg
Huan Wang
Caiming Xiong
R. Socher
Kyunghyun Cho
Krzysztof J. Geras
AI4CE
184
65
0
28 Dec 2020
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
308
2,890
0
15 Sep 2016
1