ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.03530
  4. Cited By
Understanding deep learning requires rethinking generalization

Understanding deep learning requires rethinking generalization

10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
    HAI
ArXivPDFHTML

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 882 papers shown
Title
Threat Modeling for AI: The Case for an Asset-Centric Approach
Threat Modeling for AI: The Case for an Asset-Centric Approach
Jose Sanchez Vicarte
Marcin Spoczynski
Mostafa Elsaid
29
0
0
08 May 2025
More Optimal Fractional-Order Stochastic Gradient Descent for Non-Convex Optimization Problems
More Optimal Fractional-Order Stochastic Gradient Descent for Non-Convex Optimization Problems
Mohammad Partohaghighi
Roummel Marcia
YangQuan Chen
19
0
0
05 May 2025
Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks
Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks
Juyoung Yun
38
0
0
05 May 2025
Contextures: Representations from Contexts
Contextures: Representations from Contexts
Runtian Zhai
Kai Yang
Che-Ping Tsai
Burak Varici
Zico Kolter
Pradeep Ravikumar
119
0
0
02 May 2025
Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization
Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization
Kuan Zhang
Chengliang Chai
Jingzhe Xu
Chi Zhang
Ye Yuan
Guoren Wang
Lei Cao
NoLa
66
0
0
01 May 2025
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
Kola Ayonrinde
Louis Jaburi
MILM
86
1
0
01 May 2025
Sobolev norm inconsistency of kernel interpolation
Sobolev norm inconsistency of kernel interpolation
Yunfei Yang
34
0
0
29 Apr 2025
Gradient Descent as a Shrinkage Operator for Spectral Bias
Gradient Descent as a Shrinkage Operator for Spectral Bias
Simon Lucey
38
0
0
25 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
0
0
17 Apr 2025
Generalization through variance: how noise shapes inductive biases in diffusion models
Generalization through variance: how noise shapes inductive biases in diffusion models
John J. Vastola
DiffM
164
2
0
16 Apr 2025
Effective Dimension Aware Fractional-Order Stochastic Gradient Descent for Convex Optimization Problems
Effective Dimension Aware Fractional-Order Stochastic Gradient Descent for Convex Optimization Problems
Mohammad Partohaghighi
Roummel Marcia
YangQuan Chen
46
0
0
17 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
Xuzhi Zhang
Yue Shang
Ge Zhang
AI4CE
63
0
0
17 Mar 2025
Training Large Neural Networks With Low-Dimensional Error Feedback
Training Large Neural Networks With Low-Dimensional Error Feedback
Maher Hanut
Jonathan Kadmon
40
1
0
27 Feb 2025
Sample Selection via Contrastive Fragmentation for Noisy Label Regression
Sample Selection via Contrastive Fragmentation for Noisy Label Regression
C. Kim
Sangwoo Moon
Jihwan Moon
Dongyeon Woo
Gunhee Kim
NoLa
57
0
0
25 Feb 2025
GraphFM: Graph Factorization Machines for Feature Interaction Modeling
GraphFM: Graph Factorization Machines for Feature Interaction Modeling
Shu Wu
Zekun Li
Yunyue Su
Zeyu Cui
Xiaoyu Zhang
Liang Wang
66
22
0
24 Feb 2025
On Memorization in Diffusion Models
On Memorization in Diffusion Models
Xiangming Gu
Chao Du
Tianyu Pang
Chongxuan Li
Min-Bin Lin
Ye Wang
DiffM
TDI
166
43
0
21 Feb 2025
Random Forest Autoencoders for Guided Representation Learning
Random Forest Autoencoders for Guided Representation Learning
Adrien Aumon
Shuang Ni
Myriam Lizotte
Guy Wolf
Kevin R. Moon
Jake S. Rhodes
67
0
0
18 Feb 2025
Stability-based Generalization Bounds for Variational Inference
Stability-based Generalization Bounds for Variational Inference
Yadi Wei
R. Khardon
BDL
49
0
0
17 Feb 2025
Captured by Captions: On Memorization and its Mitigation in CLIP Models
Captured by Captions: On Memorization and its Mitigation in CLIP Models
Wenhao Wang
Adam Dziedzic
Grace C. Kim
Michael Backes
Franziska Boenisch
93
0
0
11 Feb 2025
Early Stopping Against Label Noise Without Validation Data
Early Stopping Against Label Noise Without Validation Data
Suqin Yuan
Lei Feng
Tongliang Liu
NoLa
101
15
0
11 Feb 2025
The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation
The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation
Martin Mundt
Anaelia Ovalle
Felix Friedrich
A Pranav
Subarnaduti Paul
Manuel Brack
Kristian Kersting
William Agnew
289
0
0
05 Feb 2025
Noise-Tolerant Hybrid Prototypical Learning with Noisy Web Data
Chao Liang
Linchao Zhu
Zongxin Yang
Wei Chen
Yi Yang
NoLa
59
0
0
05 Jan 2025
Functional Risk Minimization
Functional Risk Minimization
Ferran Alet
Clement Gehring
Tomás Lozano-Pérez
Kenji Kawaguchi
Joshua B. Tenenbaum
Leslie Pack Kaelbling
OffRL
60
0
0
31 Dec 2024
Combating Semantic Contamination in Learning with Label Noise
Combating Semantic Contamination in Learning with Label Noise
Wenxiao Fan
Kan Li
NoLa
184
0
0
16 Dec 2024
Guiding Through Complexity: What Makes Good Supervision for Hard Math Reasoning Tasks?
Guiding Through Complexity: What Makes Good Supervision for Hard Math Reasoning Tasks?
Xuan He
Da Yin
Nanyun Peng
LRM
40
0
0
27 Oct 2024
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
Thomas Robert
M. Safaryan
Ionut-Vlad Modoranu
Dan Alistarh
ODL
36
2
0
21 Oct 2024
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Zhanpeng Zhou
Mingze Wang
Yuchen Mao
Bingrui Li
Junchi Yan
AAML
62
0
0
14 Oct 2024
Extended convexity and smoothness and their applications in deep learning
Extended convexity and smoothness and their applications in deep learning
Binchuan Qi
Wei Gong
Li Li
61
0
0
08 Oct 2024
Residual Kolmogorov-Arnold Network for Enhanced Deep Learning
Residual Kolmogorov-Arnold Network for Enhanced Deep Learning
Ray Congrui Yu
Sherry Wu
Jiang Gui
44
1
0
07 Oct 2024
Rethinking Fair Representation Learning for Performance-Sensitive Tasks
Rethinking Fair Representation Learning for Performance-Sensitive Tasks
Charles Jones
Fabio De Sousa Ribeiro
Mélanie Roschewitz
Daniel Coelho De Castro
Ben Glocker
FaML
OOD
CML
146
1
0
05 Oct 2024
Classification-Denoising Networks
Classification-Denoising Networks
Louis Thiry
Florentin Guth
34
0
0
04 Oct 2024
How Much Can We Forget about Data Contamination?
How Much Can We Forget about Data Contamination?
Sebastian Bordt
Suraj Srinivas
Valentyn Boreiko
U. V. Luxburg
45
1
0
04 Oct 2024
Timber! Poisoning Decision Trees
Timber! Poisoning Decision Trees
Stefano Calzavara
Lorenzo Cazzaro
Massimo Vettori
AAML
27
0
0
01 Oct 2024
Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization
Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization
Gentiana Rashiti
G. Karunaratne
Mrinmaya Sachan
Abu Sebastian
Abbas Rahimi
RALM
39
0
0
12 Sep 2024
Optimizing Neural Network Performance and Interpretability with
  Diophantine Equation Encoding
Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding
Ronald Katende
35
0
0
11 Sep 2024
Overfitting Behaviour of Gaussian Kernel Ridgeless Regression: Varying
  Bandwidth or Dimensionality
Overfitting Behaviour of Gaussian Kernel Ridgeless Regression: Varying Bandwidth or Dimensionality
Marko Medvedev
Gal Vardi
Nathan Srebro
68
3
0
05 Sep 2024
Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning
Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning
Mohammadamin Banayeeanzade
Mahdi Soltanolkotabi
Mohammad Rostami
CLL
LRM
103
1
0
29 Aug 2024
Weakly Contrastive Learning via Batch Instance Discrimination and
  Feature Clustering for Small Sample SAR ATR
Weakly Contrastive Learning via Batch Instance Discrimination and Feature Clustering for Small Sample SAR ATR
Yikui Zhai
Wenlve Zhou
Bing Sun
Jingwen Li
Qirui Ke
...
Junying Gan
Chaoyun Mai
R. D. Labati
Vincenzo Piuri
F. Scotti
27
19
0
07 Aug 2024
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning
Arthur Jacot
Seok Hoan Choi
Yuxiao Wen
AI4CE
91
2
0
08 Jul 2024
Evaluating Model Performance Under Worst-case Subpopulations
Evaluating Model Performance Under Worst-case Subpopulations
Mike Li
Hongseok Namkoong
Shangzhou Xia
45
17
0
01 Jul 2024
CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine Learning
CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine Learning
Huaiguang Cai
FedML
TDI
58
1
0
17 Jun 2024
Just How Flexible are Neural Networks in Practice?
Just How Flexible are Neural Networks in Practice?
Ravid Shwartz-Ziv
Micah Goldblum
Arpit Bansal
C. Bayan Bruss
Yann LeCun
Andrew Gordon Wilson
43
4
0
17 Jun 2024
Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization
Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization
Jiaxin Deng
Junbiao Pang
Baochang Zhang
66
1
0
12 Jun 2024
Loss Gradient Gaussian Width based Generalization and Optimization Guarantees
Loss Gradient Gaussian Width based Generalization and Optimization Guarantees
A. Banerjee
Qiaobo Li
Yingxue Zhou
49
0
0
11 Jun 2024
A Margin-based Multiclass Generalization Bound via Geometric Complexity
A Margin-based Multiclass Generalization Bound via Geometric Complexity
Michael Munn
Benoit Dherin
Javier Gonzalvo
UQCV
40
2
0
28 May 2024
Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion
Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion
Zhiwei Bai
Jiajie Zhao
Yaoyu Zhang
AI4CE
37
0
0
22 May 2024
A Multi-Perspective Analysis of Memorization in Large Language Models
A Multi-Perspective Analysis of Memorization in Large Language Models
Bowen Chen
Namgi Han
Yusuke Miyao
46
1
0
19 May 2024
A Systematic Evaluation of Large Language Models for Natural Language
  Generation Tasks
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks
Xuanfan Ni
Piji Li
ELM
LRM
31
8
0
16 May 2024
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Benjamin Dupuis
Paul Viallard
George Deligiannidis
Umut Simsekli
42
2
0
26 Apr 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
35
4
0
04 Apr 2024
1234...161718
Next