Understanding deep learning requires rethinking generalization

10 November 2016

Benjamin Recht

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 882 papers shown

Title
Threat Modeling for AI: The Case for an Asset-Centric Approach Jose Sanchez Vicarte Marcin Spoczynski Mostafa Elsaid 29 0 0 08 May 2025
More Optimal Fractional-Order Stochastic Gradient Descent for Non-Convex Optimization Problems Mohammad Partohaghighi Roummel Marcia YangQuan Chen 19 0 0 05 May 2025
Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks Juyoung Yun 38 0 0 05 May 2025
Contextures: Representations from Contexts Runtian Zhai Kai Yang Che-Ping Tsai Burak Varici Zico Kolter Pradeep Ravikumar 119 0 0 02 May 2025
Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization Kuan Zhang Chengliang Chai Jingzhe Xu Chi Zhang Ye Yuan Guoren Wang Lei Cao NoLa 66 0 0 01 May 2025
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i Kola Ayonrinde Louis Jaburi MILM 86 1 0 01 May 2025
Sobolev norm inconsistency of kernel interpolation Yunfei Yang 34 0 0 29 Apr 2025
Gradient Descent as a Shrinkage Operator for Spectral Bias Simon Lucey 38 0 0 25 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges Grigorios G. Chrysos Yongtao Wu Razvan Pascanu Philip Torr V. Cevher AAML 98 0 0 17 Apr 2025
Generalization through variance: how noise shapes inductive biases in diffusion models John J. Vastola DiffM 164 2 0 16 Apr 2025
Effective Dimension Aware Fractional-Order Stochastic Gradient Descent for Convex Optimization Problems Mohammad Partohaghighi Roummel Marcia YangQuan Chen 46 0 0 17 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability Entao Yang Xuzhi Zhang Yue Shang Ge Zhang AI4CE 63 0 0 17 Mar 2025
Training Large Neural Networks With Low-Dimensional Error Feedback Maher Hanut Jonathan Kadmon 40 1 0 27 Feb 2025
Sample Selection via Contrastive Fragmentation for Noisy Label Regression C. Kim Sangwoo Moon Jihwan Moon Dongyeon Woo Gunhee Kim NoLa 57 0 0 25 Feb 2025
GraphFM: Graph Factorization Machines for Feature Interaction Modeling Shu Wu Zekun Li Yunyue Su Zeyu Cui Xiaoyu Zhang Liang Wang 66 22 0 24 Feb 2025
On Memorization in Diffusion Models Xiangming Gu Chao Du Tianyu Pang Chongxuan Li Min-Bin Lin Ye Wang DiffM TDI 166 43 0 21 Feb 2025
Random Forest Autoencoders for Guided Representation Learning Adrien Aumon Shuang Ni Myriam Lizotte Guy Wolf Kevin R. Moon Jake S. Rhodes 67 0 0 18 Feb 2025
Stability-based Generalization Bounds for Variational Inference Yadi Wei R. Khardon BDL 49 0 0 17 Feb 2025
Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang Adam Dziedzic Grace C. Kim Michael Backes Franziska Boenisch 93 0 0 11 Feb 2025
Early Stopping Against Label Noise Without Validation Data Suqin Yuan Lei Feng Tongliang Liu NoLa 101 15 0 11 Feb 2025
The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation Martin Mundt Anaelia Ovalle Felix Friedrich A Pranav Subarnaduti Paul Manuel Brack Kristian Kersting William Agnew 289 0 0 05 Feb 2025
Noise-Tolerant Hybrid Prototypical Learning with Noisy Web Data Chao Liang Linchao Zhu Zongxin Yang Wei Chen Yi Yang NoLa 59 0 0 05 Jan 2025
Functional Risk Minimization Ferran Alet Clement Gehring Tomás Lozano-Pérez Kenji Kawaguchi Joshua B. Tenenbaum Leslie Pack Kaelbling OffRL 60 0 0 31 Dec 2024
Combating Semantic Contamination in Learning with Label Noise Wenxiao Fan Kan Li NoLa 184 0 0 16 Dec 2024
Guiding Through Complexity: What Makes Good Supervision for Hard Math Reasoning Tasks? Xuan He Da Yin Nanyun Peng LRM 40 0 0 27 Oct 2024
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics Thomas Robert M. Safaryan Ionut-Vlad Modoranu Dan Alistarh ODL 36 2 0 21 Oct 2024
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training Zhanpeng Zhou Mingze Wang Yuchen Mao Bingrui Li Junchi Yan AAML 62 0 0 14 Oct 2024
Extended convexity and smoothness and their applications in deep learning Binchuan Qi Wei Gong Li Li 61 0 0 08 Oct 2024
Residual Kolmogorov-Arnold Network for Enhanced Deep Learning Ray Congrui Yu Sherry Wu Jiang Gui 44 1 0 07 Oct 2024
Rethinking Fair Representation Learning for Performance-Sensitive Tasks Charles Jones Fabio De Sousa Ribeiro Mélanie Roschewitz Daniel Coelho De Castro Ben Glocker FaML OOD CML 146 1 0 05 Oct 2024
Classification-Denoising Networks Louis Thiry Florentin Guth 34 0 0 04 Oct 2024
How Much Can We Forget about Data Contamination? Sebastian Bordt Suraj Srinivas Valentyn Boreiko U. V. Luxburg 45 1 0 04 Oct 2024
Timber! Poisoning Decision Trees Stefano Calzavara Lorenzo Cazzaro Massimo Vettori AAML 27 0 0 01 Oct 2024
Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization Gentiana Rashiti G. Karunaratne Mrinmaya Sachan Abu Sebastian Abbas Rahimi RALM 39 0 0 12 Sep 2024
Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding Ronald Katende 35 0 0 11 Sep 2024
Overfitting Behaviour of Gaussian Kernel Ridgeless Regression: Varying Bandwidth or Dimensionality Marko Medvedev Gal Vardi Nathan Srebro 68 3 0 05 Sep 2024
Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning Mohammadamin Banayeeanzade Mahdi Soltanolkotabi Mohammad Rostami CLL LRM 103 1 0 29 Aug 2024
Weakly Contrastive Learning via Batch Instance Discrimination and Feature Clustering for Small Sample SAR ATR Yikui Zhai Wenlve Zhou Bing Sun Jingwen Li Qirui Ke ... Junying Gan Chaoyun Mai R. D. Labati Vincenzo Piuri F. Scotti 27 19 0 07 Aug 2024
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning Arthur Jacot Seok Hoan Choi Yuxiao Wen AI4CE 91 2 0 08 Jul 2024
Evaluating Model Performance Under Worst-case Subpopulations Mike Li Hongseok Namkoong Shangzhou Xia 45 17 0 01 Jul 2024
CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine Learning Huaiguang Cai FedML TDI 58 1 0 17 Jun 2024
Just How Flexible are Neural Networks in Practice? Ravid Shwartz-Ziv Micah Goldblum Arpit Bansal C. Bayan Bruss Yann LeCun Andrew Gordon Wilson 43 4 0 17 Jun 2024
Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization Jiaxin Deng Junbiao Pang Baochang Zhang 66 1 0 12 Jun 2024
Loss Gradient Gaussian Width based Generalization and Optimization Guarantees A. Banerjee Qiaobo Li Yingxue Zhou 49 0 0 11 Jun 2024
A Margin-based Multiclass Generalization Bound via Geometric Complexity Michael Munn Benoit Dherin Javier Gonzalvo UQCV 40 2 0 28 May 2024
Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion Zhiwei Bai Jiajie Zhao Yaoyu Zhang AI4CE 37 0 0 22 May 2024
A Multi-Perspective Analysis of Memorization in Large Language Models Bowen Chen Namgi Han Yusuke Miyao 46 1 0 19 May 2024
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks Xuanfan Ni Piji Li ELM LRM 31 8 0 16 May 2024
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets Benjamin Dupuis Paul Viallard George Deligiannidis Umut Simsekli 42 2 0 26 Apr 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks Haiyun He Christina Lee Yu 35 4 0 04 Apr 2024