Understanding deep learning requires rethinking generalization

10 November 2016

Benjamin Recht

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 882 papers shown

Title
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation Aaron Mishkin Mert Pilanci Mark Schmidt 64 1 0 03 Apr 2024
Partitioned Neural Network Training via Synthetic Intermediate Labels C. V. Karadag Nezih Topaloglu 34 1 0 17 Mar 2024
A Decade's Battle on Dataset Bias: Are We There Yet? Zhuang Liu Kaiming He 42 28 0 13 Mar 2024
Efficient Knowledge Deletion from Trained Models through Layer-wise Partial Machine Unlearning Vinay Chakravarthi Gogineni E. Nadimi MU 31 1 0 12 Mar 2024
On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction Jianwei Wang Tianyin Wang Ziqian Zeng 56 1 0 28 Feb 2024
Investigating Generalization Behaviours of Generative Flow Networks Lazar Atanackovic Emmanuel Bengio AI4CE 30 2 0 07 Feb 2024
Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum Tin Sum Cheng Aurelien Lucchi Anastasis Kratsios David Belius 37 8 0 02 Feb 2024
Strategic Usage in a Multi-Learner Setting Eliot Shekhtman Sarah Dean 37 2 0 29 Jan 2024
Learning to Manipulate under Limited Information Wesley H. Holliday Alexander Kristoffersen Eric Pacuit 22 4 0 29 Jan 2024
Learning with Noisy Labels: Interconnection of Two Expectation-Maximizations Heewon Kim Hyun Sung Chang Kiho Cho Jaeyun Lee Bohyung Han NoLa 26 2 0 09 Jan 2024
PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs Max Zimmer Megi Andoni Christoph Spiegel Sebastian Pokutta VLM 52 10 0 23 Dec 2023
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction Pratyusha Sharma Jordan T. Ash Dipendra Kumar Misra LRM 19 78 0 21 Dec 2023
Optimizing Neural Networks with Gradient Lexicase Selection Lijie Ding Lee Spector 40 20 0 19 Dec 2023
$\emph{Lifted} RDT based capacity analysis of the 1-hidden layer treelike \emph{sign} perceptrons neural networks$ \emph{Lifted} RDT based capacity analysis of the 1-hidden layer treelike \emph{sign} perceptrons neural networks M. Stojnic 24 1 0 13 Dec 2023
Capacity of the treelike sign perceptrons neural networks with one hidden layer -- RDT based upper bounds M. Stojnic 21 4 0 13 Dec 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization Sungbin Shin Dongyeop Lee Maksym Andriushchenko Namhoon Lee AAML 44 1 0 29 Nov 2023
In Search of a Data Transformation That Accelerates Neural Field Training Junwon Seo Sangyoon Lee Kwang In Kim Jaeho Lee 44 3 0 28 Nov 2023
Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets A. D. Cunha Francesco d’Amore Emanuele Natale MLT 27 1 0 16 Nov 2023
Unified machine learning tasks and datasets for enhancing renewable energy Arsam Aryandoust Thomas Rigoni Francesco di Stefano Anthony Patt 40 0 0 12 Nov 2023
Rethinking Benchmark and Contamination for Language Models with Rephrased Samples Shuo Yang Wei-Lin Chiang Lianmin Zheng Joseph E. Gonzalez Ion Stoica ALM 27 110 0 08 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring Arthur Ouaknine T. Kattenborn Etienne Laliberté David Rolnick 51 5 0 01 Nov 2023
Learning to Abstain From Uninformative Data Yikai Zhang Songzhu Zheng M. Dalirrooyfard Pengxiang Wu Anderson Schneider Anant Raj Yuriy Nevmyvaka Chao Chen 26 2 0 25 Sep 2023
PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding Yu-Cheng Hsieh Cheng Sun Suraj Dengale Min Sun 3DPC 36 1 0 18 Sep 2023
Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss T. Getu Georges Kaddoum M. Bennis 40 1 0 13 Sep 2023
Learning Active Subspaces for Effective and Scalable Uncertainty Quantification in Deep Neural Networks Sanket R. Jantre Nathan M. Urban Xiaoning Qian Byung-Jun Yoon BDL UQCV 26 4 0 06 Sep 2023
Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization Leyang Zhang Yaoyu Zhang Tao Luo 20 2 0 01 Sep 2023
MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins Tiberiu Sosea Cornelia Caragea 16 12 0 17 Aug 2023
Test-Time Poisoning Attacks Against Test-Time Adaptation Models Tianshuo Cong Xinlei He Yun Shen Yang Zhang AAML TTA 32 5 0 16 Aug 2023
DaMSTF: Domain Adversarial Learning Enhanced Meta Self-Training for Domain Adaptation Menglong Lu Zhen Huang Yunxiang Zhao Zhiliang Tian Yang Liu Dongsheng Li 29 6 0 05 Aug 2023
Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks Jun Guo Aishan Liu Xingyu Zheng Siyuan Liang Yisong Xiao Yichao Wu Xianglong Liu AAML 35 12 0 02 Aug 2023
Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes S. Lehmler Muhammad Saif-ur-Rehman Tobias Glasmachers Ioannis Iossifidis 24 0 0 01 Aug 2023
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators? T. Kajitsuka Issei Sato 31 16 0 26 Jul 2023
Learning to Segment from Noisy Annotations: A Spatial Correction Approach Jiacheng Yao Yikai Zhang Songzhu Zheng Mayank Goswami Prateek Prasanna Chao Chen 41 15 0 21 Jul 2023
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization Kaiyue Wen Zhiyuan Li Tengyu Ma FAtt 38 26 0 20 Jul 2023
Addressing caveats of neural persistence with deep graph persistence Leander Girrbach Anders Christensen Ole Winther Zeynep Akata A. Sophia Koepke GNN 25 1 0 20 Jul 2023
Deconstructing Data Reconstruction: Multiclass, Weight Decay and General Losses G. Buzaglo Niv Haim Gilad Yehudai Gal Vardi Yakir Oz Yaniv Nikankin Michal Irani 34 10 0 04 Jul 2023
Understanding quantum machine learning also requires rethinking generalization Elies Gil-Fuster Jens Eisert Carlos Bravo-Prieto 35 45 0 23 Jun 2023
Precise Asymptotic Generalization for Multiclass Classification with Overparameterized Linear Models David X. Wu A. Sahai 26 2 0 23 Jun 2023
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing Yelysei Bondarenko Markus Nagel Tijmen Blankevoort MQ 15 88 0 22 Jun 2023
FedNoisy: Federated Noisy Label Learning Benchmark Siqi Liang Jintao Huang Junyuan Hong Dun Zeng Jiayu Zhou Zenglin Xu FedML 40 7 0 20 Jun 2023
Gibbs-Based Information Criteria and the Over-Parameterized Regime Haobo Chen Yuheng Bu Greg Wornell 27 1 0 08 Jun 2023
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability Jianing Zhu Hengzhuang Li Jiangchao Yao Tongliang Liu Jianliang Xu Bo Han OODD 43 12 0 06 Jun 2023
Proximity to Losslessly Compressible Parameters Matthew Farrugia-Roberts 30 0 0 05 Jun 2023
Memorization Capacity of Multi-Head Attention in Transformers Sadegh Mahdavi Renjie Liao Christos Thrampoulidis 26 22 0 03 Jun 2023
Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation Arpit Garg Cuong C. Nguyen Rafael Felix Thanh-Toan Do G. Carneiro NoLa 35 1 0 31 May 2023
BadLabel: A Robust Perspective on Evaluating and Enhancing Label-noise Learning Jingfeng Zhang Bo Song Haohan Wang Bo Han Tongliang Liu Lei Liu Masashi Sugiyama AAML NoLa 32 14 0 28 May 2023
Generalization Guarantees of Gradient Descent for Multi-Layer Neural Networks Puyu Wang Yunwen Lei Di Wang Yiming Ying Ding-Xuan Zhou MLT 29 3 0 26 May 2023
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations Hao Chen Ankit Shah Jindong Wang R. Tao Yidong Wang Xingxu Xie Masashi Sugiyama Rita Singh Bhiksha Raj 37 12 0 22 May 2023
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing Tingting Wu Xiao Ding Minji Tang Haotian Zhang Bing Qin Ting Liu NoLa 34 9 0 18 May 2023
Small Models are Valuable Plug-ins for Large Language Models Canwen Xu Yichong Xu Shuohang Wang Yang Liu Chenguang Zhu Julian McAuley LLMAG 41 45 0 15 May 2023