Understanding deep learning requires rethinking generalization

10 November 2016

Benjamin Recht

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 885 papers shown

Title
Regularization-wise double descent: Why it occurs and how to eliminate it Fatih Yilmaz Reinhard Heckel 30 11 0 03 Jun 2022
Dataset Distillation using Neural Feature Regression Yongchao Zhou E. Nezhadarya Jimmy Ba DD FedML 44 149 0 01 Jun 2022
Context-based Virtual Adversarial Training for Text Classification with Noisy Labels Do-Myoung Lee Yeachan Kim Chang-gyun Seo NoLa 21 2 0 29 May 2022
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power Binghui Li Jikai Jin Han Zhong J. Hopcroft Liwei Wang OOD 82 27 0 27 May 2022
Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks Zhiwei Bai Tao Luo Z. Xu Yaoyu Zhang 31 4 0 26 May 2022
VeriFi: Towards Verifiable Federated Unlearning Xiangshan Gao Xingjun Ma Jingyi Wang Youcheng Sun Bo Li S. Ji Peng Cheng Jiming Chen MU 67 46 0 25 May 2022
On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity Vincent Szolnoky Viktor Andersson Balázs Kulcsár Rebecka Jörnsten 42 5 0 25 May 2022
Compression-aware Training of Neural Networks using Frank-Wolfe Max Zimmer Christoph Spiegel Sebastian Pokutta 29 9 0 24 May 2022
Randomly Initialized One-Layer Neural Networks Make Data Linearly Separable Promit Ghosal Srinath Mahankali Yihang Sun MLT 24 4 0 24 May 2022
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models Kushal Tirumala Aram H. Markosyan Luke Zettlemoyer Armen Aghajanyan TDI 29 185 0 22 May 2022
Interpolating Compressed Parameter Subspaces Siddhartha Datta N. Shadbolt 37 5 0 19 May 2022
Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization C. Linse T. Martinetz SSL VLM 12 4 0 18 May 2022
Learn2Weight: Parameter Adaptation against Similar-domain Adversarial Attacks Siddhartha Datta AAML 34 4 0 15 May 2022
HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction Xuming Hu Shuliang Liu Chenwei Zhang Shuang Li Lijie Wen Philip S. Yu SSL 46 39 0 04 May 2022
A Comprehensive Survey of Image Augmentation Techniques for Deep Learning Mingle Xu Sook Yoon A. Fuentes D. Park VLM 27 397 0 03 May 2022
FedRN: Exploiting k-Reliable Neighbors Towards Robust Federated Learning Sangmook Kim Wonyoung Shin Soohyuk Jang Hwanjun Song Se-Young Yun 34 2 0 03 May 2022
Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning Mayee F. Chen Daniel Y. Fu A. Narayan Michael Zhang Zhao Song Kayvon Fatahalian Christopher Ré SSL 32 47 0 15 Apr 2022
Nonlocal optimization of binary neural networks Amir Khoshaman Giuseppe Castiglione C. Srinivasa 18 0 0 05 Apr 2022
Learning from few examples with nonlinear feature maps I. Tyukin Oliver J. Sutton Alexander N. Gorban 14 1 0 31 Mar 2022
PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs Zehao Dong Muhan Zhang Fuhai Li Yixin Chen CML GNN 33 17 0 19 Mar 2022
Reducing Flipping Errors in Deep Neural Networks Xiang Deng Yun Xiao Bo Long Zhongfei Zhang AAML 38 3 0 16 Mar 2022
Deep AutoAugment Yu Zheng Z. Zhang Shen Yan Mi Zhang ViT 23 26 0 11 Mar 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks Xin Yu Thiago Serra Srikumar Ramalingam Shandian Zhe 42 48 0 09 Mar 2022
Selective-Supervised Contrastive Learning with Noisy Labels Shikun Li Xiaobo Xia Shiming Ge Tongliang Liu NoLa 24 172 0 08 Mar 2022
Generalization Through The Lens Of Leave-One-Out Error Gregor Bachmann Thomas Hofmann Aurelien Lucchi 52 7 0 07 Mar 2022
Explicitising The Implicit Intrepretability of Deep Neural Networks Via Duality Chandrashekar Lakshminarayanan Ashutosh Kumar Singh A. Rajkumar AI4CE 26 1 0 01 Mar 2022
Understanding Contrastive Learning Requires Incorporating Inductive Biases Nikunj Saunshi Jordan T. Ash Surbhi Goel Dipendra Kumar Misra Cyril Zhang Sanjeev Arora Sham Kakade A. Krishnamurthy SSL 24 109 0 28 Feb 2022
The Spectral Bias of Polynomial Neural Networks Moulik Choraria L. Dadi Grigorios G. Chrysos Julien Mairal V. Cevher 24 18 0 27 Feb 2022
Benign Underfitting of Stochastic Gradient Descent Tomer Koren Roi Livni Yishay Mansour Uri Sherman MLT 20 13 0 27 Feb 2022
Adversarial robustness of sparse local Lipschitz predictors Ramchandran Muthukumar Jeremias Sulam AAML 32 13 0 26 Feb 2022
ASSIST: Towards Label Noise-Robust Dialogue State Tracking Fanghua Ye Yue Feng Emine Yilmaz 21 21 0 26 Feb 2022
Benefit of Interpolation in Nearest Neighbor Algorithms Yue Xing Qifan Song Guang Cheng 11 28 0 23 Feb 2022
On PAC-Bayesian reconstruction guarantees for VAEs Badr-Eddine Chérief-Abdellatif Yuyang Shi Arnaud Doucet Benjamin Guedj DRL 50 17 0 23 Feb 2022
Random Feature Amplification: Feature Learning and Generalization in Neural Networks Spencer Frei Niladri S. Chatterji Peter L. Bartlett MLT 30 29 0 15 Feb 2022
Information-Theoretic Analysis of Minimax Excess Risk Hassan Hafez-Kolahi Behrad Moniri S. Kasaei 17 4 0 15 Feb 2022
Generalisation and the Risk--Entropy Curve Dominic Belcher Antonia Marcu Adam Prugel-Bennett 11 0 0 15 Feb 2022
On the Origins of the Block Structure Phenomenon in Neural Network Representations Thao Nguyen M. Raghu Simon Kornblith 25 14 0 15 Feb 2022
Evolving Neural Networks with Optimal Balance between Information Flow and Connections Cost A. Khalili A. Bouchachia 14 0 0 12 Feb 2022
The no-free-lunch theorems of supervised learning T. Sterkenburg Peter Grünwald FedML 24 56 0 09 Feb 2022
A Survey on Poisoning Attacks Against Supervised Machine Learning Wenjun Qiu AAML 28 9 0 05 Feb 2022
Learning with Neighbor Consistency for Noisy Labels Ahmet Iscen Jack Valmadre Anurag Arnab Cordelia Schmid NoLa 41 75 0 04 Feb 2022
Non-Vacuous Generalisation Bounds for Shallow Neural Networks Felix Biggs Benjamin Guedj BDL 30 26 0 03 Feb 2022
On Regularizing Coordinate-MLPs Sameera Ramasinghe L. MacDonald Simon Lucey 158 5 0 01 Feb 2022
Deep Layer-wise Networks Have Closed-Form Weights Chieh-Tsai Wu A. Masoomi Arthur Gretton Jennifer Dy 29 3 0 01 Feb 2022
Datamodels: Predicting Predictions from Training Data Andrew Ilyas Sung Min Park Logan Engstrom Guillaume Leclerc A. Madry TDI 47 131 0 01 Feb 2022
Backdoors Stuck At The Frontdoor: Multi-Agent Backdoor Attacks That Backfire Siddhartha Datta N. Shadbolt AAML 32 7 0 28 Jan 2022
Interplay between depth of neural networks and locality of target functions Takashi Mori Masakuni Ueda 25 0 0 28 Jan 2022
Improved Overparametrization Bounds for Global Convergence of Stochastic Gradient Descent for Shallow Neural Networks Bartlomiej Polaczyk J. Cyranka ODL 33 3 0 28 Jan 2022
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks Noam Razin Asaf Maman Nadav Cohen 46 29 0 27 Jan 2022
PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning Haobo Wang Rui Xiao Yixuan Li Lei Feng Gang Niu Gang Chen J. Zhao VLM 49 25 0 22 Jan 2022