ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.03530
  4. Cited By
Understanding deep learning requires rethinking generalization

Understanding deep learning requires rethinking generalization

10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
    HAI
ArXivPDFHTML

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 882 papers shown
Title
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
Aaron Mishkin
Mert Pilanci
Mark Schmidt
64
1
0
03 Apr 2024
Partitioned Neural Network Training via Synthetic Intermediate Labels
Partitioned Neural Network Training via Synthetic Intermediate Labels
C. V. Karadag
Nezih Topaloglu
34
1
0
17 Mar 2024
A Decade's Battle on Dataset Bias: Are We There Yet?
A Decade's Battle on Dataset Bias: Are We There Yet?
Zhuang Liu
Kaiming He
42
28
0
13 Mar 2024
Efficient Knowledge Deletion from Trained Models through Layer-wise
  Partial Machine Unlearning
Efficient Knowledge Deletion from Trained Models through Layer-wise Partial Machine Unlearning
Vinay Chakravarthi Gogineni
E. Nadimi
MU
31
1
0
12 Mar 2024
On the use of Silver Standard Data for Zero-shot Classification Tasks in
  Information Extraction
On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction
Jianwei Wang
Tianyin Wang
Ziqian Zeng
56
1
0
28 Feb 2024
Investigating Generalization Behaviours of Generative Flow Networks
Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic
Emmanuel Bengio
AI4CE
30
2
0
07 Feb 2024
Characterizing Overfitting in Kernel Ridgeless Regression Through the
  Eigenspectrum
Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum
Tin Sum Cheng
Aurelien Lucchi
Anastasis Kratsios
David Belius
37
8
0
02 Feb 2024
Strategic Usage in a Multi-Learner Setting
Strategic Usage in a Multi-Learner Setting
Eliot Shekhtman
Sarah Dean
37
2
0
29 Jan 2024
Learning to Manipulate under Limited Information
Learning to Manipulate under Limited Information
Wesley H. Holliday
Alexander Kristoffersen
Eric Pacuit
22
4
0
29 Jan 2024
Learning with Noisy Labels: Interconnection of Two
  Expectation-Maximizations
Learning with Noisy Labels: Interconnection of Two Expectation-Maximizations
Heewon Kim
Hyun Sung Chang
Kiho Cho
Jaeyun Lee
Bohyung Han
NoLa
26
2
0
09 Jan 2024
PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs
PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs
Max Zimmer
Megi Andoni
Christoph Spiegel
Sebastian Pokutta
VLM
52
10
0
23 Dec 2023
The Truth is in There: Improving Reasoning in Language Models with
  Layer-Selective Rank Reduction
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Pratyusha Sharma
Jordan T. Ash
Dipendra Kumar Misra
LRM
19
78
0
21 Dec 2023
Optimizing Neural Networks with Gradient Lexicase Selection
Optimizing Neural Networks with Gradient Lexicase Selection
Lijie Ding
Lee Spector
40
20
0
19 Dec 2023
\emph{Lifted} RDT based capacity analysis of the 1-hidden layer treelike
  \emph{sign} perceptrons neural networks
\emph{Lifted} RDT based capacity analysis of the 1-hidden layer treelike \emph{sign} perceptrons neural networks
M. Stojnic
24
1
0
13 Dec 2023
Capacity of the treelike sign perceptrons neural networks with one
  hidden layer -- RDT based upper bounds
Capacity of the treelike sign perceptrons neural networks with one hidden layer -- RDT based upper bounds
M. Stojnic
21
4
0
13 Dec 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization
Critical Influence of Overparameterization on Sharpness-aware Minimization
Sungbin Shin
Dongyeop Lee
Maksym Andriushchenko
Namhoon Lee
AAML
44
1
0
29 Nov 2023
In Search of a Data Transformation That Accelerates Neural Field
  Training
In Search of a Data Transformation That Accelerates Neural Field Training
Junwon Seo
Sangyoon Lee
Kwang In Kim
Jaeho Lee
44
3
0
28 Nov 2023
Polynomially Over-Parameterized Convolutional Neural Networks Contain
  Structured Strong Winning Lottery Tickets
Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets
A. D. Cunha
Francesco d’Amore
Emanuele Natale
MLT
27
1
0
16 Nov 2023
Unified machine learning tasks and datasets for enhancing renewable
  energy
Unified machine learning tasks and datasets for enhancing renewable energy
Arsam Aryandoust
Thomas Rigoni
Francesco di Stefano
Anthony Patt
40
0
0
12 Nov 2023
Rethinking Benchmark and Contamination for Language Models with
  Rephrased Samples
Rethinking Benchmark and Contamination for Language Models with Rephrased Samples
Shuo Yang
Wei-Lin Chiang
Lianmin Zheng
Joseph E. Gonzalez
Ion Stoica
ALM
27
110
0
08 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring
OpenForest: A data catalogue for machine learning in forest monitoring
Arthur Ouaknine
T. Kattenborn
Etienne Laliberté
David Rolnick
51
5
0
01 Nov 2023
Learning to Abstain From Uninformative Data
Learning to Abstain From Uninformative Data
Yikai Zhang
Songzhu Zheng
M. Dalirrooyfard
Pengxiang Wu
Anderson Schneider
Anant Raj
Yuriy Nevmyvaka
Chao Chen
26
2
0
25 Sep 2023
PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene
  Understanding
PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding
Yu-Cheng Hsieh
Cheng Sun
Suraj Dengale
Min Sun
3DPC
36
1
0
18 Sep 2023
Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss
Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss
T. Getu
Georges Kaddoum
M. Bennis
40
1
0
13 Sep 2023
Learning Active Subspaces for Effective and Scalable Uncertainty
  Quantification in Deep Neural Networks
Learning Active Subspaces for Effective and Scalable Uncertainty Quantification in Deep Neural Networks
Sanket R. Jantre
Nathan M. Urban
Xiaoning Qian
Byung-Jun Yoon
BDL
UQCV
26
4
0
06 Sep 2023
Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization
Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization
Leyang Zhang
Yaoyu Zhang
Tao Luo
20
2
0
01 Sep 2023
MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins
MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins
Tiberiu Sosea
Cornelia Caragea
16
12
0
17 Aug 2023
Test-Time Poisoning Attacks Against Test-Time Adaptation Models
Test-Time Poisoning Attacks Against Test-Time Adaptation Models
Tianshuo Cong
Xinlei He
Yun Shen
Yang Zhang
AAML
TTA
32
5
0
16 Aug 2023
DaMSTF: Domain Adversarial Learning Enhanced Meta Self-Training for
  Domain Adaptation
DaMSTF: Domain Adversarial Learning Enhanced Meta Self-Training for Domain Adaptation
Menglong Lu
Zhen Huang
Yunxiang Zhao
Zhiliang Tian
Yang Liu
Dongsheng Li
29
6
0
05 Aug 2023
Isolation and Induction: Training Robust Deep Neural Networks against
  Model Stealing Attacks
Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks
Jun Guo
Aishan Liu
Xingyu Zheng
Siyuan Liang
Yisong Xiao
Yichao Wu
Xianglong Liu
AAML
35
12
0
02 Aug 2023
Understanding Activation Patterns in Artificial Neural Networks by
  Exploring Stochastic Processes
Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes
S. Lehmler
Muhammad Saif-ur-Rehman
Tobias Glasmachers
Ioannis Iossifidis
24
0
0
01 Aug 2023
Are Transformers with One Layer Self-Attention Using Low-Rank Weight
  Matrices Universal Approximators?
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?
T. Kajitsuka
Issei Sato
31
16
0
26 Jul 2023
Learning to Segment from Noisy Annotations: A Spatial Correction
  Approach
Learning to Segment from Noisy Annotations: A Spatial Correction Approach
Jiacheng Yao
Yikai Zhang
Songzhu Zheng
Mayank Goswami
Prateek Prasanna
Chao Chen
41
15
0
21 Jul 2023
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To
  Achieve Better Generalization
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization
Kaiyue Wen
Zhiyuan Li
Tengyu Ma
FAtt
38
26
0
20 Jul 2023
Addressing caveats of neural persistence with deep graph persistence
Addressing caveats of neural persistence with deep graph persistence
Leander Girrbach
Anders Christensen
Ole Winther
Zeynep Akata
A. Sophia Koepke
GNN
25
1
0
20 Jul 2023
Deconstructing Data Reconstruction: Multiclass, Weight Decay and General
  Losses
Deconstructing Data Reconstruction: Multiclass, Weight Decay and General Losses
G. Buzaglo
Niv Haim
Gilad Yehudai
Gal Vardi
Yakir Oz
Yaniv Nikankin
Michal Irani
34
10
0
04 Jul 2023
Understanding quantum machine learning also requires rethinking
  generalization
Understanding quantum machine learning also requires rethinking generalization
Elies Gil-Fuster
Jens Eisert
Carlos Bravo-Prieto
35
45
0
23 Jun 2023
Precise Asymptotic Generalization for Multiclass Classification with Overparameterized Linear Models
Precise Asymptotic Generalization for Multiclass Classification with Overparameterized Linear Models
David X. Wu
A. Sahai
26
2
0
23 Jun 2023
Quantizable Transformers: Removing Outliers by Helping Attention Heads
  Do Nothing
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
15
88
0
22 Jun 2023
FedNoisy: Federated Noisy Label Learning Benchmark
FedNoisy: Federated Noisy Label Learning Benchmark
Siqi Liang
Jintao Huang
Junyuan Hong
Dun Zeng
Jiayu Zhou
Zenglin Xu
FedML
40
7
0
20 Jun 2023
Gibbs-Based Information Criteria and the Over-Parameterized Regime
Gibbs-Based Information Criteria and the Over-Parameterized Regime
Haobo Chen
Yuheng Bu
Greg Wornell
27
1
0
08 Jun 2023
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection
  Capability
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability
Jianing Zhu
Hengzhuang Li
Jiangchao Yao
Tongliang Liu
Jianliang Xu
Bo Han
OODD
43
12
0
06 Jun 2023
Proximity to Losslessly Compressible Parameters
Proximity to Losslessly Compressible Parameters
Matthew Farrugia-Roberts
30
0
0
05 Jun 2023
Memorization Capacity of Multi-Head Attention in Transformers
Memorization Capacity of Multi-Head Attention in Transformers
Sadegh Mahdavi
Renjie Liao
Christos Thrampoulidis
26
22
0
03 Jun 2023
Instance-dependent Noisy-label Learning with Graphical Model Based
  Noise-rate Estimation
Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation
Arpit Garg
Cuong C. Nguyen
Rafael Felix
Thanh-Toan Do
G. Carneiro
NoLa
35
1
0
31 May 2023
BadLabel: A Robust Perspective on Evaluating and Enhancing Label-noise
  Learning
BadLabel: A Robust Perspective on Evaluating and Enhancing Label-noise Learning
Jingfeng Zhang
Bo Song
Haohan Wang
Bo Han
Tongliang Liu
Lei Liu
Masashi Sugiyama
AAML
NoLa
32
14
0
28 May 2023
Generalization Guarantees of Gradient Descent for Multi-Layer Neural
  Networks
Generalization Guarantees of Gradient Descent for Multi-Layer Neural Networks
Puyu Wang
Yunwen Lei
Di Wang
Yiming Ying
Ding-Xuan Zhou
MLT
29
3
0
26 May 2023
Imprecise Label Learning: A Unified Framework for Learning with Various
  Imprecise Label Configurations
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations
Hao Chen
Ankit Shah
Jindong Wang
R. Tao
Yidong Wang
Xingxu Xie
Masashi Sugiyama
Rita Singh
Bhiksha Raj
37
12
0
22 May 2023
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in
  Natural Language Processing
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing
Tingting Wu
Xiao Ding
Minji Tang
Haotian Zhang
Bing Qin
Ting Liu
NoLa
34
9
0
18 May 2023
Small Models are Valuable Plug-ins for Large Language Models
Small Models are Valuable Plug-ins for Large Language Models
Canwen Xu
Yichong Xu
Shuohang Wang
Yang Liu
Chenguang Zhu
Julian McAuley
LLMAG
41
45
0
15 May 2023
Previous
12345...161718
Next