ResearchTrend.AI
Understanding deep learning requires rethinking generalization

10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
    HAI

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 926 papers shown
Title
Non-Vacuous Generalisation Bounds for Shallow Neural Networks
Felix Biggs
Benjamin Guedj
BDL
30
26
0
03 Feb 2022
On Regularizing Coordinate-MLPs
Sameera Ramasinghe
L. MacDonald
Simon Lucey
158
5
0
01 Feb 2022
Deep Layer-wise Networks Have Closed-Form Weights
Chieh-Tsai Wu
A. Masoomi
Arthur Gretton
Jennifer Dy
29
3
0
01 Feb 2022
Datamodels: Predicting Predictions from Training Data
Andrew Ilyas
Sung Min Park
Logan Engstrom
Guillaume Leclerc
A. Madry
TDI
47
131
0
01 Feb 2022
Backdoors Stuck At The Frontdoor: Multi-Agent Backdoor Attacks That Backfire
Siddhartha Datta
N. Shadbolt
AAML
32
7
0
28 Jan 2022
Interplay between depth of neural networks and locality of target functions
Takashi Mori
Masakuni Ueda
25
0
0
28 Jan 2022
Improved Overparametrization Bounds for Global Convergence of Stochastic Gradient Descent for Shallow Neural Networks
Bartlomiej Polaczyk
J. Cyranka
ODL
33
3
0
28 Jan 2022
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
Noam Razin
Asaf Maman
Nadav Cohen
46
29
0
27 Jan 2022
PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning
Haobo Wang
Rui Xiao
Yixuan Li
Lei Feng
Gang Niu
Gang Chen
J. Zhao
VLM
49
25
0
22 Jan 2022
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape
Devansh Bisla
Jing Wang
A. Choromańska
25
34
0
20 Jan 2022
Caring Without Sharing: A Federated Learning Crowdsensing Framework for Diversifying Representation of Cities
Mi-Gyoung Cho
A. Mashhadi
FedML
36
1
0
20 Jan 2022
BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding
Farhad Pakdaman
Mohammad Ali Adelimanesh
M. Hashemi
14
3
0
19 Jan 2022
Overview frequency principle/spectral bias in deep learning
Z. Xu
Yaoyu Zhang
Tao Luo
FaML
33
66
0
19 Jan 2022
Towards Adversarial Evaluations for Inexact Machine Unlearning
Shashwat Goel
Ameya Prabhu
Amartya Sanyal
Ser-Nam Lim
Philip Torr
Ponnurangam Kumaraguru
AAML
ELM
MU
32
47
0
17 Jan 2022
Neighborhood Region Smoothing Regularization for Finding Flat Minima In Deep Neural Networks
Yang Zhao
Hao Zhang
22
1
0
16 Jan 2022
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
34
101
0
15 Jan 2022
Model Stability with Continuous Data Updates
Huiting Liu
Avinesh P.V.S
Siddharth Patwardhan
Peter Grasch
Sachin Agarwal
29
16
0
14 Jan 2022
Reconstructing Training Data with Informed Adversaries
Borja Balle
Giovanni Cherubin
Jamie Hayes
MIACV
AAML
43
158
0
13 Jan 2022
Implicit Bias of MSE Gradient Optimization in Underparameterized Neural Networks
Benjamin Bowman
Guido Montúfar
28
11
0
12 Jan 2022
Stability Based Generalization Bounds for Exponential Family Langevin Dynamics
A. Banerjee
Tiancong Chen
Xinyan Li
Yingxue Zhou
34
8
0
09 Jan 2022
Separation of Scales and a Thermodynamic Description of Feature Learning in Some CNNs
Inbar Seroussi
Gadi Naveh
Zohar Ringel
35
51
0
31 Dec 2021
Benign Overfitting in Adversarially Robust Linear Classification
Jinghui Chen
Yuan Cao
Quanquan Gu
AAML
SILM
34
10
0
31 Dec 2021
Efficient Diversity-Driven Ensemble for Deep Neural Networks
Wentao Zhang
Jiawei Jiang
Yingxia Shao
Bin Cui
21
17
0
26 Dec 2021
Over-Parametrized Matrix Factorization in the Presence of Spurious Stationary Points
Armin Eftekhari
24
1
0
25 Dec 2021
On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training
Chen Liu
Zhichao Huang
Mathieu Salzmann
Tong Zhang
Sabine Süsstrunk
AAML
23
13
0
14 Dec 2021
Deep Translation Prior: Test-time Training for Photorealistic Style Transfer
Sunwoo Kim
Soohyun Kim
Seungryong Kim
24
13
0
12 Dec 2021
A generalization gap estimation for overparameterized models via the Langevin functional variance
Akifumi Okuno
Keisuke Yano
41
1
0
07 Dec 2021
Multi-scale Feature Learning Dynamics: Insights for Double Descent
Mohammad Pezeshki
Amartya Mitra
Yoshua Bengio
Guillaume Lajoie
61
25
0
06 Dec 2021
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark
Xuanli He
Qiongkai Xu
Lingjuan Lyu
Fangzhao Wu
Chenguang Wang
WaLM
177
95
0
05 Dec 2021
Hard Sample Aware Noise Robust Learning for Histopathology Image Classification
Chuang Zhu
Wenkai Chen
T. Peng
Ying Wang
M. Jin
NoLa
34
72
0
05 Dec 2021
Construct Informative Triplet with Two-stage Hard-sample Generation
Chuang Zhu
Zheng Hu
Huihui Dong
Gang He
Zekuan Yu
Shangshang Zhang
36
3
0
04 Dec 2021
Novel Class Discovery in Semantic Segmentation
Yuyang Zhao
Zhun Zhong
N. Sebe
G. Lee
30
27
0
03 Dec 2021
Learning Curves for Continual Learning in Neural Networks: Self-Knowledge Transfer and Forgetting
Ryo Karakida
S. Akaho
CLL
32
11
0
03 Dec 2021
Embedding Principle: a hierarchical structure of loss landscape of deep neural networks
Yaoyu Zhang
Yuqing Li
Zhongwang Zhang
Tao Luo
Z. Xu
29
22
0
30 Nov 2021
The Geometric Occam's Razor Implicit in Deep Learning
Benoit Dherin
Michael Munn
David Barrett
22
6
0
30 Nov 2021
Deep Probability Estimation
Sheng Liu
Aakash Kaku
Weicheng Zhu
M. Leibovich
S. Mohan
...
Haoxiang Huang
L. Zanna
N. Razavian
Jonathan Niles-Weed
C. Fernandez‐Granda
UQCV
OOD
28
14
0
21 Nov 2021
DICE: Leveraging Sparsification for Out-of-Distribution Detection
Yiyou Sun
Yixuan Li
OODD
38
151
0
18 Nov 2021
Constrained Instance and Class Reweighting for Robust Learning under Label Noise
Abhishek Kumar
Ehsan Amid
NoLa
32
19
0
09 Nov 2021
Data Augmentation Can Improve Robustness
Sylvestre-Alvise Rebuffi
Sven Gowal
D. A. Calian
Florian Stimberg
Olivia Wiles
Timothy A. Mann
AAML
17
270
0
09 Nov 2021
MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps
Muhammad Awais
Fengwei Zhou
Chuanlong Xie
Jiawei Li
Sung-Ho Bae
Zhenguo Li
AAML
43
17
0
09 Nov 2021
Information-Theoretic Bayes Risk Lower Bounds for Realizable Models
M. Nokleby
Ahmad Beirami
59
1
0
08 Nov 2021
Improved Regularization and Robustness for Fine-tuning in Neural Networks
Dongyue Li
Hongyang R. Zhang
NoLa
55
54
0
08 Nov 2021
Exponential escape efficiency of SGD from sharp minima in non-stationary regime
Hikaru Ibayashi
Masaaki Imaizumi
34
4
0
07 Nov 2021
Understanding Layer-wise Contributions in Deep Neural Networks through Spectral Analysis
Yatin Dandi
Arthur Jacot
FAtt
26
4
0
06 Nov 2021
Visualizing the Emergence of Intermediate Visual Patterns in DNNs
Mingjie Li
Shaobo Wang
Quanshi Zhang
32
11
0
05 Nov 2021
An Explanation of In-context Learning as Implicit Bayesian Inference
Sang Michael Xie
Aditi Raghunathan
Percy Liang
Tengyu Ma
ReLM
BDL
VPVLM
LRM
56
695
0
03 Nov 2021
Subquadratic Overparameterization for Shallow Neural Networks
Chaehwan Song
Ali Ramezani-Kebrya
Thomas Pethick
Armin Eftekhari
V. Cevher
30
31
0
02 Nov 2021
Mixture Proportion Estimation and PU Learning: A Modern Approach
Saurabh Garg
Yifan Wu
Alexander J. Smola
Sivaraman Balakrishnan
Zachary Chase Lipton
24
52
0
01 Nov 2021
Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Midia Yousefi
John H. L. Hansen
28
10
0
30 Oct 2021
Neural Networks as Kernel Learners: The Silent Alignment Effect
Alexander B. Atanasov
Blake Bordelon
Cengiz Pehlevan
MLT
26
75
0
29 Oct 2021