ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.03530
  4. Cited By
Understanding deep learning requires rethinking generalization

Understanding deep learning requires rethinking generalization

10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
    HAI
ArXivPDFHTML

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 926 papers shown
Title
Object Detection as a Positive-Unlabeled Problem
Object Detection as a Positive-Unlabeled Problem
Yuewei Yang
Kevin J Liang
Lawrence Carin
21
37
0
11 Feb 2020
A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient
  Descent Exponentially Favors Flat Minima
A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima
Zeke Xie
Issei Sato
Masashi Sugiyama
ODL
28
17
0
10 Feb 2020
Semi-Supervised Class Discovery
Semi-Supervised Class Discovery
Jeremy Nixon
J. Liu
David Berthelot
20
2
0
10 Feb 2020
Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural
  Networks
Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural Networks
Blake Bordelon
Abdulkadir Canatar
Cengiz Pehlevan
146
201
0
07 Feb 2020
A Precise High-Dimensional Asymptotic Theory for Boosting and
  Minimum-$\ell_1$-Norm Interpolated Classifiers
A Precise High-Dimensional Asymptotic Theory for Boosting and Minimum-ℓ1\ell_1ℓ1​-Norm Interpolated Classifiers
Tengyuan Liang
Pragya Sur
35
68
0
05 Feb 2020
TDEFSI: Theory Guided Deep Learning Based Epidemic Forecasting with
  Synthetic Information
TDEFSI: Theory Guided Deep Learning Based Epidemic Forecasting with Synthetic Information
Lijing Wang
Jiangzhuo Chen
Madhav Marathe
AI4TS
31
19
0
28 Jan 2020
Optimized Generic Feature Learning for Few-shot Classification across
  Domains
Optimized Generic Feature Learning for Few-shot Classification across Domains
Tonmoy Saikia
Thomas Brox
Cordelia Schmid
VLM
30
48
0
22 Jan 2020
Memory capacity of neural networks with threshold and ReLU activations
Memory capacity of neural networks with threshold and ReLU activations
Roman Vershynin
31
21
0
20 Jan 2020
Compounding the Performance Improvements of Assembled Techniques in a
  Convolutional Neural Network
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network
Jungkyu Lee
Taeryun Won
Tae Kwan Lee
Hyemin Lee
Geonmo Gu
K. Hong
34
57
0
17 Jan 2020
Rethinking Generalization of Neural Models: A Named Entity Recognition
  Case Study
Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study
Jinlan Fu
Pengfei Liu
Qi Zhang
Xuanjing Huang
AI4CE
33
73
0
12 Jan 2020
Confidence Scores Make Instance-dependent Label-noise Learning Possible
Confidence Scores Make Instance-dependent Label-noise Learning Possible
Antonin Berthon
Bo Han
Gang Niu
Tongliang Liu
Masashi Sugiyama
NoLa
37
104
0
11 Jan 2020
Identifying and Compensating for Feature Deviation in Imbalanced Deep
  Learning
Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning
Han-Jia Ye
Hong-You Chen
De-Chuan Zhan
Wei-Lun Chao
32
99
0
06 Jan 2020
Robust Aggregation for Federated Learning
Robust Aggregation for Federated Learning
Krishna Pillutla
Sham Kakade
Zaïd Harchaoui
FedML
37
629
0
31 Dec 2019
Measuring Dataset Granularity
Measuring Dataset Granularity
Huayu Chen
Zeqi Gu
D. Mahajan
L. V. D. van der Maaten
Serge J. Belongie
Ser-Nam Lim
29
13
0
21 Dec 2019
Locality and compositionality in zero-shot learning
Locality and compositionality in zero-shot learning
Tristan Sylvain
Linda Petrini
R. Devon Hjelm
24
56
0
20 Dec 2019
Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks
Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks
Luca Bertinetto
Romain Mueller
Konstantinos Tertikas
Sina Samangooei
Nicholas A. Lord
OOD
23
131
0
19 Dec 2019
Optimization for deep learning: theory and algorithms
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
25
168
0
19 Dec 2019
A Shape Transformation-based Dataset Augmentation Framework for
  Pedestrian Detection
A Shape Transformation-based Dataset Augmentation Framework for Pedestrian Detection
Zhe Chen
Wanli Ouyang
Tongliang Liu
Dacheng Tao
ViT
24
23
0
15 Dec 2019
In Defense of Uniform Convergence: Generalization via derandomization
  with an application to interpolating predictors
In Defense of Uniform Convergence: Generalization via derandomization with an application to interpolating predictors
Jeffrey Negrea
Gintare Karolina Dziugaite
Daniel M. Roy
AI4CE
40
64
0
09 Dec 2019
Deep learning with noisy labels: exploring techniques and remedies in
  medical image analysis
Deep learning with noisy labels: exploring techniques and remedies in medical image analysis
Davood Karimi
Haoran Dou
Simon K. Warfield
Ali Gholipour
NoLa
24
535
0
05 Dec 2019
Insights into Ordinal Embedding Algorithms: A Systematic Evaluation
Insights into Ordinal Embedding Algorithms: A Systematic Evaluation
L. C. Vankadara
Siavash Haghiri
Michael Lohaus
Faiz Ul Wahab
U. V. Luxburg
15
7
0
03 Dec 2019
Towards Understanding the Spectral Bias of Deep Learning
Towards Understanding the Spectral Bias of Deep Learning
Yuan Cao
Zhiying Fang
Yue Wu
Ding-Xuan Zhou
Quanquan Gu
41
214
0
03 Dec 2019
Forecasting significant stock price changes using neural networks
Forecasting significant stock price changes using neural networks
F. Kamalov
AIFin
11
80
0
21 Nov 2019
Robustness Certificates for Sparse Adversarial Attacks by Randomized
  Ablation
Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation
Alexander Levine
S. Feizi
AAML
28
104
0
21 Nov 2019
Collaborative Unsupervised Domain Adaptation for Medical Image Diagnosis
Collaborative Unsupervised Domain Adaptation for Medical Image Diagnosis
Yifan Zhang
Ying Wei
P. Zhao
Shuaicheng Niu
Qingyao Wu
Mingkui Tan
Junzhou Huang
OOD
26
145
0
17 Nov 2019
Understanding and Improving Layer Normalization
Understanding and Improving Layer Normalization
Jingjing Xu
Xu Sun
Zhiyuan Zhang
Guangxiang Zhao
Junyang Lin
FAtt
18
340
0
16 Nov 2019
Rate-Regularization and Generalization in VAEs
Rate-Regularization and Generalization in VAEs
Alican Bozkurt
Babak Esmaeili
Jean-Baptiste Tristan
Dana H. Brooks
Jennifer G. Dy
Jan-Willem van de Meent
DRL
27
7
0
11 Nov 2019
Rethinking Generalisation
Rethinking Generalisation
Antonia Marcu
Adam Prugel-Bennett
14
0
0
11 Nov 2019
An empirical study of the relation between network architecture and
  complexity
An empirical study of the relation between network architecture and complexity
Emir Konuk
Kevin Smith
11
7
0
11 Nov 2019
Location Attention for Extrapolation to Longer Sequences
Location Attention for Extrapolation to Longer Sequences
Yann Dubois
Gautier Dagan
Dieuwke Hupkes
Elia Bruni
23
40
0
10 Nov 2019
Global Convergence of Gradient Descent for Deep Linear Residual Networks
Global Convergence of Gradient Descent for Deep Linear Residual Networks
Lei Wu
Qingcan Wang
Chao Ma
ODL
AI4CE
28
22
0
02 Nov 2019
Confident Learning: Estimating Uncertainty in Dataset Labels
Confident Learning: Estimating Uncertainty in Dataset Labels
Curtis G. Northcutt
Lu Jiang
Isaac L. Chuang
NoLa
38
674
0
31 Oct 2019
On Generalization Bounds of a Family of Recurrent Neural Networks
On Generalization Bounds of a Family of Recurrent Neural Networks
Minshuo Chen
Xingguo Li
T. Zhao
19
70
0
28 Oct 2019
Mixup-breakdown: a consistency training method for improving
  generalization of speech separation models
Mixup-breakdown: a consistency training method for improving generalization of speech separation models
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
33
22
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement
  Learning
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Zihan Wang
Che Wang
Yanqiu Wu
Keith Ross
OffRL
27
120
0
27 Oct 2019
LUTNet: Learning FPGA Configurations for Highly Efficient Neural Network
  Inference
LUTNet: Learning FPGA Configurations for Highly Efficient Neural Network Inference
Erwei Wang
James J. Davis
P. Cheung
George A. Constantinides
MQ
9
41
0
24 Oct 2019
From complex to simple : hierarchical free-energy landscape renormalized
  in deep neural networks
From complex to simple : hierarchical free-energy landscape renormalized in deep neural networks
H. Yoshino
19
6
0
22 Oct 2019
Robust Training with Ensemble Consensus
Robust Training with Ensemble Consensus
Jisoo Lee
Sae-Young Chung
NoLa
22
28
0
22 Oct 2019
MUTE: Data-Similarity Driven Multi-hot Target Encoding for Neural
  Network Design
MUTE: Data-Similarity Driven Multi-hot Target Encoding for Neural Network Design
Mayoore S. Jaiswal
Bumboo Kang
Jinho Lee
Minsik Cho
16
2
0
15 Oct 2019
The Local Elasticity of Neural Networks
The Local Elasticity of Neural Networks
Hangfeng He
Weijie J. Su
40
44
0
15 Oct 2019
Improved Sample Complexities for Deep Networks and Robust Classification
  via an All-Layer Margin
Improved Sample Complexities for Deep Networks and Robust Classification via an All-Layer Margin
Colin Wei
Tengyu Ma
AAML
OOD
36
85
0
09 Oct 2019
Algorithm-Dependent Generalization Bounds for Overparameterized Deep
  Residual Networks
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks
Spencer Frei
Yuan Cao
Quanquan Gu
ODL
9
31
0
07 Oct 2019
Meta-Learning Deep Energy-Based Memory Models
Meta-Learning Deep Energy-Based Memory Models
Sergey Bartunov
Jack W. Rae
Simon Osindero
Timothy Lillicrap
43
34
0
07 Oct 2019
SELF: Learning to Filter Noisy Labels with Self-Ensembling
SELF: Learning to Filter Noisy Labels with Self-Ensembling
Philipp Kratzer
Marc Toussaint
Thi Phuong Nhung Ngo
T. Nguyen
Jim Mainprice
Thomas Brox
NoLa
39
308
0
04 Oct 2019
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
Sanjeev Arora
S. Du
Zhiyuan Li
Ruslan Salakhutdinov
Ruosong Wang
Dingli Yu
AAML
19
161
0
03 Oct 2019
Partial differential equation regularization for supervised machine
  learning
Partial differential equation regularization for supervised machine learning
Jillian R. Fisher
27
2
0
03 Oct 2019
How does topology influence gradient propagation and model performance
  of deep networks with DenseNet-type skip connections?
How does topology influence gradient propagation and model performance of deep networks with DenseNet-type skip connections?
Kartikeya Bhardwaj
Guihong Li
R. Marculescu
38
1
0
02 Oct 2019
A Closer Look at Data Bias in Neural Extractive Summarization Models
A Closer Look at Data Bias in Neural Extractive Summarization Models
Ming Zhong
Danqing Wang
Pengfei Liu
Xipeng Qiu
Xuanjing Huang
40
42
0
30 Sep 2019
Overparameterized Neural Networks Implement Associative Memory
Overparameterized Neural Networks Implement Associative Memory
Adityanarayanan Radhakrishnan
M. Belkin
Caroline Uhler
BDL
35
71
0
26 Sep 2019
Adversarial Deep Embedded Clustering: on a better trade-off between
  Feature Randomness and Feature Drift
Adversarial Deep Embedded Clustering: on a better trade-off between Feature Randomness and Feature Drift
Nairouz Mrabah
Mohamed Bouguessa
Riadh Ksantini
OOD
24
53
0
26 Sep 2019
Previous
123...121314...171819
Next