Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1701.06548
Cited By
Regularizing Neural Networks by Penalizing Confident Output Distributions
23 January 2017
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regularizing Neural Networks by Penalizing Confident Output Distributions"
50 / 640 papers shown
Title
Unimodal-uniform Constrained Wasserstein Training for Medical Diagnosis
Xiaofeng Liu
Xu Han
Yukai Qiao
Yi Ge
Lu Jun
24
32
0
03 Nov 2019
Conservative Wasserstein Training for Pose Estimation
Xiaofeng Liu
Yang Zou
Tong Che
Peng Ding
P. Jia
J. You
Kumar B.V.K
31
33
0
03 Nov 2019
Hierarchical Expert Networks for Meta-Learning
Heinke Hihn
Daniel A. Braun
33
4
0
31 Oct 2019
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
41
10,635
0
29 Oct 2019
A practical two-stage training strategy for multi-stream end-to-end speech recognition
Ruizhi Li
Gregory Sell
Xiaofei Wang
Shinji Watanabe
H. Hermansky
24
7
0
23 Oct 2019
On Warm-Starting Neural Network Training
Jordan T. Ash
Ryan P. Adams
AI4CE
31
21
0
18 Oct 2019
Transformers without Tears: Improving the Normalization of Self-Attention
Toan Q. Nguyen
Julian Salazar
50
225
0
14 Oct 2019
Noise as a Resource for Learning in Knowledge Distillation
Elahe Arani
F. Sarfraz
Bahram Zonooz
18
6
0
11 Oct 2019
Classification As Decoder: Trading Flexibility For Control In Neural Dialogue
Sam Shleifer
Manish Chablani
Namit Katariya
Anitha Kannan
X. Amatriain
28
0
0
04 Oct 2019
Improving Word Embedding Factorization for Compression Using Distilled Nonlinear Neural Decomposition
Vasileios Lioutas
Ahmad Rashid
Krtin Kumar
Md. Akmal Haidar
Mehdi Rezagholizadeh
29
9
0
02 Oct 2019
Well-calibrated Model Uncertainty with Temperature Scaling for Dropout Variational Inference
M. Laves
Sontje Ihler
Karl-Philipp Kortmann
T. Ortmaier
UQCV
17
54
0
30 Sep 2019
Revisiting Knowledge Distillation via Label Smoothing Regularization
Li-xin Yuan
Francis E. H. Tay
Guilin Li
Tao Wang
Jiashi Feng
25
90
0
25 Sep 2019
Synthetic Data for Deep Learning
Sergey I. Nikolenko
48
349
0
25 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
33
73
0
18 Sep 2019
Beyond BLEU: Training Neural Machine Translation with Semantic Similarity
John Wieting
Taylor Berg-Kirkpatrick
Kevin Gimpel
Graham Neubig
AAML
30
163
0
14 Sep 2019
Knowledge Transfer Graph for Deep Collaborative Learning
Soma Minami
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
30
9
0
10 Sep 2019
Self-Teaching Networks
Liang Lu
Eric Sun
Jiawei Liu
SSL
14
4
0
09 Sep 2019
Improving Back-Translation with Uncertainty-based Confidence Estimation
Shuo Wang
Yang Liu
Chao Wang
Huanbo Luan
Maosong Sun
UQLM
39
81
0
31 Aug 2019
Bin-wise Temperature Scaling (BTS): Improvement in Confidence Calibration Performance through Simple Scaling Techniques
Byeongmoon Ji
Hyemin Jung
Jihyeun Yoon
Kyungyul Kim
Younghak Shin
UQCV
27
24
0
30 Aug 2019
Confidence Regularized Self-Training
Yang Zou
Zhiding Yu
Xiaofeng Liu
B. Kumar
Jinsong Wang
233
790
0
26 Aug 2019
Calibration of Deep Probabilistic Models with Decoupled Bayesian Neural Networks
Juan Maroñas
Roberto Paredes Palacios
D. Ramos-Castro
UQCV
BDL
16
22
0
23 Aug 2019
Symmetric Cross Entropy for Robust Learning with Noisy Labels
Yisen Wang
Xingjun Ma
Zaiyi Chen
Yuan Luo
Jinfeng Yi
James Bailey
NoLa
39
880
0
16 Aug 2019
Entropic Out-of-Distribution Detection
David Macêdo
T. I. Ren
Cleber Zanchettin
Adriano Oliveira
Teresa B Ludermir
OODD
UQCV
25
32
0
15 Aug 2019
Adaptive Regularization of Labels
Qianggang Ding
Sifan Wu
Hao Sun
Jiadong Guo
Shutao Xia
ODL
24
29
0
15 Aug 2019
Towards Making the Most of BERT in Neural Machine Translation
Jiacheng Yang
Mingxuan Wang
Hao Zhou
Chengqi Zhao
Yong Yu
Weinan Zhang
Lei Li
CLL
21
156
0
15 Aug 2019
Distance Map Loss Penalty Term for Semantic Segmentation
Francesco Calivá
C. Iriondo
A. M. Martinez
S. Majumdar
V. Pedoia
MedIm
24
79
0
10 Aug 2019
Unsupervised Domain Adaptation via Calibrating Uncertainties
Ligong Han
Yang Zou
Ruijiang Gao
Lezi Wang
Dimitris N. Metaxas
21
30
0
25 Jul 2019
AugLabel: Exploiting Word Representations to Augment Labels for Face Attribute Classification
Binod Bhattarai
Rumeysa Bodur
Tae-Kyun Kim
CVBM
27
5
0
15 Jul 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
30
38
0
13 Jul 2019
Privileged Features Distillation at Taobao Recommendations
Chen Xu
Quan Li
Junfeng Ge
Jinyang Gao
Xiaoyong Yang
Changhua Pei
Fei Sun
Jian Wu
Hanxiao Sun
Wenwu Ou
15
67
0
11 Jul 2019
Applying a Pre-trained Language Model to Spanish Twitter Humor Prediction
Bobak Farzin
Piotr Czapla
Jeremy Howard
16
7
0
06 Jul 2019
Adversarial Robustness via Label-Smoothing
Morgane Goibert
Elvis Dohmatob
AAML
10
18
0
27 Jun 2019
Generalizing Back-Translation in Neural Machine Translation
Miguel Graça
Yunsu Kim
Julian Schamper
Shahram Khadivi
Hermann Ney
17
48
0
17 Jun 2019
Membership Privacy for Machine Learning Models Through Knowledge Transfer
Virat Shejwalkar
Amir Houmansadr
22
10
0
15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation
A. Kuncoro
Chris Dyer
Laura Rimell
S. Clark
Phil Blunsom
35
26
0
14 Jun 2019
A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning
Gonçalo M. Correia
André F. T. Martins
19
42
0
14 Jun 2019
Non-Parametric Calibration for Classification
Jonathan Wenger
Hedvig Kjellström
Rudolph Triebel
UQCV
45
79
0
12 Jun 2019
Using Small Proxy Datasets to Accelerate Hyperparameter Search
Sam Shleifer
Eric Prokop
DD
12
22
0
12 Jun 2019
Deep Visual Re-Identification with Confidence
George Adaimi
S. Kreiss
Alexandre Alahi
22
8
0
11 Jun 2019
Improved Adversarial Robustness via Logit Regularization Methods
Cecilia Summers
M. Dinneen
AAML
36
7
0
10 Jun 2019
Outlier Exposure with Confidence Control for Out-of-Distribution Detection
Aristotelis-Angelos Papadopoulos
Mohammad Reza Rajati
Nazim Shaikh
Jiamian Wang
OODD
19
1
0
08 Jun 2019
When Does Label Smoothing Help?
Rafael Müller
Simon Kornblith
Geoffrey E. Hinton
UQCV
61
1,913
0
06 Jun 2019
Training Neural Response Selection for Task-Oriented Dialogue Systems
Matthew Henderson
Ivan Vulić
D. Gerz
I. Casanueva
Paweł Budzianowski
Sam Coope
Georgios P. Spithourakis
Tsung-Hsien Wen
N. Mrksic
Pei-hao Su
14
110
0
04 Jun 2019
Meta Dropout: Learning to Perturb Features for Generalization
Haebeom Lee
Taewook Nam
Eunho Yang
Sung Ju Hwang
OOD
22
3
0
30 May 2019
Improved Training Speed, Accuracy, and Data Utilization Through Loss Function Optimization
Santiago Gonzalez
Risto Miikkulainen
27
75
0
27 May 2019
On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks
S. Thulasidasan
Gopinath Chennupati
J. Bilmes
Tanmoy Bhattacharya
S. Michalak
UQCV
34
529
0
27 May 2019
Adversarial Distillation for Ordered Top-k Attacks
Zekun Zhang
Tianfu Wu
AAML
11
2
0
25 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
29
80
0
21 May 2019
Learning to Groove with Inverse Sequence Transformations
Jon Gillick
Adam Roberts
Jesse Engel
Douglas Eck
David Bamman
SLR
BDL
21
81
0
14 May 2019
Anonymized BERT: An Augmentation Approach to the Gendered Pronoun Resolution Challenge
Bo Liu
26
7
0
06 May 2019
Previous
1
2
3
...
10
11
12
13
Next