Regularizing Neural Networks by Penalizing Confident Output Distributions

23 January 2017

Papers citing "Regularizing Neural Networks by Penalizing Confident Output Distributions"

50 / 640 papers shown

Unimodal-uniform Constrained Wasserstein Training for Medical Diagnosis Xiaofeng Liu Xu Han Yukai Qiao Yi Ge Lu Jun 24 32 0 03 Nov 2019
Conservative Wasserstein Training for Pose Estimation Xiaofeng Liu Yang Zou Tong Che Peng Ding P. Jia J. You Kumar B.V.K 31 33 0 03 Nov 2019
Hierarchical Expert Networks for Meta-Learning Heinke Hihn Daniel A. Braun 33 4 0 31 Oct 2019
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension M. Lewis Yinhan Liu Naman Goyal Marjan Ghazvininejad Abdel-rahman Mohamed Omer Levy Veselin Stoyanov Luke Zettlemoyer AIMat VLM 41 10,635 0 29 Oct 2019
A practical two-stage training strategy for multi-stream end-to-end speech recognition Ruizhi Li Gregory Sell Xiaofei Wang Shinji Watanabe H. Hermansky 24 7 0 23 Oct 2019
On Warm-Starting Neural Network Training Jordan T. Ash Ryan P. Adams AI4CE 31 21 0 18 Oct 2019
Transformers without Tears: Improving the Normalization of Self-Attention Toan Q. Nguyen Julian Salazar 50 225 0 14 Oct 2019
Noise as a Resource for Learning in Knowledge Distillation Elahe Arani F. Sarfraz Bahram Zonooz 18 6 0 11 Oct 2019
Classification As Decoder: Trading Flexibility For Control In Neural Dialogue Sam Shleifer Manish Chablani Namit Katariya Anitha Kannan X. Amatriain 28 0 0 04 Oct 2019
Improving Word Embedding Factorization for Compression Using Distilled Nonlinear Neural Decomposition Vasileios Lioutas Ahmad Rashid Krtin Kumar Md. Akmal Haidar Mehdi Rezagholizadeh 29 9 0 02 Oct 2019
Well-calibrated Model Uncertainty with Temperature Scaling for Dropout Variational Inference M. Laves Sontje Ihler Karl-Philipp Kortmann T. Ortmaier UQCV 17 54 0 30 Sep 2019
Revisiting Knowledge Distillation via Label Smoothing Regularization Li-xin Yuan Francis E. H. Tay Guilin Li Tao Wang Jiashi Feng 25 90 0 25 Sep 2019
Synthetic Data for Deep Learning Sergey I. Nikolenko 48 349 0 25 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Yiming Wang Tongfei Chen Hainan Xu Shuoyang Ding Hang Lv Yiwen Shao Nanyun Peng Lei Xie Shinji Watanabe Sanjeev Khudanpur VLM 33 73 0 18 Sep 2019
Beyond BLEU: Training Neural Machine Translation with Semantic Similarity John Wieting Taylor Berg-Kirkpatrick Kevin Gimpel Graham Neubig AAML 30 163 0 14 Sep 2019
Knowledge Transfer Graph for Deep Collaborative Learning Soma Minami Tsubasa Hirakawa Takayoshi Yamashita H. Fujiyoshi 30 9 0 10 Sep 2019
Self-Teaching Networks Liang Lu Eric Sun Jiawei Liu SSL 14 4 0 09 Sep 2019
Improving Back-Translation with Uncertainty-based Confidence Estimation Shuo Wang Yang Liu Chao Wang Huanbo Luan Maosong Sun UQLM 39 81 0 31 Aug 2019
Bin-wise Temperature Scaling (BTS): Improvement in Confidence Calibration Performance through Simple Scaling Techniques Byeongmoon Ji Hyemin Jung Jihyeun Yoon Kyungyul Kim Younghak Shin UQCV 27 24 0 30 Aug 2019
Confidence Regularized Self-Training Yang Zou Zhiding Yu Xiaofeng Liu B. Kumar Jinsong Wang 233 790 0 26 Aug 2019
Calibration of Deep Probabilistic Models with Decoupled Bayesian Neural Networks Juan Maroñas Roberto Paredes Palacios D. Ramos-Castro UQCV BDL 16 22 0 23 Aug 2019
Symmetric Cross Entropy for Robust Learning with Noisy Labels Yisen Wang Xingjun Ma Zaiyi Chen Yuan Luo Jinfeng Yi James Bailey NoLa 39 880 0 16 Aug 2019
Entropic Out-of-Distribution Detection David Macêdo T. I. Ren Cleber Zanchettin Adriano Oliveira Teresa B Ludermir OODD UQCV 25 32 0 15 Aug 2019
Adaptive Regularization of Labels Qianggang Ding Sifan Wu Hao Sun Jiadong Guo Shutao Xia ODL 24 29 0 15 Aug 2019
Towards Making the Most of BERT in Neural Machine Translation Jiacheng Yang Mingxuan Wang Hao Zhou Chengqi Zhao Yong Yu Weinan Zhang Lei Li CLL 21 156 0 15 Aug 2019
Distance Map Loss Penalty Term for Semantic Segmentation Francesco Calivá C. Iriondo A. M. Martinez S. Majumdar V. Pedoia MedIm 24 79 0 10 Aug 2019
Unsupervised Domain Adaptation via Calibrating Uncertainties Ligong Han Yang Zou Ruijiang Gao Lezi Wang Dimitris N. Metaxas 21 30 0 25 Jul 2019
AugLabel: Exploiting Word Representations to Augment Labels for Face Attribute Classification Binod Bhattarai Rumeysa Bodur Tae-Kyun Kim CVBM 27 5 0 15 Jul 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition Ye Bai Jiangyan Yi J. Tao Zhengkun Tian Zhengqi Wen KELM 30 38 0 13 Jul 2019
Privileged Features Distillation at Taobao Recommendations Chen Xu Quan Li Junfeng Ge Jinyang Gao Xiaoyong Yang Changhua Pei Fei Sun Jian Wu Hanxiao Sun Wenwu Ou 15 67 0 11 Jul 2019
Applying a Pre-trained Language Model to Spanish Twitter Humor Prediction Bobak Farzin Piotr Czapla Jeremy Howard 16 7 0 06 Jul 2019
Adversarial Robustness via Label-Smoothing Morgane Goibert Elvis Dohmatob AAML 10 18 0 27 Jun 2019
Generalizing Back-Translation in Neural Machine Translation Miguel Graça Yunsu Kim Julian Schamper Shahram Khadivi Hermann Ney 17 48 0 17 Jun 2019
Membership Privacy for Machine Learning Models Through Knowledge Transfer Virat Shejwalkar Amir Houmansadr 22 10 0 15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation A. Kuncoro Chris Dyer Laura Rimell S. Clark Phil Blunsom 35 26 0 14 Jun 2019
A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning Gonçalo M. Correia André F. T. Martins 19 42 0 14 Jun 2019
Non-Parametric Calibration for Classification Jonathan Wenger Hedvig Kjellström Rudolph Triebel UQCV 45 79 0 12 Jun 2019
Using Small Proxy Datasets to Accelerate Hyperparameter Search Sam Shleifer Eric Prokop DD 12 22 0 12 Jun 2019
Deep Visual Re-Identification with Confidence George Adaimi S. Kreiss Alexandre Alahi 22 8 0 11 Jun 2019
Improved Adversarial Robustness via Logit Regularization Methods Cecilia Summers M. Dinneen AAML 36 7 0 10 Jun 2019
Outlier Exposure with Confidence Control for Out-of-Distribution Detection Aristotelis-Angelos Papadopoulos Mohammad Reza Rajati Nazim Shaikh Jiamian Wang OODD 19 1 0 08 Jun 2019
When Does Label Smoothing Help? Rafael Müller Simon Kornblith Geoffrey E. Hinton UQCV 61 1,913 0 06 Jun 2019
Training Neural Response Selection for Task-Oriented Dialogue Systems Matthew Henderson Ivan Vulić D. Gerz I. Casanueva Paweł Budzianowski Sam Coope Georgios P. Spithourakis Tsung-Hsien Wen N. Mrksic Pei-hao Su 14 110 0 04 Jun 2019
Meta Dropout: Learning to Perturb Features for Generalization Haebeom Lee Taewook Nam Eunho Yang Sung Ju Hwang OOD 22 3 0 30 May 2019
Improved Training Speed, Accuracy, and Data Utilization Through Loss Function Optimization Santiago Gonzalez Risto Miikkulainen 27 75 0 27 May 2019
On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks S. Thulasidasan Gopinath Chennupati J. Bilmes Tanmoy Bhattacharya S. Michalak UQCV 34 529 0 27 May 2019
Adversarial Distillation for Ordered Top-k Attacks Zekun Zhang Tianfu Wu AAML 11 2 0 25 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning Rui Zhao Xudong Sun Volker Tresp 29 80 0 21 May 2019
Learning to Groove with Inverse Sequence Transformations Jon Gillick Adam Roberts Jesse Engel Douglas Eck David Bamman SLR BDL 21 81 0 14 May 2019
Anonymized BERT: An Augmentation Approach to the Gendered Pronoun Resolution Challenge Bo Liu 26 7 0 06 May 2019