1701.06548
Cited By
Regularizing Neural Networks by Penalizing Confident Output Distributions
23 January 2017
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
Papers citing
"Regularizing Neural Networks by Penalizing Confident Output Distributions"
50 / 640 papers shown
Title
Searching for Robustness: Loss Learning for Noisy Classification Tasks
Boyan Gao
Henry Gouk
Timothy M. Hospedales
OOD
NoLa
36
18
0
27 Feb 2021
Siamese Labels Auxiliary Learning
Wenrui Gan
Zhulin Liu
Chong Chen
Tong Zhang
25
1
0
27 Feb 2021
IIE-NLP-Eyas at SemEval-2021 Task 4: Enhancing PLM for ReCAM with Special Tokens, Re-Ranking, Siamese Encoders and Back Translation
Yuqiang Xie
Luxi Xing
Wei Peng
Yue Hu
15
4
0
25 Feb 2021
Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation
Shaoxiong Feng
Xuancheng Ren
Kan Li
Xu Sun
24
11
0
22 Feb 2021
Sample Efficient Learning of Image-Based Diagnostic Classifiers Using Probabilistic Labels
Roberto Vega
Pouneh Gorji
Zichen Zhang
Xuebin Qin
A. Hareendranathan
J. Kapur
Jacob L. Jaremko
Russell Greiner
14
3
0
11 Feb 2021
Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection
Dara Bahri
Heinrich Jiang
Yi Tay
Donald Metzler
OODD
23
3
0
09 Feb 2021
Concentrated Document Topic Model
Hao Lei
Ying Chen
14
1
0
06 Feb 2021
On the Reproducibility of Neural Network Predictions
Srinadh Bhojanapalli
Kimberly Wilber
Andreas Veit
A. S. Rawat
Seungyeon Kim
A. Menon
Sanjiv Kumar
29
35
0
05 Feb 2021
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Ruizhi Li
Gregory Sell
H. Hermansky
23
1
0
05 Feb 2021
Rethinking Soft Labels for Knowledge Distillation: A Bias-Variance Tradeoff Perspective
Helong Zhou
Liangchen Song
Jiajie Chen
Ye Zhou
Guoli Wang
Junsong Yuan
Qian Zhang
30
170
0
01 Feb 2021
Deep Deterministic Information Bottleneck with Matrix-based Entropy Functional
Xi Yu
Shujian Yu
José C. Príncipe
AAML
22
26
0
31 Jan 2021
BERTaú: Itaú BERT for digital customer service
Paulo Finardi
José Dié Viegas
Gustavo T. Ferreira
Alex F. Mansano
Vinicius Fernandes Caridá
35
11
0
28 Jan 2021
Calibrating and Improving Graph Contrastive Learning
Kaili Ma
Haochen Yang
Han Yang
Yongqiang Chen
James Cheng
52
6
0
27 Jan 2021
Measuring Dependence with Matrix-based Entropy Functional
Shujian Yu
Francesco Alesiani
Xi Yu
Robert Jenssen
José C. Príncipe
24
24
0
25 Jan 2021
Self-Adaptive Training: Bridging Supervised and Self-Supervised Learning
Lang Huang
Chaoning Zhang
Hongyang R. Zhang
SSL
36
24
0
21 Jan 2021
BERT & Family Eat Word Salad: Experiments with Text Understanding
Ashim Gupta
Giorgi Kvernadze
Vivek Srikumar
211
73
0
10 Jan 2021
From Black-box to White-box: Examining Confidence Calibration under different Conditions
Franziska Schwaiger
Maximilian Henne
Fabian Küppers
Felippe Schmoeller da Roza
Karsten Roscher
Anselm Haselhoff
39
10
0
08 Jan 2021
Bridging In- and Out-of-distribution Samples for Their Better Discriminability
Engkarat Techapanurak
Anh-Chuong Dang
Takayuki Okatani
OODD
25
3
0
07 Jan 2021
Learning with Retrospection
Xiang Deng
Zhongfei Zhang
35
17
0
24 Dec 2020
How Does a Neural Network's Architecture Impact Its Robustness to Noisy Labels?
Jingling Li
Mozhi Zhang
Keyulu Xu
John P. Dickerson
Jimmy Ba
OOD
NoLa
30
19
0
23 Dec 2020
On Calibration of Scene-Text Recognition Models
Ron Slossberg
Oron Anschel
Amir Markovitz
Ron Litman
Aviad Aberdam
Shahar Tsiper
Shai Mazor
Jon Wu
R. Manmatha
32
13
0
23 Dec 2020
SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation
Viraj Prabhu
Shivam Khare
Deeksha Kartik
Judy Hoffman
46
133
0
21 Dec 2020
MASKER: Masked Keyword Regularization for Reliable Text Classification
S. Moon
Sangwoo Mo
Kimin Lee
Jaeho Lee
Jinwoo Shin
32
38
0
17 Dec 2020
Deep Unsupervised Image Anomaly Detection: An Information Theoretic Framework
Fei Ye
Huangjie Zheng
Chaoqin Huang
Ya Zhang
19
13
0
09 Dec 2020
Self-Training for Class-Incremental Semantic Segmentation
Lu Yu
Xialei Liu
Joost van de Weijer
CLL
SSL
28
54
0
06 Dec 2020
Cross-Layer Distillation with Semantic Calibration
Defang Chen
Jian-Ping Mei
Yuan Zhang
Can Wang
Yan Feng
Chun-Yen Chen
FedML
45
288
0
06 Dec 2020
Graph Mixture Density Networks
Federico Errica
D. Bacciu
Alessio Micheli
32
17
0
05 Dec 2020
Matching Distributions via Optimal Transport for Semi-Supervised Learning
Fariborz Taherkhani
Hadi Kazemi
Ali Dabouei
J. Dawson
Nasser M. Nasrabadi
OT
42
1
0
04 Dec 2020
Regularization via Adaptive Pairwise Label Smoothing
Hongyu Guo
26
0
0
02 Dec 2020
Rethinking Uncertainty in Deep Learning: Whether and How it Improves Robustness
Yilun Jin
Lixin Fan
Kam Woh Ng
Ce Ju
Qiang Yang
AAML
OOD
20
1
0
27 Nov 2020
Tight Integrated End-to-End Training for Cascaded Speech Translation
Parnia Bahar
Tobias Bieschke
Ralf Schluter
Hermann Ney
47
26
0
24 Nov 2020
Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs
Cheng Wang
Carolin (Haas) Lawrence
Mathias Niepert
UQCV
29
10
0
24 Nov 2020
Improving Classifier Confidence using Lossy Label-Invariant Transformations
Sooyong Jang
Insup Lee
James Weimer
UQCV
16
7
0
09 Nov 2020
Parametric Flatten-T Swish: An Adaptive Non-linear Activation Function For Deep Learning
Hock Hung Chieng
Noorhaniza Wahid
P. Ong
21
6
0
06 Nov 2020
Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training
Dongha Choi
Hyunju Lee
22
5
0
04 Nov 2020
Specialization in Hierarchical Learning Systems
Heinke Hihn
Daniel A. Braun
29
16
0
03 Nov 2020
Why Do Better Loss Functions Lead to Less Transferable Features?
Simon Kornblith
Ting-Li Chen
Honglak Lee
Mohammad Norouzi
FaML
45
90
0
30 Oct 2020
An iterative framework for self-supervised deep speaker representation learning
Danwei Cai
Weiqing Wang
Ming Li
SSL
22
37
0
25 Oct 2020
An Investigation of how Label Smoothing Affects Generalization
Blair Chen
Liu Ziyin
Zihao Wang
Paul Pu Liang
UQCV
21
17
0
23 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
30
26
0
22 Oct 2020
Stronger Transformers for Neural Multi-Hop Question Generation
Devendra Singh Sachan
Lingfei Wu
Mrinmaya Sachan
William L. Hamilton
19
8
0
22 Oct 2020
Beyond English-Centric Multilingual Machine Translation
Angela Fan
Shruti Bhosale
Holger Schwenk
Zhiyi Ma
Ahmed El-Kishky
...
Vitaliy Liptchinsky
Sergey Edunov
Edouard Grave
Michael Auli
Armand Joulin
LRM
41
832
0
21 Oct 2020
Facial Emotion Recognition with Noisy Multi-task Annotations
Siwei Zhang
Zhiwu Huang
D. Paudel
Luc Van Gool
CVBM
24
10
0
19 Oct 2020
Multi-Modal Super Resolution for Dense Microscopic Particle Size Estimation
Sarvesh Patil
Chava Y P D Phani Rajanish
N. Margankunte
MedIm
16
0
0
19 Oct 2020
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
Yue Yu
Simiao Zuo
Haoming Jiang
Wendi Ren
T. Zhao
Chao Zhang
AI4MH
15
131
0
15 Oct 2020
Semantic Label Smoothing for Sequence to Sequence Problems
Michal Lukasik
Himanshu Jain
A. Menon
Seungyeon Kim
Srinadh Bhojanapalli
Felix X. Yu
Sanjiv Kumar
AI4TS
25
18
0
15 Oct 2020
Temperature check: theory and practice for training models with softmax-cross-entropy losses
Atish Agarwala
Jeffrey Pennington
Yann N. Dauphin
S. Schoenholz
UQCV
16
33
0
14 Oct 2020
Summarize, Outline, and Elaborate: Long-Text Generation via Hierarchical Supervision from Extractive Summaries
Xiaofei Sun
Zijun Sun
Yuxian Meng
Jiwei Li
Chun Fan
11
18
0
14 Oct 2020
Training Binary Neural Networks through Learning with Noisy Supervision
Kai Han
Yunhe Wang
Yixing Xu
Chunjing Xu
Enhua Wu
Chang Xu
MQ
15
55
0
10 Oct 2020
Token-level Adaptive Training for Neural Machine Translation
Shuhao Gu
Jinchao Zhang
Fandong Meng
Yang Feng
Wanying Xie
Jie Zhou
Dong Yu
27
32
0
09 Oct 2020