ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1701.06548
  4. Cited By
Regularizing Neural Networks by Penalizing Confident Output
  Distributions

Regularizing Neural Networks by Penalizing Confident Output Distributions

23 January 2017
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
    NoLa
ArXivPDFHTML

Papers citing "Regularizing Neural Networks by Penalizing Confident Output Distributions"

50 / 640 papers shown
Title
Deep Collective Knowledge Distillation
Deep Collective Knowledge Distillation
Jihyeon Seo
Kyusam Oh
Chanho Min
Yongkeun Yun
Sungwoo Cho
19
0
0
18 Apr 2023
Approaching Test Time Augmentation in the Context of Uncertainty
  Calibration for Deep Neural Networks
Approaching Test Time Augmentation in the Context of Uncertainty Calibration for Deep Neural Networks
Pedro Conde
T. Barros
Rui L. Lopes
C. Premebida
U. J. Nunes
UQCV
32
7
0
11 Apr 2023
Uncertainty-inspired Open Set Learning for Retinal Anomaly
  Identification
Uncertainty-inspired Open Set Learning for Retinal Anomaly Identification
Meng Wang
Tian Lin
Lianyu Wang
Aidi Lin
K. Zou
...
Yong Liu
C. Pang
Xinjian Chen
Haoyu Chen
Huazhu Fu
111
36
0
08 Apr 2023
Towards Unbiased Calibration using Meta-Regularization
Towards Unbiased Calibration using Meta-Regularization
Cheng Wang
Jacek Golebiowski
39
1
0
27 Mar 2023
Collision Cross-entropy for Soft Class Labels and Deep Clustering
Collision Cross-entropy for Soft Class Labels and Deep Clustering
Z. Zhang
Yuri Boykov
23
0
0
13 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
27
71
0
13 Mar 2023
Trust your neighbours: Penalty-based constraints for model calibration
Trust your neighbours: Penalty-based constraints for model calibration
Balamurali Murugesan
V. SukeshAdiga
Bingyuan Liu
H. Lombaert
Ismail Ben Ayed
Jose Dolz
UQCV
26
9
0
11 Mar 2023
TANGOS: Regularizing Tabular Neural Networks through Gradient
  Orthogonalization and Specialization
TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization
Alan Jeffares
Tennison Liu
Jonathan Crabbé
F. Imrie
M. Schaar
CML
58
23
0
09 Mar 2023
Rethinking Confidence Calibration for Failure Prediction
Rethinking Confidence Calibration for Failure Prediction
Fei Zhu
Zhen Cheng
Xu-Yao Zhang
Cheng-Lin Liu
UQCV
22
39
0
06 Mar 2023
Fine-Grained Classification with Noisy Labels
Fine-Grained Classification with Noisy Labels
Qinglai Wei
Lei Feng
Haoliang Sun
Ren Wang
Chenhui Guo
Yilong Yin
NoLa
110
20
0
04 Mar 2023
Uncertainty Estimation by Fisher Information-based Evidential Deep
  Learning
Uncertainty Estimation by Fisher Information-based Evidential Deep Learning
Danruo Deng
Guangyong Chen
Yang Yu
Fu-Lun Liu
Pheng-Ann Heng
EDL
UQCV
FedML
37
41
0
03 Mar 2023
Towards Fine-Grained Information: Identifying the Type and Location of
  Translation Errors
Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors
Keqin Bao
Boyi Deng
Dayiheng Liu
Baosong Yang
Wenqiang Lei
Xiangnan He
Derek F.Wong
Jun Xie
42
4
0
17 Feb 2023
Bag of Tricks for In-Distribution Calibration of Pretrained Transformers
Bag of Tricks for In-Distribution Calibration of Pretrained Transformers
Jaeyoung Kim
Dongbin Na
Sungchul Choi
Sungbin Lim
VLM
43
5
0
13 Feb 2023
Encoding Sentence Position in Context-Aware Neural Machine Translation
  with Concatenation
Encoding Sentence Position in Context-Aware Neural Machine Translation with Concatenation
Lorenzo Lupo
Marco Dinarelli
Laurent Besacier
39
9
0
13 Feb 2023
Learning from Noisy Crowd Labels with Logics
Learning from Noisy Crowd Labels with Logics
Zhijun Chen
Hailong Sun
Haoqian He
Pengpeng Chen
NoLa
NAI
34
7
0
13 Feb 2023
Mutation-Based Adversarial Attacks on Neural Text Detectors
Mutation-Based Adversarial Attacks on Neural Text Detectors
G. Liang
Jesus Guerrero
I. Alsmadi
DeLMO
40
7
0
11 Feb 2023
Realistic Conversational Question Answering with Answer Selection based
  on Calibrated Confidence and Uncertainty Measurement
Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement
Soyeong Jeong
Jinheon Baek
Sung Ju Hwang
Jong C. Park
29
2
0
10 Feb 2023
A Prototype-Oriented Clustering for Domain Shift with Source Privacy
A Prototype-Oriented Clustering for Domain Shift with Source Privacy
Korawat Tanwisuth
Shujian Zhang
Pengcheng He
Mingyuan Zhou
34
3
0
08 Feb 2023
APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model
  for Noisy Labels and Long-tailed Learning
APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model for Noisy Labels and Long-tailed Learning
Sunyi Chi
B. Dong
Yiming Xu
Zhenyu Shi
Zheng Du
NoLa
44
3
0
06 Feb 2023
Flat Seeking Bayesian Neural Networks
Flat Seeking Bayesian Neural Networks
Van-Anh Nguyen
L. Vuong
Hoang Phan
Thanh-Toan Do
Dinh Q. Phung
Trung Le
BDL
42
8
0
06 Feb 2023
Generalized Uncertainty of Deep Neural Networks: Taxonomy and
  Applications
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications
Chengyu Dong
OOD
UQCV
BDL
AI4CE
41
0
0
02 Feb 2023
Controlling Steering with Energy-Based Models
Controlling Steering with Energy-Based Models
Mykyta Baliesnyi
Ardi Tampuu
Tambet Matiisen
LLMSV
43
2
0
28 Jan 2023
Discriminative Entropy Clustering and its Relation to K-means and SVM
Discriminative Entropy Clustering and its Relation to K-means and SVM
Z. Zhang
Yuri Boykov
11
0
0
26 Jan 2023
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for
  Age Group Classification
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification
Kwangje Baeg
Yeong-Gwan Kim
Youngsub Han
Byoung-Ki Jeon
24
0
0
22 Jan 2023
Improving Deep Regression with Ordinal Entropy
Improving Deep Regression with Ordinal Entropy
Shihao Zhang
Linlin Yang
Michael Bi Mi
Xiaoxu Zheng
Angela Yao
UQCV
27
39
0
21 Jan 2023
Why do Nearest Neighbor Language Models Work?
Why do Nearest Neighbor Language Models Work?
Frank F. Xu
Uri Alon
Graham Neubig
RALM
30
22
0
07 Jan 2023
Guidance Through Surrogate: Towards a Generic Diagnostic Attack
Guidance Through Surrogate: Towards a Generic Diagnostic Attack
Muzammal Naseer
Salman Khan
Fatih Porikli
Fahad Shahbaz Khan
AAML
28
1
0
30 Dec 2022
Annealing Double-Head: An Architecture for Online Calibration of Deep
  Neural Networks
Annealing Double-Head: An Architecture for Online Calibration of Deep Neural Networks
Erdong Guo
D. Draper
Maria de Iorio
41
0
0
27 Dec 2022
Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Tomer Wullach
Shlomo E. Chazan
30
1
0
27 Dec 2022
The Forward-Forward Algorithm: Some Preliminary Investigations
The Forward-Forward Algorithm: Some Preliminary Investigations
Geoffrey E. Hinton
39
260
0
27 Dec 2022
A Survey of Mix-based Data Augmentation: Taxonomy, Methods,
  Applications, and Explainability
A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability
Chengtai Cao
Fan Zhou
Yurou Dai
Jianping Wang
Kunpeng Zhang
AAML
31
28
0
21 Dec 2022
Calibrating Deep Neural Networks using Explicit Regularisation and
  Dynamic Data Pruning
Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning
R. Hebbalaguppe
Rishabh Patra
T. Dash
Gautam M. Shroff
L. Vig
25
14
0
20 Dec 2022
Rethinking Label Smoothing on Multi-hop Question Answering
Rethinking Label Smoothing on Multi-hop Question Answering
Zhangyue Yin
Yuxin Wang
Xiannian Hu
Yiguang Wu
Hang Yan
Xinyu Zhang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
26
9
0
19 Dec 2022
Learning from Training Dynamics: Identifying Mislabeled Data Beyond
  Manually Designed Features
Learning from Training Dynamics: Identifying Mislabeled Data Beyond Manually Designed Features
Qingrui Jia
Xuhong Li
Lei Yu
Jiang Bian
Penghao Zhao
Shupeng Li
Haoyi Xiong
Dejing Dou
NoLa
35
5
0
19 Dec 2022
NLIP: Noise-robust Language-Image Pre-training
NLIP: Noise-robust Language-Image Pre-training
Runhu Huang
Yanxin Long
Jianhua Han
Hang Xu
Xiwen Liang
Chunjing Xu
Xiaodan Liang
VLM
41
30
0
14 Dec 2022
Improving group robustness under noisy labels using predictive
  uncertainty
Improving group robustness under noisy labels using predictive uncertainty
Dongpin Oh
Dae Lee
Jeunghyun Byun
Bonggun Shin
UQCV
25
3
0
14 Dec 2022
Leveraging Unlabeled Data to Track Memorization
Leveraging Unlabeled Data to Track Memorization
Mahsa Forouzesh
Hanie Sedghi
Patrick Thiran
NoLa
TDI
39
4
0
08 Dec 2022
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
26
3
0
08 Dec 2022
FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
Yulei Qin
Xingyu Chen
Chao Chen
Yunhang Shen
Bohan Ren
Yun Gu
Jie Yang
Chunhua Shen
49
4
0
01 Dec 2022
Transfer Entropy Bottleneck: Learning Sequence to Sequence Information
  Transfer
Transfer Entropy Bottleneck: Learning Sequence to Sequence Information Transfer
Damjan Kalajdzievski
Ximeng Mao
Pascal Fortier-Poisson
Guillaume Lajoie
Blake A. Richards
AI4TS
18
3
0
29 Nov 2022
Revisiting Distance Metric Learning for Few-Shot Natural Language
  Classification
Revisiting Distance Metric Learning for Few-Shot Natural Language Classification
Witold Sosnowski
Anna Wróblewska
Karolina Seweryn
P. Gawrysiak
23
0
0
28 Nov 2022
Distance Metric Learning Loss Functions in Few-Shot Scenarios of
  Supervised Language Models Fine-Tuning
Distance Metric Learning Loss Functions in Few-Shot Scenarios of Supervised Language Models Fine-Tuning
Witold Sosnowski
Karolina Seweryn
Anna Wróblewska
P. Gawrysiak
26
0
0
28 Nov 2022
Class Adaptive Network Calibration
Class Adaptive Network Calibration
Bingyuan Liu
Jérôme Rony
Adrian Galdran
Jose Dolz
Ismail Ben Ayed
48
8
0
28 Nov 2022
Cross-Domain Ensemble Distillation for Domain Generalization
Cross-Domain Ensemble Distillation for Domain Generalization
Kyung-Jin Lee
Sungyeon Kim
Suha Kwak
FedML
OOD
28
38
0
25 Nov 2022
Improving Multi-task Learning via Seeking Task-based Flat Regions
Improving Multi-task Learning via Seeking Task-based Flat Regions
Hoang Phan
Lam C. Tran
Ngoc N. Tran
Nhat Ho
Dinh Q. Phung
Trung Le
38
11
0
24 Nov 2022
Understanding the Role of Mixup in Knowledge Distillation: An Empirical
  Study
Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study
Hongjun Choi
Eunyeong Jeon
Ankita Shukla
Pavan Turaga
26
8
0
08 Nov 2022
Parallel Attention Forcing for Machine Translation
Parallel Attention Forcing for Machine Translation
Qingyun Dou
Mark Gales
27
0
0
06 Nov 2022
Calibration Meets Explanation: A Simple and Effective Approach for Model
  Confidence Estimates
Calibration Meets Explanation: A Simple and Effective Approach for Model Confidence Estimates
Dongfang Li
Baotian Hu
Qingcai Chen
16
8
0
06 Nov 2022
A Close Look into the Calibration of Pre-trained Language Models
A Close Look into the Calibration of Pre-trained Language Models
Yangyi Chen
Lifan Yuan
Yuchen Zhang
Zhiyuan Liu
Heng Ji
42
44
0
31 Oct 2022
On-the-fly Object Detection using StyleGAN with CLIP Guidance
On-the-fly Object Detection using StyleGAN with CLIP Guidance
Yu-Ta Lu
Shusen Liu
Jayaraman J. Thiagarajan
W. Sakla
Rushil Anirudh
VLM
ObjD
35
1
0
30 Oct 2022
Previous
12345...111213
Next