ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.07892
  4. Cited By
Calibration of Pre-trained Transformers

Calibration of Pre-trained Transformers

17 March 2020
Shrey Desai
Greg Durrett
    UQLM
ArXivPDFHTML

Papers citing "Calibration of Pre-trained Transformers"

40 / 40 papers shown
Title
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection
MingShan Liu
Shi Bo
Jialing Fang
LRM
42
2
0
13 Apr 2025
Confidence Regularized Masked Language Modeling using Text Length
Confidence Regularized Masked Language Modeling using Text Length
Seunghyun Ji
Soowon Lee
89
0
0
08 Apr 2025
Large Language Model Confidence Estimation via Black-Box Access
Large Language Model Confidence Estimation via Black-Box Access
Tejaswini Pedapati
Amit Dhurandhar
Soumya Ghosh
Soham Dan
P. Sattigeri
129
5
0
21 Feb 2025
Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models
Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models
Behraj Khan
T. Syed
362
1
0
29 Jan 2025
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
Yiming Wang
Pei Zhang
Baosong Yang
Derek F. Wong
Rui Wang
LRM
68
9
0
17 Oct 2024
Calibrating Expressions of Certainty
Calibrating Expressions of Certainty
Peiqi Wang
Barbara D. Lam
Yingcheng Liu
Ameneh Asgari-Targhi
Yikang Shen
W. Wells
Tina Kapur
Polina Golland
63
1
0
06 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
78
2
0
02 Oct 2024
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
75
0
0
02 Jul 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
97
6
0
22 Jun 2024
Reassessing How to Compare and Improve the Calibration of Machine Learning Models
Reassessing How to Compare and Improve the Calibration of Machine Learning Models
M. Chidambaram
Rong Ge
93
1
0
06 Jun 2024
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Kaiqu Liang
Zixu Zhang
J. F. Fisac
LLMAG
85
7
0
09 Feb 2024
Predicting generalization performance with correctness discriminators
Predicting generalization performance with correctness discriminators
Yuekun Yao
Alexander Koller
67
1
0
15 Nov 2023
Evaluating Lottery Tickets Under Distributional Shifts
Evaluating Lottery Tickets Under Distributional Shifts
Shrey Desai
Hongyuan Zhan
Ahmed Aly
UQCV
OOD
30
41
0
28 Oct 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
234
6,420
0
26 Sep 2019
Revealing the Dark Secrets of BERT
Revealing the Dark Secrets of BERT
Olga Kovaleva
Alexey Romanov
Anna Rogers
Anna Rumshisky
22
551
0
21 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
341
24,160
0
26 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
156
8,386
0
19 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
172
1,586
0
11 Jun 2019
Simplified Neural Unsupervised Domain Adaptation
Simplified Neural Unsupervised Domain Adaptation
Timothy A. Miller
15
29
0
22 May 2019
HellaSwag: Can a Machine Really Finish Your Sentence?
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
56
2,373
0
19 May 2019
How to Fine-Tune BERT for Text Classification?
How to Fine-Tune BERT for Text Classification?
Chi Sun
Xipeng Qiu
Yige Xu
Xuanjing Huang
50
1,508
0
14 May 2019
SuperGLUE: A Stickier Benchmark for General-Purpose Language
  Understanding Systems
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
148
2,287
0
02 May 2019
Calibration of Encoder Decoder Models for Neural Machine Translation
Calibration of Encoder Decoder Models for Neural Machine Translation
Aviral Kumar
Sunita Sarawagi
78
99
0
03 Mar 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
782
93,936
0
11 Oct 2018
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense
  Inference
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
Rowan Zellers
Yonatan Bisk
Roy Schwartz
Yejin Choi
66
710
0
16 Aug 2018
AllenNLP: A Deep Semantic Natural Language Processing Platform
AllenNLP: A Deep Semantic Natural Language Processing Platform
Matt Gardner
Joel Grus
Mark Neumann
Oyvind Tafjord
Pradeep Dasigi
Nelson F. Liu
Matthew E. Peters
Michael Schmitz
Luke Zettlemoyer
VLM
43
1,280
0
20 Mar 2018
Training Confidence-calibrated Classifiers for Detecting
  Out-of-Distribution Samples
Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples
Kimin Lee
Honglak Lee
Kibok Lee
Jinwoo Shin
OODD
85
880
0
26 Nov 2017
A Continuously Growing Dataset of Sentential Paraphrases
A Continuously Growing Dataset of Sentential Paraphrases
Wuwei Lan
Siyu Qiu
Hua He
Wei Xu
LRM
34
166
0
01 Aug 2017
On Calibration of Modern Neural Networks
On Calibration of Modern Neural Networks
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
154
5,774
0
14 Jun 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
309
129,831
0
12 Jun 2017
Enhancing The Reliability of Out-of-distribution Image Detection in
  Neural Networks
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks
Shiyu Liang
Yixuan Li
R. Srikant
UQCV
OODD
86
2,046
0
08 Jun 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through
  Inference
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
332
4,444
0
18 Apr 2017
What Uncertainties Do We Need in Bayesian Deep Learning for Computer
  Vision?
What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?
Alex Kendall
Y. Gal
BDL
OOD
UD
UQCV
PER
238
4,667
0
15 Mar 2017
Regularizing Neural Networks by Penalizing Confident Output
  Distributions
Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
93
1,133
0
23 Jan 2017
A Baseline for Detecting Misclassified and Out-of-Distribution Examples
  in Neural Networks
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks
Dan Hendrycks
Kevin Gimpel
UQCV
86
3,420
0
07 Oct 2016
Enhanced LSTM for Natural Language Inference
Enhanced LSTM for Natural Language Inference
Qian Chen
Xiao-Dan Zhu
Zhenhua Ling
Si Wei
Hui Jiang
Diana Inkpen
LRM
ReLM
56
1,128
0
20 Sep 2016
A Decomposable Attention Model for Natural Language Inference
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
302
1,369
0
06 Jun 2016
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment
  Classification
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification
Xilun Chen
Yu Sun
Ben Athiwaratkun
Claire Cardie
Kilian Q. Weinberger
248
315
0
06 Jun 2016
A large annotated corpus for learning natural language inference
A large annotated corpus for learning natural language inference
Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
192
4,256
0
21 Aug 2015
Posterior calibration and exploratory analysis for natural language
  processing models
Posterior calibration and exploratory analysis for natural language processing models
Khanh Nguyen
Brendan O'Connor
62
137
0
21 Aug 2015
1