arXiv:2308.16175
Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness

30 August 2023
Jiuhai Chen
Jonas W. Mueller

Papers citing "Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness"

18 / 18 papers shown
LAMP: Extracting Locally Linear Decision Surfaces from LLM World Models
Ryan Chen
Youngmin Ko
Zeyu Zhang
Catherine Cho
Sunny Chung
Mauro Giuffré
Dennis L. Shung
Bradly C. Stadie
137
0
0
17 May 2025
Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best?
Ashish Sardana
HILM
VLM
103
0
0
27 Mar 2025
A Survey of Uncertainty Estimation Methods on Large Language Models
Zhiqiu Xia
Jinxuan Xu
Yuqian Zhang
Hang Liu
69
3
0
28 Feb 2025
CER: Confidence Enhanced Reasoning in LLMs
Ali Razghandi
Seyed Mohammad Hadi Hosseini
Mahdieh Soleymani Baghshah
LRM
139
5
0
20 Feb 2025
Graph-based Confidence Calibration for Large Language Models
Yukun Li
Sijia Wang
Lifu Huang
Li-Ping Liu
UQCV
124
2
0
03 Nov 2024
Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs
Ruijia Niu
D. Wu
Rose Yu
Yi-An Ma
57
2
0
09 Oct 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs
D. Yaldiz
Yavuz Faruk Bakman
Baturalp Buyukates
Chenyang Tao
Anil Ramakrishna
Dimitrios Dimitriadis
Jieyu Zhao
Salman Avestimehr
94
7
0
17 Jun 2024
Uncertainty quantification in fine-tuned LLMs using LoRA ensembles
Oleksandr Balabanov
Hampus Linander
UQCV
88
18
0
19 Feb 2024
When do you need Chain-of-Thought Prompting for ChatGPT?
Jiuhai Chen
Lichang Chen
Heng Huang
Dinesh Manocha
LRM
KELM
ReLM
ELM
40
43
0
06 Apr 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
738
9,267
0
28 Jan 2022
A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification
Anastasios Nikolas Angelopoulos
Stephen Bates
OOD
168
615
0
15 Jul 2021
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
129
2,724
0
05 Jun 2020
Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles
Siddhartha Jain
Ge Liu
Jonas W. Mueller
David K Gifford
UQCV
56
60
0
18 Jun 2019
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
140
1,716
0
02 Nov 2018
Deep k-Nearest Neighbors: Towards Confident, Interpretable and Robust Deep Learning
Nicolas Papernot
Patrick McDaniel
OOD
AAML
129
507
0
13 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
62
365
0
26 Feb 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
195
2,636
0
09 May 2017
Bayesian Recurrent Neural Networks
Meire Fortunato
Charles Blundell
Oriol Vinyals
BDL
53
184
0
10 Apr 2017