Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness

30 August 2023

Papers citing "Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness"

18 / 18 papers shown

Title
LAMP: Extracting Locally Linear Decision Surfaces from LLM World Models Ryan Chen Youngmin Ko Zeyu Zhang Catherine Cho Sunny Chung Mauro Giuffré Dennis L. Shung Bradly C. Stadie 137 0 0 17 May 2025
Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best? Ashish Sardana HILM VLM 103 0 0 27 Mar 2025
A Survey of Uncertainty Estimation Methods on Large Language Models Zhiqiu Xia Jinxuan Xu Yuqian Zhang Hang Liu 69 3 0 28 Feb 2025
CER: Confidence Enhanced Reasoning in LLMs Ali Razghandi Seyed Mohammad Hadi Hosseini Mahdieh Soleymani Baghshah LRM 139 5 0 20 Feb 2025
Graph-based Confidence Calibration for Large Language Models Yukun Li Sijia Wang Lifu Huang Li-Ping Liu UQCV 124 2 0 03 Nov 2024
Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs Ruijia Niu D. Wu Rose Yu Yi-An Ma 57 2 0 09 Oct 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs D. Yaldiz Yavuz Faruk Bakman Baturalp Buyukates Chenyang Tao Anil Ramakrishna Dimitrios Dimitriadis Jieyu Zhao Salman Avestimehr 94 7 0 17 Jun 2024
Uncertainty quantification in fine-tuned LLMs using LoRA ensembles Oleksandr Balabanov Hampus Linander UQCV 88 18 0 19 Feb 2024
When do you need Chain-of-Thought Prompting for ChatGPT? Jiuhai Chen Lichang Chen Heng Huang Dinesh Manocha LRM KELM ReLM ELM 40 43 0 06 Apr 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Jason W. Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Brian Ichter F. Xia Ed H. Chi Quoc Le Denny Zhou LM&Ro LRM AI4CE ReLM 738 9,267 0 28 Jan 2022
A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification Anastasios Nikolas Angelopoulos Stephen Bates OOD 168 615 0 15 Jul 2021
DeBERTa: Decoding-enhanced BERT with Disentangled Attention Pengcheng He Xiaodong Liu Jianfeng Gao Weizhu Chen AAML 129 2,724 0 05 Jun 2020
Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles Siddhartha Jain Ge Liu Jonas W. Mueller David K Gifford UQCV 56 60 0 18 Jun 2019
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge Alon Talmor Jonathan Herzig Nicholas Lourie Jonathan Berant RALM 140 1,716 0 02 Nov 2018
Deep k-Nearest Neighbors: Towards Confident, Interpretable and Robust Deep Learning Nicolas Papernot Patrick McDaniel OOD AAML 129 507 0 13 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling C. Riquelme George Tucker Jasper Snoek BDL 62 365 0 26 Feb 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension Mandar Joshi Eunsol Choi Daniel S. Weld Luke Zettlemoyer RALM 195 2,636 0 09 May 2017
Bayesian Recurrent Neural Networks Meire Fortunato Charles Blundell Oriol Vinyals BDL 53 184 0 10 Apr 2017