Taming Overconfidence in LLMs: Reward Calibration in RLHF

13 October 2024

Papers citing "Taming Overconfidence in LLMs: Reward Calibration in RLHF"

6 / 6 papers shown

Title
AGI Is Coming... Right After AI Learns to Play Wordle Sarath Shekkizhar Romain Cosentino LLMAG 45 0 0 21 Apr 2025
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? Belinda Z. Li Been Kim Z. Wang LRM 38 2 0 28 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice Eric Heim Oren Wright David Shriver OOD FaML 63 0 0 01 Mar 2025
Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models Prateek Chhikara 39 1 0 16 Feb 2025
Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis Sanket R. Jantre Tianle Wang Gilchan Park Kriti Chopra Nicholas Jeon Xiaoning Qian Nathan M. Urban Byung-Jun Yoon 57 0 0 10 Feb 2025
Can LLMs plan paths in the real world? Wanyi Chen Meng-Wen Su Nafisa Mehjabin Mary L. Cummings LLMAG LRM 101 2 0 26 Nov 2024