arXiv: 2410.09724
Taming Overconfidence in LLMs: Reward Calibration in RLHF
13 October 2024
Jixuan Leng
Chengsong Huang
Banghua Zhu
Jiaxin Huang
Papers citing "Taming Overconfidence in LLMs: Reward Calibration in RLHF"
6 / 6 papers shown
AGI Is Coming... Right After AI Learns to Play Wordle
Sarath Shekkizhar
Romain Cosentino
LLMAG
45
0
0
21 Apr 2025
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
Belinda Z. Li
Been Kim
Z. Wang
LRM
38
2
0
28 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice
Eric Heim
Oren Wright
David Shriver
OOD
FaML
63
0
0
01 Mar 2025
Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models
Prateek Chhikara
39
1
0
16 Feb 2025
Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis
Sanket R. Jantre
Tianle Wang
Gilchan Park
Kriti Chopra
Nicholas Jeon
Xiaoning Qian
Nathan M. Urban
Byung-Jun Yoon
57
0
0
10 Feb 2025
Can LLMs plan paths in the real world?
Wanyi Chen
Meng-Wen Su
Nafisa Mehjabin
Mary L. Cummings
LLMAG
LRM
101
2
0
26 Nov 2024