ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.09724
  4. Cited By
Taming Overconfidence in LLMs: Reward Calibration in RLHF

Taming Overconfidence in LLMs: Reward Calibration in RLHF

13 October 2024
Jixuan Leng
Chengsong Huang
Banghua Zhu
Jiaxin Huang
ArXivPDFHTML

Papers citing "Taming Overconfidence in LLMs: Reward Calibration in RLHF"

6 / 6 papers shown
Title
AGI Is Coming... Right After AI Learns to Play Wordle
AGI Is Coming... Right After AI Learns to Play Wordle
Sarath Shekkizhar
Romain Cosentino
LLMAG
45
0
0
21 Apr 2025
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
Belinda Z. Li
Been Kim
Z. Wang
LRM
38
2
0
28 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice
Eric Heim
Oren Wright
David Shriver
OOD
FaML
63
0
0
01 Mar 2025
Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models
Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models
Prateek Chhikara
39
1
0
16 Feb 2025
Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis
Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis
Sanket R. Jantre
Tianle Wang
Gilchan Park
Kriti Chopra
Nicholas Jeon
Xiaoning Qian
Nathan M. Urban
Byung-Jun Yoon
57
0
0
10 Feb 2025
Can LLMs plan paths in the real world?
Can LLMs plan paths in the real world?
Wanyi Chen
Meng-Wen Su
Nafisa Mehjabin
Mary L. Cummings
LLMAG
LRM
101
2
0
26 Nov 2024
1