2503.02623
Cited By
Versions: v1, v2, v3 (latest)
Rewarding Doubt: A Reinforcement Learning Approach to Calibrated Confidence Expression of Large Language Models
4 March 2025
Paul Stangel
David Bani-Harouni
Chantal Pellegrini
Ege Özsoy
Kamilia Zaripova
Matthias Keicher
Nassir Navab
Papers citing "Rewarding Doubt: A Reinforcement Learning Approach to Calibrated Confidence Expression of Large Language Models"
1 / 1 papers shown
Title
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation
Caiqi Zhang
Xiaochen Zhu
Chengzu Li
Nigel Collier
Andreas Vlachos
OffRL
HILM
53
1
0
29 May 2025