2505.21941
Test-Time Scaling with Repeated Sampling Improves Multilingual Text Generation
28 May 2025
Ashim Gupta, Vivek Srikumar
LRM
Papers citing "Test-Time Scaling with Repeated Sampling Improves Multilingual Text Generation"
3 / 3 papers shown
M-RewardBench: Evaluating Reward Models in Multilingual Settings
Srishti Gureja, Lester James V. Miranda, Shayekh Bin Islam, Rishabh Maheshwary, Drishti Sharma, Gusti Winata, Nathan Lambert, Sebastian Ruder, Sara Hooker, Marzieh Fadaee
LRM
74
18
0
20 Oct 2024
Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown
Xingzhou Lou, Dong Yan, Wei Shen, Yuzi Yan, Jian Xie, Junge Zhang
128
25
0
01 Oct 2024
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar
LRM
104
576
0
06 Aug 2024