Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws

23 February 2023

Papers citing "Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws"


Scaling Laws for Neural Language Models
Jared Kaplan, Sam McCandlish, T. Henighan, Tom B. Brown, B. Chess, R. Child, Scott Gray, Alec Radford, Jeff Wu, Dario Amodei
264 4,489 0
23 Jan 2020