Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.09144
Cited By
Goodhart's Law in Reinforcement Learning
13 October 2023
Jacek Karwowski
Oliver Hayman
Xingjian Bai
Klaus Kiendlhofer
Charlie Griffin
Joar Skalse
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Goodhart's Law in Reinforcement Learning"
5 / 5 papers shown
Title
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
177
9
0
29 Jan 2025
Misspecification in Inverse Reinforcement Learning
Joar Skalse
Alessandro Abate
51
23
0
06 Dec 2022
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Alexander Pan
Kush S. Bhatia
Jacob Steinhardt
81
179
0
10 Jan 2022
Consequences of Misaligned AI
Simon Zhuang
Dylan Hadfield-Menell
62
75
0
07 Feb 2021
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
197
1,557
0
11 May 2017
1