ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1605.03143
  4. Cited By
Avoiding Wireheading with Value Reinforcement Learning

Avoiding Wireheading with Value Reinforcement Learning

10 May 2016
Tom Everitt
Marcus Hutter
    AI4CE
ArXivPDFHTML

Papers citing "Avoiding Wireheading with Value Reinforcement Learning"

8 / 8 papers shown
Title
MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking
MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking
Sebastian Farquhar
Vikrant Varma
David Lindner
David Elson
Caleb Biddulph
Ian Goodfellow
Rohin Shah
141
2
0
22 Jan 2025
Death and Suicide in Universal Artificial Intelligence
Death and Suicide in Universal Artificial Intelligence
Jarryd Martin
Tom Everitt
Marcus Hutter
29
21
0
02 Jun 2016
Self-Modification of Policy and Utility Function in Rational Agents
Self-Modification of Policy and Utility Function in Rational Agents
Tom Everitt
Daniel Filan
Mayank Daswani
Marcus Hutter
48
29
0
10 May 2016
Towards Resolving Unidentifiability in Inverse Reinforcement Learning
Towards Resolving Unidentifiability in Inverse Reinforcement Learning
Kareem Amin
Satinder Singh
41
30
0
25 Jan 2016
Learning the Preferences of Ignorant, Inconsistent Agents
Learning the Preferences of Ignorant, Inconsistent Agents
Owain Evans
Andreas Stuhlmuller
Noah D. Goodman
48
128
0
18 Dec 2015
Sequential Extensions of Causal and Evidential Decision Theory
Sequential Extensions of Causal and Evidential Decision Theory
Tom Everitt
Jan Leike
Marcus Hutter
CML
41
15
0
24 Jun 2015
Model-based Utility Functions
Model-based Utility Functions
B. Hibbard
106
48
0
16 Nov 2011
Universal Intelligence: A Definition of Machine Intelligence
Universal Intelligence: A Definition of Machine Intelligence
Shane Legg
Marcus Hutter
98
641
0
20 Dec 2007
1