Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.01685
Cited By
Agent Incentives: A Causal Perspective
2 February 2021
Tom Everitt
Ryan Carey
Eric D. Langlois
Pedro A. Ortega
Shane Legg
CML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Agent Incentives: A Causal Perspective"
22 / 22 papers shown
Title
MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking
Sebastian Farquhar
Vikrant Varma
David Lindner
David Elson
Caleb Biddulph
Ian Goodfellow
Rohin Shah
138
2
0
22 Jan 2025
Causal Imitation Learning with Unobserved Confounders
Junzhe Zhang
D. Kumor
Elias Bareinboim
CML
59
75
0
12 Aug 2022
How RL Agents Behave When Their Actions Are Modified
Eric D. Langlois
Tom Everitt
51
13
0
15 Feb 2021
Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice
Lewis Hammond
James Fox
Tom Everitt
Alessandro Abate
Michael Wooldridge
37
10
0
09 Feb 2021
AGI Agent Safety by Iteratively Improving the Utility Function
K. Holtman
AI4CE
39
8
0
10 Jul 2020
The Incentives that Shape Behaviour
Ryan Carey
Eric D. Langlois
Tom Everitt
Shane Legg
CML
59
13
0
20 Jan 2020
Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Tom Everitt
Marcus Hutter
Ramana Kumar
Victoria Krakovna
59
95
0
13 Aug 2019
Modeling AGI Safety Frameworks with Causal Influence Diagrams
Tom Everitt
Ramana Kumar
Victoria Krakovna
Shane Legg
AI4CE
36
22
0
20 Jun 2019
Asymptotically Unambitious Artificial General Intelligence
Michael K. Cohen
Badri N. Vellambi
Marcus Hutter
ELM
AI4CE
39
18
0
29 May 2019
Learning Optimal Fair Policies
Razieh Nabi
Daniel Malinsky
I. Shpitser
FaML
39
87
0
06 Sep 2018
Avoiding Discrimination through Causal Reasoning
Niki Kilbertus
Mateo Rojas-Carulla
Giambattista Parascandolo
Moritz Hardt
Dominik Janzing
Bernhard Schölkopf
FaML
CML
103
581
0
08 Jun 2017
Fair Inference On Outcomes
Razieh Nabi
I. Shpitser
FaML
49
351
0
29 May 2017
Counterfactual Fairness
Matt J. Kusner
Joshua R. Loftus
Chris Russell
Ricardo M. A. Silva
FaML
195
1,576
0
20 Mar 2017
The Off-Switch Game
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
55
138
0
24 Nov 2016
A causal framework for discovering and removing direct and indirect discrimination
Lu Zhang
Yongkai Wu
Xintao Wu
CML
49
173
0
22 Nov 2016
The AGI Containment Problem
James Babcock
János Kramár
Roman V. Yampolskiy
53
275
0
02 Apr 2016
Causal Networks: Semantics and Expressiveness
Thomas Verma
Judea Pearl
GNN
75
551
0
27 Mar 2013
A Decision-Based View of Causality
David Heckerman
Ross D. Shachter
CML
60
44
0
27 Feb 2013
Bayes-Ball: The Rational Pastime (for Determining Irrelevance and Requisite Information in Belief Networks and Influence Diagrams)
Ross D. Shachter
65
242
0
30 Jan 2013
Welldefined Decision Scenarios
Thomas D. Nielsen
F. V. Jensen
52
67
0
23 Jan 2013
Causal Discovery from Changes
Jin Tian
Judea Pearl
CML
110
164
0
10 Jan 2013
Direct and Indirect Effects
Judea Pearl
CML
89
2,169
0
10 Jan 2013
1