Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01452
Cited By
v1
v2 (latest)
Causal Reinforcement Learning: A Survey
4 July 2023
Zhi-Hong Deng
Jing Jiang
Guodong Long
Chen Zhang
CML
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Causal Reinforcement Learning: A Survey"
19 / 69 papers shown
Title
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
CML
121
138
0
15 Nov 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
100
180
0
20 Jun 2018
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
115
327
0
24 May 2018
A Study on Overfitting in Deep Reinforcement Learning
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
OffRL
OnRL
59
391
0
18 Apr 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
115
349
0
20 Feb 2018
Visualizing and Understanding Atari Agents
S. Greydanus
Anurag Koul
Jonathan Dodge
Alan Fern
FAtt
128
348
0
31 Oct 2017
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
117
1,368
0
18 Oct 2017
Causally Regularized Learning with Agnostic Data Selection Bias
Zheyan Shen
Peng Cui
Kun Kuang
Yangqiu Song
Peixuan Chen
OOD
70
93
0
22 Aug 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
126
417
0
26 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
134
1,335
0
30 May 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
156
2,090
0
24 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
130
2,453
0
15 May 2017
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
269
2,976
0
20 Mar 2017
Counterfactual Fairness
Matt J. Kusner
Joshua R. Loftus
Chris Russell
Ricardo M. A. Silva
FaML
230
1,587
0
20 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
833
11,961
0
09 Mar 2017
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
107
1,028
0
09 Nov 2016
Pearl's Calculus of Intervention Is Complete
Yimin Huang
Marco Valtorta
CML
100
232
0
27 Jun 2012
Kernel-based Conditional Independence Test and Application in Causal Discovery
Kun Zhang
J. Peters
Dominik Janzing
Bernhard Schölkopf
BDL
CML
109
632
0
14 Feb 2012
Previous
1
2