ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01452
  4. Cited By
Causal Reinforcement Learning: A Survey
v1v2 (latest)

Causal Reinforcement Learning: A Survey

4 July 2023
Zhi-Hong Deng
Jing Jiang
Guodong Long
Chen Zhang
    CMLLRM
ArXiv (abs)PDFHTML

Papers citing "Causal Reinforcement Learning: A Survey"

19 / 69 papers shown
Title
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
CML
121
138
0
15 Nov 2018
A Dissection of Overfitting and Generalization in Continuous
  Reinforcement Learning
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLLOffRL
100
180
0
20 Jun 2018
Meta-Gradient Reinforcement Learning
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
115
327
0
24 May 2018
A Study on Overfitting in Deep Reinforcement Learning
A Study on Overfitting in Deep Reinforcement Learning
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
OffRLOnRL
59
391
0
18 Apr 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
115
349
0
20 Feb 2018
Visualizing and Understanding Atari Agents
Visualizing and Understanding Atari Agents
S. Greydanus
Anurag Koul
Jonathan Dodge
Alan Fern
FAtt
128
348
0
31 Oct 2017
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
117
1,368
0
18 Oct 2017
Causally Regularized Learning with Agnostic Data Selection Bias
Causally Regularized Learning with Agnostic Data Selection Bias
Zheyan Shen
Peng Cui
Kun Kuang
Yangqiu Song
Peixuan Chen
OOD
70
93
0
22 Aug 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
126
417
0
26 Jul 2017
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017
Constrained Policy Optimization
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
134
1,335
0
30 May 2017
Counterfactual Multi-Agent Policy Gradients
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
156
2,090
0
24 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRMSSL
130
2,453
0
15 May 2017
Domain Randomization for Transferring Deep Neural Networks from
  Simulation to the Real World
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
269
2,976
0
20 Mar 2017
Counterfactual Fairness
Counterfactual Fairness
Matt J. Kusner
Joshua R. Loftus
Chris Russell
Ricardo M. A. Silva
FaML
230
1,587
0
20 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
833
11,961
0
09 Mar 2017
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
RL2^22: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
107
1,028
0
09 Nov 2016
Pearl's Calculus of Intervention Is Complete
Pearl's Calculus of Intervention Is Complete
Yimin Huang
Marco Valtorta
CML
100
232
0
27 Jun 2012
Kernel-based Conditional Independence Test and Application in Causal
  Discovery
Kernel-based Conditional Independence Test and Application in Causal Discovery
Kun Zhang
J. Peters
Dominik Janzing
Bernhard Schölkopf
BDLCML
109
632
0
14 Feb 2012
Previous
12