ResearchTrend.AI

Learning Retrospective Knowledge with Reverse Reinforcement Learning

9 July 2020
Shangtong Zhang
Vivek Veeriah
Shimon Whiteson
    OffRL
    AI4TS

Papers citing "Learning Retrospective Knowledge with Reverse Reinforcement Learning"

10 / 10 papers shown
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu
Yuhang Jiang
Boyuan Wang
Yixiu Mao
Cheems Wang
Chang-Shu Liu
Xiangyang Ji
132
5
0
10 Jan 2025
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
47
25
0
26 Oct 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang
Bo Liu
Shimon Whiteson
OffRL
26
103
0
29 Jan 2020
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
87
332
0
10 Jun 2019
Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift
Carles Gelada
Marc G. Bellemare
OffRL
44
97
0
27 Jan 2019
Deep Learning for Anomaly Detection: A Survey
Raghavendra Chalapathy
Sanjay Chawla
AI4TS
110
1,486
0
10 Jan 2019
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
149
1,584
0
05 Feb 2018
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
174
5,056
0
05 Jun 2016
On Convergence of Emphatic Temporal-Difference Learning
Huizhen Yu
OffRL
38
73
0
08 Jun 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning
R. Sutton
A. R. Mahmood
Martha White
59
269
0
14 Mar 2015