ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.00379
  4. Cited By
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
v1v2 (latest)

Recall Traces: Backtracking Models for Efficient Reinforcement Learning

2 April 2018
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Recall Traces: Backtracking Models for Efficient Reinforcement Learning"

29 / 29 papers shown
Title
Offline Imitation Learning with Model-based Reverse Augmentation
Offline Imitation Learning with Model-based Reverse Augmentation
Jie-Jing Shao
Hao-Sen Shi
Lan-Zhe Guo
Yu-Feng Li
OffRL
64
5
0
18 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
209
7
0
03 Jun 2024
Backward Learning for Goal-Conditioned Policies
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
103
1
0
08 Dec 2023
Backward Imitation and Forward Reinforcement Learning via Bi-directional
  Model Rollouts
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts
Yuxin Pan
Fangzhen Lin
OffRL
64
3
0
04 Aug 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRLLRM
125
111
0
19 Jun 2022
Double Check Your State Before Trusting It: Confidence-Aware
  Bidirectional Offline Model-Based Imagination
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
Jiafei Lyu
Xiu Li
Zongqing Lu
OffRL
97
26
0
16 Jun 2022
Retrieval-Augmented Reinforcement Learning
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
92
56
0
17 Feb 2022
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for
  Reinforcement Learning
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
101
46
0
08 Jun 2021
Solving Sokoban with forward-backward reinforcement learning
Solving Sokoban with forward-backward reinforcement learning
Yaron Shoham
G. Elidan
OffRL
118
6
0
05 May 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data
  Augmentation
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Kun Zhang
Bernhard Schölkopf
CMLOODOffRL
92
60
0
16 Dec 2020
Forethought and Hindsight in Credit Assignment
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
92
26
0
26 Oct 2020
REMAX: Relational Representation for Multi-Agent Exploration
REMAX: Relational Representation for Multi-Agent Exploration
Heechang Ryu
Hayong Shin
Jinkyoo Park
68
4
0
12 Aug 2020
Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation
Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation
Zhenshan Bing
Matthias Brucker
F. O. Morin
Kai-Qi Huang
Alois C. Knoll
73
27
0
27 Jul 2020
Bidirectional Model-based Policy Optimization
Bidirectional Model-based Policy Optimization
Hang Lai
Jian Shen
Weinan Zhang
Yong Yu
76
58
0
04 Jul 2020
Empirical Policy Evaluation with Supergraphs
Empirical Policy Evaluation with Supergraphs
Daniel Vial
V. Subramanian
OffRL
34
0
0
18 Feb 2020
Hindsight Credit Assignment
Hindsight Credit Assignment
Anna Harutyunyan
Will Dabney
Thomas Mesnard
M. G. Azar
Bilal Piot
...
H. V. Hasselt
Greg Wayne
Satinder Singh
Doina Precup
Rémi Munos
95
75
0
05 Dec 2019
Learning the Arrow of Time
Learning the Arrow of Time
Nasim Rahaman
Steffen Wolf
Anirudh Goyal
Roman Remme
Yoshua Bengio
61
5
0
02 Jul 2019
Exploration via Hindsight Goal Generation
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
87
90
0
10 Jun 2019
Generative Exploration and Exploitation
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
39
6
0
21 Apr 2019
Generative predecessor models for sample-efficient imitation learning
Generative predecessor models for sample-efficient imitation learning
Yannick Schroecker
Mel Vecerík
Jonathan Scholz
VLM
60
31
0
01 Apr 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
156
370
0
30 Jan 2019
InfoBot: Transfer and Exploration via the Information Bottleneck
InfoBot: Transfer and Exploration via the Information Bottleneck
Anirudh Goyal
Riashat Islam
Daniel Strouse
Zafarali Ahmed
M. Botvinick
Hugo Larochelle
Yoshua Bengio
Sergey Levine
OffRL
141
167
0
30 Jan 2019
Generative Adversarial Self-Imitation Learning
Generative Adversarial Self-Imitation Learning
Yijie Guo
Junhyuk Oh
Satinder Singh
Honglak Lee
GAN
102
59
0
03 Dec 2018
Time Reversal as Self-Supervision
Time Reversal as Self-Supervision
Suraj Nair
Mohammad Babaeizadeh
Chelsea Finn
Sergey Levine
Vikash Kumar
SSL
96
12
0
02 Oct 2018
Backplay: "Man muss immer umkehren"
Backplay: "Man muss immer umkehren"
Cinjon Resnick
R. Raileanu
Sanyam Kapoor
Alex Peysakhovich
Kyunghyun Cho
Joan Bruna
OffRL
101
45
0
18 Jul 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
134
222
0
20 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional
  State Spaces
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
G. Z. Holland
Erik Talvitie
Michael Bowling
AI4CE
72
43
0
05 Jun 2018
Imitating Latent Policies from Observation
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
108
139
0
21 May 2018
1