Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.00379
Cited By
v1
v2 (latest)
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
2 April 2018
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Recall Traces: Backtracking Models for Efficient Reinforcement Learning"
29 / 29 papers shown
Title
Offline Imitation Learning with Model-based Reverse Augmentation
Jie-Jing Shao
Hao-Sen Shi
Lan-Zhe Guo
Yu-Feng Li
OffRL
64
5
0
18 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
209
7
0
03 Jun 2024
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
103
1
0
08 Dec 2023
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts
Yuxin Pan
Fangzhen Lin
OffRL
64
3
0
04 Aug 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
125
111
0
19 Jun 2022
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
Jiafei Lyu
Xiu Li
Zongqing Lu
OffRL
97
26
0
16 Jun 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
92
56
0
17 Feb 2022
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
101
46
0
08 Jun 2021
Solving Sokoban with forward-backward reinforcement learning
Yaron Shoham
G. Elidan
OffRL
118
6
0
05 May 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Kun Zhang
Bernhard Schölkopf
CML
OOD
OffRL
92
60
0
16 Dec 2020
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
92
26
0
26 Oct 2020
REMAX: Relational Representation for Multi-Agent Exploration
Heechang Ryu
Hayong Shin
Jinkyoo Park
68
4
0
12 Aug 2020
Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation
Zhenshan Bing
Matthias Brucker
F. O. Morin
Kai-Qi Huang
Alois C. Knoll
73
27
0
27 Jul 2020
Bidirectional Model-based Policy Optimization
Hang Lai
Jian Shen
Weinan Zhang
Yong Yu
76
58
0
04 Jul 2020
Empirical Policy Evaluation with Supergraphs
Daniel Vial
V. Subramanian
OffRL
34
0
0
18 Feb 2020
Hindsight Credit Assignment
Anna Harutyunyan
Will Dabney
Thomas Mesnard
M. G. Azar
Bilal Piot
...
H. V. Hasselt
Greg Wayne
Satinder Singh
Doina Precup
Rémi Munos
95
75
0
05 Dec 2019
Learning the Arrow of Time
Nasim Rahaman
Steffen Wolf
Anirudh Goyal
Roman Remme
Yoshua Bengio
61
5
0
02 Jul 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
87
90
0
10 Jun 2019
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
39
6
0
21 Apr 2019
Generative predecessor models for sample-efficient imitation learning
Yannick Schroecker
Mel Vecerík
Jonathan Scholz
VLM
60
31
0
01 Apr 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
156
370
0
30 Jan 2019
InfoBot: Transfer and Exploration via the Information Bottleneck
Anirudh Goyal
Riashat Islam
Daniel Strouse
Zafarali Ahmed
M. Botvinick
Hugo Larochelle
Yoshua Bengio
Sergey Levine
OffRL
141
167
0
30 Jan 2019
Generative Adversarial Self-Imitation Learning
Yijie Guo
Junhyuk Oh
Satinder Singh
Honglak Lee
GAN
102
59
0
03 Dec 2018
Time Reversal as Self-Supervision
Suraj Nair
Mohammad Babaeizadeh
Chelsea Finn
Sergey Levine
Vikash Kumar
SSL
96
12
0
02 Oct 2018
Backplay: "Man muss immer umkehren"
Cinjon Resnick
R. Raileanu
Sanyam Kapoor
Alex Peysakhovich
Kyunghyun Cho
Joan Bruna
OffRL
101
45
0
18 Jul 2018
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
134
222
0
20 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
G. Z. Holland
Erik Talvitie
Michael Bowling
AI4CE
72
43
0
05 Jun 2018
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
108
139
0
21 May 2018
1