1707.01495
Cited By
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,267 papers shown
Title
Imitating Past Successes can be Very Suboptimal
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
92
19
0
07 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
65
2
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
87
58
0
07 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
41
1
0
06 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
98
27
0
02 Jun 2022
When does return-conditioned supervised learning work for offline reinforcement learning?
David Brandfonbrener
A. Bietti
Jacob Buckman
Romain Laroche
Joan Bruna
OffRL
77
65
0
02 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygóźdź
Damian Stachura
Piotr Piękos
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
LRM
140
11
0
01 Jun 2022
Human-AI Shared Control via Policy Dissection
Quanyi Li
Zhenghao Peng
Haibin Wu
Lan Feng
Bolei Zhou
80
13
0
31 May 2022
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems
Pierre Schumacher
Daniel Haeufle
Le Chen
Syn Schmitt
Georg Martius
70
34
0
30 May 2022
Autoformalization with Large Language Models
Yuhuai Wu
Albert Q. Jiang
Wenda Li
M. Rabe
Charles Staats
M. Jamnik
Christian Szegedy
AI4CE
293
178
0
25 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
90
27
0
25 May 2022
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
137
60
0
24 May 2022
Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla
Diego Perez-Liebana
36
1
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
331
705
0
20 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
58
0
0
20 May 2022
Transformer with Memory Replay
R. Liu
Barzan Mozafari
OffRL
105
4
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
107
15
0
19 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
109
35
0
17 May 2022
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments
Jakob Thumm
Matthias Althoff
129
36
0
12 May 2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Archit Sharma
Rehaan Ahmad
Chelsea Finn
OOD
OffRL
69
21
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
32
2
0
08 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
80
1
0
06 May 2022
State Representation Learning for Goal-Conditioned Reinforcement Learning
Lorenzo Steccanella
Anders Jonsson
SSL
OffRL
61
5
0
04 May 2022
Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery
Daesol Cho
Jigang Kim
H. J. Kim
OffRL
SSL
104
17
0
29 Apr 2022
Bilinear value networks
Zhang-Wei Hong
Ge Yang
Pulkit Agrawal
OffRL
75
8
0
28 Apr 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
107
28
0
27 Apr 2022
Relational Abstractions for Generalized Reinforcement Learning on Symbolic Problems
Rushang Karia
Siddharth Srivastava
NAI
OffRL
42
12
0
27 Apr 2022
Executive Function: A Contrastive Value Policy for Resampling and Relabeling Perceptions via Hindsight Summarization?
Christopher T. Lengerich
Ben Lengerich
55
1
0
27 Apr 2022
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?
Yuchen Cui
S. Niekum
Abhi Gupta
Vikash Kumar
Aravind Rajeswaran
LM&Ro
88
80
0
23 Apr 2022
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Gheorghe Comanici
Amelia Glaese
Anita Gergely
Daniel Toyama
Zafarali Ahmed
Tyler Jackson
P. Hamel
Doina Precup
22
2
0
21 Apr 2022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Charles Burton Snell
Mengjiao Yang
Justin Fu
Yi Su
Sergey Levine
61
22
0
18 Apr 2022
Divide & Conquer Imitation Learning
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
115
5
0
15 Apr 2022
Efficient and practical quantum compiler towards multi-qubit systems with deep reinforcement learning
Qiuhao Chen
Yuxuan Du
Qi Zhao
Yuliang Jiao
Xiliang Lu
Xingyao Wu
59
13
0
14 Apr 2022
GloCAL: Glocalized Curriculum-Aided Learning of Multiple Tasks with Application to Robotic Grasping
Anil Kurkcu
C. Acar
D. Campolo
K. P. Tee
61
1
0
14 Apr 2022
What Matters in Language Conditioned Robotic Imitation Learning over Unstructured Data
Oier Mees
Lukás Hermann
Wolfram Burgard
LM&Ro
109
156
0
13 Apr 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
66
1
0
11 Apr 2022
Learning Object-Centered Autotelic Behaviors with Graph Neural Networks
Ahmed Akakzia
Olivier Sigaud
72
0
0
11 Apr 2022
gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning Approach
Johannes Dornheim
OffRL
AI4CE
51
3
0
11 Apr 2022
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics
Frank Röder
Manfred Eppe
S. Wermter
80
7
0
08 Apr 2022
Automatic Parameter Optimization Using Genetic Algorithm in Deep Reinforcement Learning for Robotic Manipulation Tasks
Adarsh Sehgal
Nicholas Ward
Hung M. La
S. Louis
45
1
0
07 Apr 2022
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
45
0
0
05 Apr 2022
Hierarchical Reinforcement Learning under Mixed Observability
Hai V. Nguyen
Zhihan Yang
Andrea Baisero
Xiao Ma
Robert Platt
Chris Amato
56
4
0
02 Apr 2022
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
Xingyu Lin
Zhiao Huang
Yunzhu Li
J. Tenenbaum
David Held
Chuang Gan
88
73
0
31 Mar 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
47
1
0
29 Mar 2022
A Visual Navigation Perspective for Category-Level Object Pose Estimation
Jiaxin Guo
Fangxun Zhong
R. Xiong
Yunhui Liu
Yue Wang
Yiyi Liao
OCL
85
7
0
25 Mar 2022
The Challenges of Continuous Self-Supervised Learning
Senthil Purushwalkam
Pedro Morgado
Abhinav Gupta
CLL
85
44
0
23 Mar 2022
Possibility Before Utility: Learning And Using Hierarchical Affordances
Robby Costales
Shariq Iqbal
Fei Sha
73
5
0
23 Mar 2022
One After Another: Learning Incremental Skills for a Changing World
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
CLL
76
13
0
21 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
73
41
0
19 Mar 2022
Goal-conditioned dual-action imitation learning for dexterous dual-arm robot manipulation
Heecheol Kim
Yoshiyuki Ohmura
Yasuo Kuniyoshi
87
28
0
18 Mar 2022