Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.01495
Cited By
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hindsight Experience Replay"
50 / 1,244 papers shown
Title
Planning Goals for Exploration
E. Hu
Richard Chang
Oleh Rybkin
Dinesh Jayaraman
43
24
0
23 Mar 2023
A Survey of Historical Learning: Learning Models with Learning History
Xiang Li
Ge Wu
Lingfeng Yang
Wenzhe Wang
Renjie Song
Jian Yang
MU
AI4TS
31
2
0
23 Mar 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Junsup Kim
Younggyo Seo
Sungsoo Ahn
Kyunghwan Son
Jinwoo Shin
34
10
0
20 Mar 2023
Conversational Tree Search: A New Hybrid Dialog Task
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
51
7
0
17 Mar 2023
Efficient Learning of High Level Plans from Play
Núria Armengol Urpí
Marco Bagatella
Otmar Hilliges
Georg Martius
Stelian Coros
OffRL
27
3
0
16 Mar 2023
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Mianchu Wang
Yue Jin
Giovanni Montana
OffRL
21
3
0
16 Mar 2023
GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement Learning
Yaru Niu
Shiyu Jin
Zeqing Zhang
Jiacheng Zhu
Ding Zhao
Liangjun Zhang
35
7
0
09 Mar 2023
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Benedict Quartey
Ankit Shah
George Konidaris
33
3
0
09 Mar 2023
Grasping Student: semi-supervised learning for robotic manipulation
P. Krzywicki
Krzysztof Ciebiera
Rafal Michaluk
Inga Maziarz
Marek Cygan
SSL
27
0
0
08 Mar 2023
One-4-All: Neural Potential Fields for Embodied Navigation
Sacha Morin
Miguel A. Saavedra-Ruiz
Liam Paull
35
5
0
07 Mar 2023
Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning
Simon Guist
Jan Schneider-Barnes
Alexander Dittrich
V. Berenz
Bernhard Schölkopf
Le Chen
29
3
0
03 Mar 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
23
0
0
28 Feb 2023
Auxiliary Task-based Deep Reinforcement Learning for Quantum Control
Shumin Zhou
Hailan Ma
S. Kuang
Daoyi Dong
29
5
0
28 Feb 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
21
0
0
22 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
37
4
0
21 Feb 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
Yihao Feng
Shentao Yang
Shujian Zhang
Jianguo Zhang
Caiming Xiong
Mi Zhou
Haiquan Wang
OffRL
31
24
0
20 Feb 2023
Understanding the effect of varying amounts of replay per step
A. Paul
Videh Raj Nema
8
0
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation
Jing-Cheng Pang
Xinyi Yang
Sibei Yang
Yang Yu
29
8
0
18 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristen Kämäräinen
OffRL
OnRL
36
1
0
17 Feb 2023
Prioritized offline Goal-swapping Experience Replay
Wenyan Yang
Joni Pajarinen
Dinging Cai
Joni Kämäräinen
OffRL
OnRL
32
0
0
15 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld
Laetitia Teodorescu
Eric Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
32
1
0
10 Feb 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
Tianjun Zhang
Fangchen Liu
Justin Wong
Pieter Abbeel
Joseph E. Gonzalez
31
44
0
10 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
11
2
0
09 Feb 2023
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Yiheng Zhu
Jialun Wu
Chaowen Hu
Jiahuan Yan
Chang-Yu Hsieh
Tingjun Hou
Jian Wu
27
33
0
08 Feb 2023
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
43
5
0
07 Feb 2023
SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery
João Cartucho
Alistair Weld
Samyakh Tukra
Haozheng Xu
Hiroki Matsuzaki
...
B. Silva
Estevão Lima
João L. Vilaça
Sandro Queiros
Stamatia Giannarou
22
11
0
06 Feb 2023
Chain of Hindsight Aligns Language Models with Feedback
Hao Liu
Carmelo Sferrazza
Pieter Abbeel
ALM
31
117
0
06 Feb 2023
Skill Decision Transformer
Shyam Sudhakaran
S. Risi
OffRL
26
5
0
31 Jan 2023
On the Statistical Benefits of Temporal Difference Learning
David Cheikhi
Daniel Russo
11
4
0
30 Jan 2023
Distilling Internet-Scale Vision-Language Models into Embodied Agents
T. Sumers
Kenneth Marino
Arun Ahuja
Rob Fergus
Ishita Dasgupta
LM&Ro
46
24
0
29 Jan 2023
A Memory Efficient Deep Reinforcement Learning Approach For Snake Game Autonomous Agents
Md. Rafat Rahman Tushar
Shahnewaz Siddique
17
5
0
27 Jan 2023
Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation
Daesol Cho
Seungjae Lee
H. J. Kim
31
14
0
27 Jan 2023
Alien Coding
Thibault Gauthier
Miroslav Olsák
J. Urban
32
7
0
27 Jan 2023
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement Learning
Tairan He
Weiye Zhao
Changliu Liu
OffRL
34
17
0
24 Jan 2023
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
Ken Kansky
Skanda Vaidyanath
Scott Swingle
Xinghua Lou
Miguel Lazaro-Gredilla
Dileep George
26
4
0
24 Jan 2023
SMART: Self-supervised Multi-task pretrAining with contRol Transformers
Yanchao Sun
Shuang Ma
Ratnesh Madaan
Rogerio Bonatti
Furong Huang
Ashish Kapoor
35
40
0
24 Jan 2023
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Keshav Iyengar
Sarah Spurgeon
Danail Stoyanov
MedIm
18
4
0
22 Jan 2023
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
55
0
08 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
44
18
0
05 Jan 2023
Genetic Imitation Learning by Reward Extrapolation
Boyuan Zheng
Jianlong Zhou
Fang Chen
19
0
0
03 Jan 2023
Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot Learning
Yonghao Long
Wang Wei
Tao Huang
Yuehao Wang
Qingxu Dou
39
32
0
01 Jan 2023
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
22
2
0
28 Dec 2022
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park
Taeyoung Kim
Woohyeon Moon
L. Vecchietti
Dongsoo Har
OffRL
26
2
0
26 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
37
14
0
24 Dec 2022
SHIRO: Soft Hierarchical Reinforcement Learning
Kandai Watanabe
Mathew Strong
Omer Eldar
24
1
0
24 Dec 2022
Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search
Taisuke Kobayashi
17
1
0
21 Dec 2022
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
18
0
0
18 Dec 2022
Offline Reinforcement Learning for Visual Navigation
Dhruv Shah
Arjun Bhorkar
Hrish Leen
Ilya Kostrikov
Nicholas Rhinehart
Sergey Levine
OffRL
24
29
0
16 Dec 2022
Previous
1
2
3
...
7
8
9
...
23
24
25
Next