1707.01495
Cited By
v3 (latest)
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,267 papers shown
Title
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
130
9
0
03 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
M. Gerstgrasser
Tom Danino
Sarah Keren
77
7
0
01 Nov 2023
Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback
Max Balsells
M. Torné
Zihan Wang
Samedh Desai
Pulkit Agrawal
Abhishek Gupta
97
10
0
31 Oct 2023
Learning to Discover Skills through Guidance
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Sejik Park
Kyushik Min
Jaegul Choo
112
6
0
31 Oct 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
72
17
0
31 Oct 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
115
11
0
30 Oct 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
84
10
0
30 Oct 2023
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Zhaoyi Zhou
Chuning Zhu
Runlong Zhou
Qiwen Cui
Abhishek Gupta
S. S. Du
OffRL
82
9
0
30 Oct 2023
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement
Daesol Cho
Seungjae Lee
H. J. Kim
OODD
84
2
0
30 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
81
9
0
28 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
81
4
0
27 Oct 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
Nicholas Corrado
Josiah P. Hanna
83
5
0
26 Oct 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
85
9
0
26 Oct 2023
Learning Agility and Adaptive Legged Locomotion via Curricular Hindsight Reinforcement Learning
Sicen Li
Yiming Pang
Panju Bai
Zhaojin Liu
Jiawei Li
Shihao Hu
Liquan Wang
Gang Wang
80
3
0
24 Oct 2023
Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Zidan Wang
Takeru Oba
Takuma Yoneda
Rui Shen
Matthew R. Walter
Bradly C. Stadie
DiffM
110
10
0
21 Oct 2023
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRM
ReLM
104
22
0
20 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li
Chen Gong
Qiang He
Xinwen Hou
64
1
0
17 Oct 2023
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Niket Tandon
Li Zhang
Chris Callison-Burch
Peter Clark
LRM
LLMAG
CLL
66
48
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
121
33
0
15 Oct 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
161
27
0
12 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
212
150
0
10 Oct 2023
Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot Collaboration
Jakob Thumm
Felix Trost
Matthias Althoff
OffRL
96
6
0
09 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
87
23
0
09 Oct 2023
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&Ro
PINN
79
215
0
09 Oct 2023
Compositional Servoing by Recombining Demonstrations
Max Argus
Abhijeet Nayak
Martin Buchner
Silvio Galesso
Abhinav Valada
Thomas Brox
72
0
0
06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
59
1
0
06 Oct 2023
Pre-Training and Fine-Tuning Generative Flow Networks
Ling Pan
Moksh Jain
Kanika Madan
Yoshua Bengio
111
13
0
05 Oct 2023
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning under Dynamics
Aravind Sivaramakrishnan
Sumanth Tangirala
Edgar Granados
Noah R. Carver
Kostas E. Bekris
63
3
0
05 Oct 2023
Learning to Reach Goals via Diffusion
V. Jain
Siamak Ravanbakhsh
DiffM
OffRL
87
5
0
04 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
103
31
0
03 Oct 2023
Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
OffRL
OnRL
78
1
0
03 Oct 2023
Efficient Planning with Latent Diffusion
Wenhao Li
DiffM
137
8
0
30 Sep 2023
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Mikihisa Yuasa
Huy T. Tran
R. Sreenivas
FAtt
LRM
151
1
0
29 Sep 2023
HyperPPO: A scalable method for finding small policies for robotic control
Luming Tang
Zhehui Huang
Gaurav Sukhatme
65
4
0
28 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
70
1
0
28 Sep 2023
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
72
1
0
28 Sep 2023
Learning to Terminate in Object Navigation
Yuhang Song
Anh Nguyen
Chun-Yi Lee
60
3
0
28 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
81
1
0
27 Sep 2023
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd Murphey
AI4CE
DiffM
99
5
0
26 Sep 2023
On the Benefit of Optimal Transport for Curriculum Reinforcement Learning
Pascal Klink
Carlo D'Eramo
Jan Peters
Joni Pajarinen
84
3
0
25 Sep 2023
Policy Stitching: Learning Transferable Robot Policies
Pingcheng Jian
Easop Lee
Zachary I. Bell
Michael M. Zavlanos
Boyuan Chen
OffRL
61
8
0
24 Sep 2023
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills
Zenan Li
Fan Nie
Q. Sun
Fang Da
Hang Zhao
OffRL
64
7
0
24 Sep 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
96
2
0
24 Sep 2023
Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Chethan Bhateja
Derek Guo
Dibya Ghosh
Anika Singh
Manan Tomar
Q. Vuong
Yevgen Chebotar
Sergey Levine
Aviral Kumar
OffRL
104
22
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
92
20
0
22 Sep 2023
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
203
90
0
18 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
103
9
0
18 Sep 2023
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning
Cristina Pinneri
Sarah Bechtle
Markus Wulfmeier
Arunkumar Byravan
Jingwei Zhang
William F. Whitney
Martin Riedmiller
OffRL
53
2
0
14 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
88
0
0
08 Sep 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
83
0
0
31 Aug 2023