1707.01495
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,267 papers shown
Title
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models
Viet Dung Nguyen
Zhizhuo Yang
Christopher L. Buckley
Alexander Ororbia
89
4
0
21 Sep 2024
Representing Positional Information in Generative World Models for Object Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Sai Rajeswar
LM&Ro
OCL
73
0
0
18 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
86
0
0
06 Sep 2024
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
78
3
0
05 Sep 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
93
1
0
05 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
127
0
0
04 Sep 2024
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
56
0
0
30 Aug 2024
Safe Policy Exploration Improvement via Subgoals
Brian Angulo
G. Gorbov
Aleksandr I. Panov
Konstantin Yakovlev
OffRL
68
0
0
25 Aug 2024
Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
Ria Doshi
Homer Walke
Oier Mees
Sudeep Dasari
Sergey Levine
140
59
0
21 Aug 2024
Online Behavior Modification for Expressive User Control of RL-Trained Robots
Isaac S. Sheidlower
Mavis Murdock
Emma Bethel
Reuben M. Aronson
E. Short
OffRL
88
3
0
15 Aug 2024
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
Ying Fan
Jingling Li
Adith Swaminathan
Aditya Modi
Ching-An Cheng
OffRL
134
0
0
14 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
131
2
0
11 Aug 2024
Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human Videos
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Hao Fu
Jinzhe Xue
Bin He
139
1
0
10 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
67
0
0
07 Aug 2024
Scalable Signal Temporal Logic Guided Reinforcement Learning via Value Function Space Optimization
Yiting He
Peiran Liu
Yiding Ji
OffRL
87
0
0
04 Aug 2024
Jacta: A Versatile Planner for Learning Dexterous and Whole-body Manipulation
Jan Brüdigam
Ali-Adeeb Abbas
Maks Sorokin
Kuan Fang
Brandon Hung
Maya Guru
Stefan Sosnowski
Jiuguang Wang
Sandra Hirche
Simon Le Cleac'h
99
3
0
02 Aug 2024
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Norman Di Palo
Leonard Hasenclever
Jan Humplik
Arunkumar Byravan
74
3
0
30 Jul 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
95
14
0
30 Jul 2024
Gymnasium: A Standard Interface for Reinforcement Learning Environments
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
...
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM
OffRL
88
212
0
24 Jul 2024
WayEx: Waypoint Exploration using a Single Demonstration
Mara Levy
Nirat Saini
Abhinav Shrivastava
90
1
0
22 Jul 2024
Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath
Dylan Slack
Jeff Da
Yuntao Ma
Hugh Zhang
Spencer Whitehead
Sean Hendryx
56
0
0
18 Jul 2024
Variable-Agnostic Causal Exploration for Reinforcement Learning
Minh Hoang Nguyen
Hung Le
Svetha Venkatesh
CML
61
2
0
17 Jul 2024
Investigating the Interplay of Prioritized Replay and Generalization
Parham Mohammad Panahi
Andrew Patterson
Martha White
Adam White
81
0
0
12 Jul 2024
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
84
3
0
11 Jul 2024
Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Kevin Yu
Jihye Roh
Ziang Li
Wenhao Gao
Runzhong Wang
Connor W. Coley
108
8
0
08 Jul 2024
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm
Abdeslam Boularias
56
1
0
07 Jul 2024
Embracing Massive Medical Data
Yu-Cheng Chou
Zongwei Zhou
Alan Yuille
CLL
OOD
62
4
0
05 Jul 2024
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao
Shengjun Fang
Chenjun Xiao
Yang Yu
Zongzhang Zhang
OffRL
54
1
0
05 Jul 2024
EAGERx: Graph-Based Framework for Sim2real Robot Learning
B. V. D. Heijden
Jelle Luijkx
Laura Ferranti
Jens Kober
Robert Babuška
59
0
0
05 Jul 2024
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
101
2
0
30 Jun 2024
Learning Formal Mathematics From Intrinsic Motivation
Gabriel Poesia
David Broman
Nick Haber
Noah D. Goodman
LRM
109
17
0
30 Jun 2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Gautham Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
82
3
0
29 Jun 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
59
0
0
26 Jun 2024
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Johannes Czech
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&Ro
LRM
88
1
0
24 Jun 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
144
14
0
24 Jun 2024
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
86
1
0
22 Jun 2024
Learning telic-controllable state representations
Nadav Amir
Stas Tiomkin
Angela Langdon
88
0
0
20 Jun 2024
Metacognitive AI: Framework and the Case for a Neurosymbolic Approach
Hua Wei
Paulo Shakarian
Christian Lebiere
Bruce Draper
Nikhil Krishnaswamy
Sergei Nirenburg
LRM
66
6
0
17 Jun 2024
Large Reasoning Models for 3D Floorplanning in EDA: Learning from Imperfections
Fin Amin
N. Rouf
Tse-Han Pan
Md. Kamal Ibn Shafi
Paul D. Franzon
42
0
0
15 Jun 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
91
20
0
13 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman S. Kozat
Ozgur S. Oguz
29
1
0
13 Jun 2024
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
Zhenglong Luo
Zhiyong Chen
James Welsh
37
1
0
12 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
100
1
0
09 Jun 2024
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
Michał Zawalski
Gracjan Góral
Michał Tyrolski
Emilia Wisnios
Franciszek Budrowski
Marek Cygan
Łukasz Kuciński
Piotr Miłoś
113
0
0
05 Jun 2024
Multi-Agent Transfer Learning via Temporal Contrastive Learning
Weihao Zeng
Joseph Campbell
Simon Stepputtis
Katia Sycara
OffRL
110
2
0
03 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
116
2
0
03 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
204
7
0
03 Jun 2024
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning
Po-Shao Lin
Jia-Fong Yeh
Yi-Ting Chen
Winston H. Hsu
85
0
0
02 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
130
18
0
02 Jun 2024
Exploring the limits of Hierarchical World Models in Reinforcement Learning
Robin Schiewer
Anand Subramoney
Laurenz Wiskott
76
1
0
01 Jun 2024