ResearchTrend.AI
Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL

Papers citing "Hindsight Experience Replay"

50 / 1,243 papers shown
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep P. Chinchali
Ufuk Topcu
OffRL
94
0
0
02 Dec 2024
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Egor E. Nuzhin
Nikolai V. Brilliantov
59
1
0
21 Nov 2024
Pre-trained Visual Dynamics Representations for Efficient Policy Learning
Hao Luo
Bohan Zhou
Zongqing Lu
30
1
0
05 Nov 2024
Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically
Kefan Dong
Arvind V. Mahankali
Tengyu Ma
ReLM
LRM
30
6
0
04 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Yuanlin Duan
Wensen Mao
He Zhu
34
1
0
03 Nov 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
34
0
0
03 Nov 2024
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Anit Kumar Sahu
Mubarak Shah
Vinay P. Namboodiri
Amrit Singh Bedi
49
1
0
01 Nov 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
Beyazit Yalcinkaya
Niklas Lauffer
Marcell Vazquez-Chanlatte
S. Seshia
AI4CE
55
5
0
31 Oct 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
37
0
0
31 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
39
1
0
27 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
53
9
0
26 Oct 2024
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Zizhao Wang
Jiaheng Hu
Caleb Chuck
Stephen Chen
Roberto Martín-Martín
Amy Zhang
S. Niekum
Peter Stone
OffRL
61
0
0
24 Oct 2024
Safe Load Balancing in Software-Defined-Networking
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
34
0
0
22 Oct 2024
Interpretable end-to-end Neurosymbolic Reinforcement Learning agents
Nils Grandien
Quentin Delfosse
Kristian Kersting
OffRL
27
2
0
18 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
35
0
0
17 Oct 2024
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Loris Gaven
Clément Romac
Thomas Carta
Sylvain Lamprier
Olivier Sigaud
Pierre-Yves Oudeyer
LLMAG
OffRL
30
1
0
16 Oct 2024
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
29
1
0
16 Oct 2024
The State of Robot Motion Generation
Kostas E. Bekris
Joe H. Doerr
Patrick Meng
Sumanth Tangirala
3DV
36
2
0
16 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
159
0
0
11 Oct 2024
Effective Exploration Based on the Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
21
1
0
09 Oct 2024
Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation
Paul Jansonnie
Bingbing Wu
Julien Perez
Jan Peters
SSL
25
0
0
07 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
46
4
0
03 Oct 2024
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning
Alicia Li
Nishanth Kumar
Tomás Lozano-Pérez
Leslie Kaelbling
OffRL
47
0
0
28 Sep 2024
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
Tianyue Ou
Frank F. Xu
Aman Madaan
J. Liu
Robert Lo
Abishek Sridhar
Sudipta Sengupta
Dan Roth
Graham Neubig
Shuyan Zhou
OffRL
41
9
0
24 Sep 2024
Autonomous Wheel Loader Navigation Using Goal-Conditioned Actor-Critic MPC
Aleksi Mäki-Penttilä
Naeim Ebrahimi Toulkani
Reza Ghabcheloo
34
0
0
24 Sep 2024
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models
Viet Dung Nguyen
Zhizhuo Yang
Christopher L. Buckley
Alexander Ororbia
39
2
0
21 Sep 2024
Representing Positional Information in Generative World Models for Object Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Sai Rajeswar
LM&Ro
OCL
43
0
0
18 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
37
0
0
06 Sep 2024
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
36
3
0
05 Sep 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
37
0
0
05 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
39
0
0
04 Sep 2024
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
19
0
0
30 Aug 2024
Safe Policy Exploration Improvement via Subgoals
Brian Angulo
G. Gorbov
Aleksandr I. Panov
Konstantin Yakovlev
OffRL
32
0
0
25 Aug 2024
Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
Ria Doshi
Homer Walke
Oier Mees
Sudeep Dasari
Sergey Levine
45
48
0
21 Aug 2024
Online Behavior Modification for Expressive User Control of RL-Trained Robots
Isaac S. Sheidlower
Mavis Murdock
Emma Bethel
Reuben M. Aronson
E. Short
OffRL
37
3
0
15 Aug 2024
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
Ying Fan
Jingling Li
Adith Swaminathan
Aditya Modi
Ching-An Cheng
OffRL
72
0
0
14 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
42
1
0
11 Aug 2024
Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human Videos
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Hao Fu
Jinzhe Xue
Bin He
26
0
0
10 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
18
0
0
07 Aug 2024
Scalable Signal Temporal Logic Guided Reinforcement Learning via Value Function Space Optimization
Yiting He
Peiran Liu
Yiding Ji
OffRL
36
0
0
04 Aug 2024
Jacta: A Versatile Planner for Learning Dexterous and Whole-body Manipulation
Jan Brüdigam
Ali-Adeeb Abbas
Maks Sorokin
Kuan Fang
Brandon Hung
Maya Guru
Stefan Sosnowski
Jiuguang Wang
Sandra Hirche
Simon Le Cleac'h
36
2
0
02 Aug 2024
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Norman Di Palo
Leonard Hasenclever
Jan Humplik
Arunkumar Byravan
51
2
0
30 Jul 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
32
11
0
30 Jul 2024
Gymnasium: A Standard Interface for Reinforcement Learning Environments
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
...
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM
OffRL
24
164
0
24 Jul 2024
WayEx: Waypoint Exploration using a Single Demonstration
Mara Levy
Nirat Saini
Abhinav Shrivastava
61
1
0
22 Jul 2024
Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath
Dylan Slack
Jeff Da
Yuntao Ma
Hugh Zhang
Spencer Whitehead
Sean Hendryx
32
0
0
18 Jul 2024
Variable-Agnostic Causal Exploration for Reinforcement Learning
Minh Hoang Nguyen
Hung Le
Svetha Venkatesh
CML
35
2
0
17 Jul 2024
Investigating the Interplay of Prioritized Replay and Generalization
Parham Mohammad Panahi
Andrew Patterson
Martha White
Adam White
48
0
0
12 Jul 2024
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
37
2
0
11 Jul 2024
Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Kevin Yu
Jihye Roh
Ziang Li
Wenhao Gao
Runzhong Wang
Connor W. Coley
40
5
0
08 Jul 2024