ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXivPDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,244 papers shown
Title
Planning Goals for Exploration
Planning Goals for Exploration
E. Hu
Richard Chang
Oleh Rybkin
Dinesh Jayaraman
43
24
0
23 Mar 2023
A Survey of Historical Learning: Learning Models with Learning History
A Survey of Historical Learning: Learning Models with Learning History
Xiang Li
Ge Wu
Lingfeng Yang
Wenzhe Wang
Renjie Song
Jian Yang
MU
AI4TS
31
2
0
23 Mar 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Imitating Graph-Based Planning with Goal-Conditioned Policies
Junsup Kim
Younggyo Seo
Sungsoo Ahn
Kyunghwan Son
Jinwoo Shin
34
10
0
20 Mar 2023
Conversational Tree Search: A New Hybrid Dialog Task
Conversational Tree Search: A New Hybrid Dialog Task
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
51
7
0
17 Mar 2023
Efficient Learning of High Level Plans from Play
Efficient Learning of High Level Plans from Play
Núria Armengol Urpí
Marco Bagatella
Otmar Hilliges
Georg Martius
Stelian Coros
OffRL
27
3
0
16 Mar 2023
Goal-conditioned Offline Reinforcement Learning through State Space
  Partitioning
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Mianchu Wang
Yue Jin
Giovanni Montana
OffRL
21
3
0
16 Mar 2023
GOATS: Goal Sampling Adaptation for Scooping with Curriculum
  Reinforcement Learning
GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement Learning
Yaru Niu
Shiyu Jin
Zeqing Zhang
Jiacheng Zhu
Ding Zhao
Liangjun Zhang
35
7
0
09 Mar 2023
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Benedict Quartey
Ankit Shah
George Konidaris
33
3
0
09 Mar 2023
Grasping Student: semi-supervised learning for robotic manipulation
Grasping Student: semi-supervised learning for robotic manipulation
P. Krzywicki
Krzysztof Ciebiera
Rafal Michaluk
Inga Maziarz
Marek Cygan
SSL
27
0
0
08 Mar 2023
One-4-All: Neural Potential Fields for Embodied Navigation
One-4-All: Neural Potential Fields for Embodied Navigation
Sacha Morin
Miguel A. Saavedra-Ruiz
Liam Paull
35
5
0
07 Mar 2023
Hindsight States: Blending Sim and Real Task Elements for Efficient
  Reinforcement Learning
Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning
Simon Guist
Jan Schneider-Barnes
Alexander Dittrich
V. Berenz
Bernhard Schölkopf
Le Chen
29
3
0
03 Mar 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
23
0
0
28 Feb 2023
Auxiliary Task-based Deep Reinforcement Learning for Quantum Control
Auxiliary Task-based Deep Reinforcement Learning for Quantum Control
Shumin Zhou
Hailan Ma
S. Kuang
Daoyi Dong
29
5
0
28 Feb 2023
A Supervisory Learning Control Framework for Autonomous & Real-time Task
  Planning for an Underactuated Cooperative Robotic task
A Supervisory Learning Control Framework for Autonomous & Real-time Task Planning for an Underactuated Cooperative Robotic task
Sander De Witte
Tom Lefebvre
Thijs Van Hauwermeiren
Guillaume Crevecoeur
21
0
0
22 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement
  Learning
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
37
4
0
21 Feb 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning
  for Task-oriented Dialogue Systems
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
Yihao Feng
Shentao Yang
Shujian Zhang
Jianguo Zhang
Caiming Xiong
Mi Zhou
Haiquan Wang
OffRL
31
24
0
20 Feb 2023
Understanding the effect of varying amounts of replay per step
Understanding the effect of varying amounts of replay per step
A. Paul
Videh Raj Nema
8
0
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration
  for Task Automation of Surgical Robot
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Natural Language-conditioned Reinforcement Learning with Inside-out Task
  Language Development and Translation
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation
Jing-Cheng Pang
Xinyi Yang
Sibei Yang
Yang Yu
29
8
0
18 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristen Kämäräinen
OffRL
OnRL
36
1
0
17 Feb 2023
Prioritized offline Goal-swapping Experience Replay
Prioritized offline Goal-swapping Experience Replay
Wenyan Yang
Joni Pajarinen
Dinging Cai
Joni Kämäräinen
OffRL
OnRL
32
0
0
15 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
A Song of Ice and Fire: Analyzing Textual Autotelic Agents in
  ScienceWorld
A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld
Laetitia Teodorescu
Eric Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
32
1
0
10 Feb 2023
The Wisdom of Hindsight Makes Language Models Better Instruction
  Followers
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
Tianjun Zhang
Fangchen Liu
Justin Wong
Pieter Abbeel
Joseph E. Gonzalez
31
44
0
10 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
11
2
0
09 Feb 2023
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Yiheng Zhu
Jialun Wu
Chaowen Hu
Jiahuan Yan
Chang-Yu Hsieh
Tingjun Hou
Jian Wu
27
33
0
08 Feb 2023
Object-Centric Scene Representations using Active Inference
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
43
5
0
07 Feb 2023
SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery
SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery
João Cartucho
Alistair Weld
Samyakh Tukra
Haozheng Xu
Hiroki Matsuzaki
...
B. Silva
Estevão Lima
João L. Vilaça
Sandro Queiros
Stamatia Giannarou
22
11
0
06 Feb 2023
Chain of Hindsight Aligns Language Models with Feedback
Chain of Hindsight Aligns Language Models with Feedback
Hao Liu
Carmelo Sferrazza
Pieter Abbeel
ALM
31
117
0
06 Feb 2023
Skill Decision Transformer
Skill Decision Transformer
Shyam Sudhakaran
S. Risi
OffRL
26
5
0
31 Jan 2023
On the Statistical Benefits of Temporal Difference Learning
On the Statistical Benefits of Temporal Difference Learning
David Cheikhi
Daniel Russo
11
4
0
30 Jan 2023
Distilling Internet-Scale Vision-Language Models into Embodied Agents
Distilling Internet-Scale Vision-Language Models into Embodied Agents
T. Sumers
Kenneth Marino
Arun Ahuja
Rob Fergus
Ishita Dasgupta
LM&Ro
46
24
0
29 Jan 2023
A Memory Efficient Deep Reinforcement Learning Approach For Snake Game
  Autonomous Agents
A Memory Efficient Deep Reinforcement Learning Approach For Snake Game Autonomous Agents
Md. Rafat Rahman Tushar
Shahnewaz Siddique
17
5
0
27 Jan 2023
Outcome-directed Reinforcement Learning by Uncertainty & Temporal
  Distance-Aware Curriculum Goal Generation
Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation
Daesol Cho
Seungjae Lee
H. J. Kim
31
14
0
27 Jan 2023
Alien Coding
Alien Coding
Thibault Gauthier
Miroslav Olsák
J. Urban
32
7
0
27 Jan 2023
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement
  Learning
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement Learning
Tairan He
Weiye Zhao
Changliu Liu
OffRL
34
17
0
24 Jan 2023
PushWorld: A benchmark for manipulation planning with tools and movable
  obstacles
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
Ken Kansky
Skanda Vaidyanath
Scott Swingle
Xinghua Lou
Miguel Lazaro-Gredilla
Dileep George
26
4
0
24 Jan 2023
SMART: Self-supervised Multi-task pretrAining with contRol Transformers
SMART: Self-supervised Multi-task pretrAining with contRol Transformers
Yanchao Sun
Shuang Ma
Ratnesh Madaan
Rogerio Bonatti
Furong Huang
Ashish Kapoor
35
40
0
24 Jan 2023
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Keshav Iyengar
Sarah Spurgeon
Danail Stoyanov
MedIm
18
4
0
22 Jan 2023
A Survey on Transformers in Reinforcement Learning
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
55
0
08 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward
  Shaping
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
44
18
0
05 Jan 2023
Genetic Imitation Learning by Reward Extrapolation
Genetic Imitation Learning by Reward Extrapolation
Boyuan Zheng
Jianlong Zhou
Fang Chen
19
0
0
03 Jan 2023
Human-in-the-loop Embodied Intelligence with Interactive Simulation
  Environment for Surgical Robot Learning
Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot Learning
Yonghao Long
Wang Wei
Tao Huang
Yuehao Wang
Qingxu Dou
39
32
0
01 Jan 2023
Towards automating Codenames spymasters with deep reinforcement learning
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
22
2
0
28 Dec 2022
Off-Policy Reinforcement Learning with Loss Function Weighted by
  Temporal Difference Error
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park
Taeyoung Kim
Woohyeon Moon
L. Vecchietti
Dongsoo Har
OffRL
26
2
0
26 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
37
14
0
24 Dec 2022
SHIRO: Soft Hierarchical Reinforcement Learning
SHIRO: Soft Hierarchical Reinforcement Learning
Kandai Watanabe
Mathew Strong
Omer Eldar
24
1
0
24 Dec 2022
Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening
  Search
Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search
Taisuke Kobayashi
17
1
0
21 Dec 2022
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer
  across Agents
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
18
0
0
18 Dec 2022
Offline Reinforcement Learning for Visual Navigation
Offline Reinforcement Learning for Visual Navigation
Dhruv Shah
Arjun Bhorkar
Hrish Leen
Ilya Kostrikov
Nicholas Rhinehart
Sergey Levine
OffRL
24
29
0
16 Dec 2022
Previous
123...789...232425
Next