Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL

Papers citing "Hindsight Experience Replay"

50 / 1,267 papers shown
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
71
17
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
44
18
0
01 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
127
38
0
26 Nov 2021
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
130
3
0
23 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
116
104
0
19 Nov 2021
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning
Christopher Hoang
Sungryull Sohn
Jongwook Choi
Wilka Carvalho
Honglak Lee
74
32
0
18 Nov 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics
Ingmar Schubert
Danny Driess
Ozgur S. Oguz
Marc Toussaint
OffRL
40
1
0
15 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
25
1
0
12 Nov 2021
One model Packs Thousands of Items with Recurrent Conditional Query Learning
Dongda Li
Zhaoquan Gu
Yuexuan Wang
Changwei Ren
F. Lau
81
17
0
12 Nov 2021
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation
Isabella Liu
Shagun Uppal
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
Youngwoon Lee
53
13
0
11 Nov 2021
Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments
Eivind Bøhn
E. M. Coates
D. Reinhardt
T. Johansen
43
29
0
07 Nov 2021
Automatic Goal Generation using Dynamical Distance Learning
Bharat Prakash
Nicholas R. Waytowich
T. Mohsenin
Tim Oates
41
2
0
07 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
81
43
0
04 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
136
64
0
04 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
48
7
0
04 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems
John Mern
Kyle Hatch
Ryan Silva
Cameron Hickert
Tamim I. Sookoor
Mykel J. Kochenderfer
AAML
68
7
0
03 Nov 2021
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
34
3
0
02 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
35
11
0
02 Nov 2021
Robot Learning from Randomized Simulations: A Review
Fabio Muratore
Fabio Ramos
Greg Turk
Wenhao Yu
Michael Gienger
Jan Peters
AI4CE
119
83
0
01 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
87
17
0
30 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
80
8
0
28 Oct 2021
Similarity-Aware Skill Reproduction based on Multi-Representational Learning from Demonstration
Brendan Hertel
S. Ahmadzadeh
53
8
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
62
5
0
27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
109
18
0
27 Oct 2021
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Beining Han
Chongyi Zheng
Harris Chan
Keiran Paster
Michael Ruogu Zhang
Jimmy Ba
OOD AI4CE
106
14
0
27 Oct 2021
Learning Diverse Policies in MOBA Games via Macro-Goals
Yiming Gao
Bei Shi
Xueying Du
Liang Wang
Guangwei Chen
...
Weixuan Wang
Deheng Ye
Qiang Fu
Wei Yang
Lanxiao Huang
76
11
0
27 Oct 2021
Multitask Adaptation by Retrospective Exploration with Learned World Models
Artem Zholus
Aleksandr I. Panov
CLL
29
0
0
25 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning
Kibeom Kim
Min Whoo Lee
Yoonsung Kim
Je-hwan Ryu
Minsu Lee
Byoung-Tak Zhang
71
8
0
25 Oct 2021
Learning Stochastic Shortest Path with Linear Function Approximation
Steffen Czolbe
Jiafan He
Adrian Dalca
Quanquan Gu
86
30
0
25 Oct 2021
Mixture-of-Variational-Experts for Continual Learning
Y. Yin
Yu Wang
CLL FedML
59
6
0
25 Oct 2021
Contrastive Active Inference
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
80
26
0
19 Oct 2021
Discovering and Achieving Goals via World Models
Russell Mendonca
Oleh Rybkin
Kostas Daniilidis
Danijar Hafner
Deepak Pathak
99
127
0
18 Oct 2021
Learn Proportional Derivative Controllable Latent Space from Pixels
Weiyao Wang
Marin Kobilarov
Gregory Hager
75
1
0
15 Oct 2021
Wasserstein Unsupervised Reinforcement Learning
Shuncheng He
Yuhang Jiang
Hongchang Zhang
Jianzhun Shao
Xiangyang Ji
OffRL
93
23
0
15 Oct 2021
Improving the sample-efficiency of neural architecture search with reinforcement learning
A. Nagy
Ábel Boros
118
3
0
13 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
100
36
0
12 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search Spaces
Marco Bagatella
Miroslav Olsák
Michal Rolínek
Georg Martius
OffRL
63
7
0
12 Oct 2021
Auditing Robot Learning for Safety and Compliance during Deployment
Homanga Bharadhwaj
37
4
0
12 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
80
14
0
10 Oct 2021
Learning Visual Shape Control of Novel 3D Deformable Objects from Partial-View Point Clouds
Bao Thach
Brian Y. Cho
Alan Kuntz
Tucker Hermans
3DPC
84
30
0
10 Oct 2021
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning
Keyu Li
Ye Lu
Max Meng
24
9
0
09 Oct 2021
Improving Kinodynamic Planners for Vehicular Navigation with Learned Goal-Reaching Controllers
Aravind Sivaramakrishnan
Edgar Granados
Seth Karten
T. McMahon
Kostas E. Bekris
49
7
0
08 Oct 2021
Learning to Centralize Dual-Arm Assembly
Marvin Alles
Elie Aljalbout
67
18
0
08 Oct 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations
Sindre Benjamin Remman
A. Lekkas
55
14
0
07 Oct 2021
Designing Composites with Target Effective Young's Modulus using Reinforcement Learning
Aldair E. Gongora
Siddharth Mysore
Beichen Li
Wan Shou
Wojciech Matusik
E. Morgan
Keith A. Brown
Emily Whiting
AI4CE
62
9
0
07 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
72
15
0
05 Oct 2021
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning
Jing Bi
Jiebo Luo
Chenliang Xu
124
49
0
05 Oct 2021
Large Batch Experience Replay
Thibault Lahire
Matthieu Geist
Emmanuel Rachelson
OffRL
100
13
0
04 Oct 2021
Sim and Real: Better Together
Shirli Di-Castro Shashua
Dotan DiCastro
Shie Mannor
130
11
0
01 Oct 2021
Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines
Jueming Hu
Zhe Xu
Weichang Wang
Guannan Qu
Yutian Pang
Yongming Liu
100
12
0
30 Sep 2021