ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,267 papers shown
Title
Deep Learning with Experience Ranking Convolutional Neural Network for
  Robot Manipulator
Deep Learning with Experience Ranking Convolutional Neural Network for Robot Manipulator
Hai V. Nguyen
Hung M. La
M. Deans
SSLOffRL
35
8
0
16 Sep 2018
Sim-to-Real Transfer Learning using Robustified Controllers in Robotic
  Tasks involving Complex Dynamics
Sim-to-Real Transfer Learning using Robustified Controllers in Robotic Tasks involving Complex Dynamics
J. Baar
Alan Sullivan
Radu Cordorel
Devesh K. Jha
Diego Romeres
D. Nikovski
116
57
0
13 Sep 2018
Emergence of Scenario-Appropriate Collaborative Behaviors for Teams of
  Robotic Bodyguards
Emergence of Scenario-Appropriate Collaborative Behaviors for Teams of Robotic Bodyguards
Hassam Sheikh
Ladislau Bölöni
67
3
0
12 Sep 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
70
8
0
10 Sep 2018
Learning Adaptive Display Exposure for Real-Time Advertising
Learning Adaptive Display Exposure for Real-Time Advertising
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
...
Xiaotian Hao
Yixi Wang
Han Li
Jian Xu
Kun Gai
43
6
0
10 Sep 2018
Unity: A General Platform for Intelligent Agents
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
87
821
0
07 Sep 2018
Improving On-policy Learning with Statistical Reward Accumulation
Improving On-policy Learning with Statistical Reward Accumulation
Yubin Deng
K. Yu
Dahua Lin
Xiaoou Tang
Chen Change Loy
OffRL
31
0
0
07 Sep 2018
Challenges of Context and Time in Reinforcement Learning: Introducing
  Space Fortress as a Benchmark
Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark
Akshat Agarwal
Ryan Hope
Katia Sycara
OffRL
34
9
0
06 Sep 2018
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience
  Replay
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay
Sameera Lanka
Tianfu Wu
66
30
0
06 Sep 2018
Goal-oriented Dialogue Policy Learning from Failures
Goal-oriented Dialogue Policy Learning from Failures
Keting Lu
Shiqi Zhang
Xiaoping Chen
OffRL
46
29
0
20 Aug 2018
Variational Option Discovery Algorithms
Variational Option Discovery Algorithms
Joshua Achiam
Harrison Edwards
Dario Amodei
Pieter Abbeel
DRL
76
180
0
26 Jul 2018
Learning Plannable Representations with Causal InfoGAN
Learning Plannable Representations with Causal InfoGAN
Thanard Kurutach
Aviv Tamar
Ge Yang
Stuart J. Russell
Pieter Abbeel
GANDRL
82
181
0
24 Jul 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
Hierarchical Reinforcement Learning Framework towards Multi-agent
  Navigation
Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
Wenhao Ding
Shuaijun Li
Huihuan Qian
114
32
0
14 Jul 2018
Visual Reinforcement Learning with Imagined Goals
Visual Reinforcement Learning with Imagined Goals
Ashvin Nair
Vitchyr H. Pong
Murtaza Dalal
Shikhar Bahl
Steven Lin
Sergey Levine
SSL
93
544
0
12 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic
  Parsing
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Chen Liang
Mohammad Norouzi
Jonathan Berant
Quoc V. Le
Ni Lao
128
134
0
06 Jul 2018
A survey on policy search algorithms for learning robot controllers in a
  handful of trials
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
103
155
0
06 Jul 2018
Goal-oriented Trajectories for Efficient Exploration
Goal-oriented Trajectories for Efficient Exploration
Fabio Pardo
Vitaly Levdik
Petar Kormushev
31
2
0
05 Jul 2018
Curiosity Driven Exploration of Learned Disentangled Goal Spaces
Curiosity Driven Exploration of Learned Disentangled Goal Spaces
A. Laversanne-Finot
Alexandre Péré
Pierre-Yves Oudeyer
DRL
81
88
0
04 Jul 2018
Illuminating Generalization in Deep Reinforcement Learning through
  Procedural Level Generation
Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation
Niels Justesen
R. Torrado
Philip Bontrager
Ahmed Khalifa
Julian Togelius
S. Risi
177
186
0
28 Jun 2018
Adversarial Active Exploration for Inverse Dynamics Model Learning
Adversarial Active Exploration for Inverse Dynamics Model Learning
Zhang-Wei Hong
Tsu-Jui Fu
Tzu-Yun Shann
Yi-Hsiang Chang
Chun-Yi Lee
33
6
0
26 Jun 2018
Accuracy-based Curriculum Learning in Deep Reinforcement Learning
Accuracy-based Curriculum Learning in Deep Reinforcement Learning
Pierre Fournier
Olivier Sigaud
Mohamed Chetouani
Pierre-Yves Oudeyer
ODL
137
39
0
25 Jun 2018
Multi-objective Model-based Policy Search for Data-efficient Learning
  with Sparse Rewards
Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards
Rituraj Kaushik
Konstantinos Chatzilygeroudis
Jean-Baptiste Mouret
72
19
0
25 Jun 2018
Many-Goals Reinforcement Learning
Many-Goals Reinforcement Learning
Vivek Veeriah
Junhyuk Oh
Satinder Singh
KELM
75
53
0
22 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
77
361
0
20 Jun 2018
Learning 6-DoF Grasping and Pick-Place Using Attention Focus
Learning 6-DoF Grasping and Pick-Place Using Attention Focus
Marcus Gualtieri
Robert Platt
99
56
0
15 Jun 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
87
478
0
14 Jun 2018
Unsupervised Meta-Learning for Reinforcement Learning
Unsupervised Meta-Learning for Reinforcement Learning
Abhishek Gupta
Benjamin Eysenbach
Chelsea Finn
Sergey Levine
SSLOffRL
136
107
0
12 Jun 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
99
68
0
25 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
102
814
0
21 May 2018
Hierarchical Reinforcement Learning with Hindsight
Andrew Levy
Robert Platt
Kate Saenko
102
84
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
132
232
0
21 May 2018
Reward Learning from Narrated Demonstrations
Reward Learning from Narrated Demonstrations
H. Tung
Adam W. Harley
Liang-Kang Huang
Katerina Fragkiadaki
LM&RoSSL
88
29
0
27 Apr 2018
Deep Reinforcement Learning to Acquire Navigation Skills for
  Wheel-Legged Robots in Complex Environments
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
102
44
0
27 Apr 2018
Zero-Shot Visual Imitation
Zero-Shot Visual Imitation
Deepak Pathak
Parsa Mahmoudieh
Guanghao Luo
Pulkit Agrawal
Dian Chen
Yide Shentu
Evan Shelhamer
Jitendra Malik
Alexei A. Efros
Trevor Darrell
LM&Ro
124
301
0
23 Apr 2018
Universal Planning Networks
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
79
145
0
02 Apr 2018
Automated Curriculum Learning by Rewarding Temporally Rare Events
Automated Curriculum Learning by Rewarding Temporally Rare Events
Niels Justesen
S. Risi
OffRL
69
20
0
19 Mar 2018
Composable Deep Reinforcement Learning for Robotic Manipulation
Composable Deep Reinforcement Learning for Robotic Manipulation
Tuomas Haarnoja
Vitchyr H. Pong
Aurick Zhou
Murtaza Dalal
Pieter Abbeel
Sergey Levine
143
234
0
19 Mar 2018
Composable Planning with Attributes
Composable Planning with Attributes
Amy Zhang
Adam Lerer
Sainbayar Sukhbaatar
Rob Fergus
Arthur Szlam
175
64
0
01 Mar 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
113
450
0
28 Feb 2018
Computational Theories of Curiosity-Driven Learning
Computational Theories of Curiosity-Driven Learning
Pierre-Yves Oudeyer
89
65
0
28 Feb 2018
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and
  Request for Research
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
...
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
75
573
0
26 Feb 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
151
240
0
25 Feb 2018
Reinforcement Learning on Web Interfaces Using Workflow-Guided
  Exploration
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Emmy Liu
Kelvin Guu
Panupong Pasupat
Tianlin Shi
Percy Liang
OnRL
79
223
0
24 Feb 2018
Unicorn: Continual Learning with a Universal, Off-policy Agent
Unicorn: Continual Learning with a Universal, Off-policy Agent
D. Mankowitz
Augustin Žídek
André Barreto
Dan Horgan
Matteo Hessel
John Quan
Junhyuk Oh
H. V. Hasselt
David Silver
Tom Schaul
CLLOffRL
70
48
0
22 Feb 2018
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement
  Learning Algorithms
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
135
159
0
14 Feb 2018
Evolved Policy Gradients
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
105
227
0
13 Feb 2018
ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks
ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks
Tae-Hoon Kim
Jonghyun Choi
89
9
0
03 Jan 2018
RLlib: Abstractions for Distributed Reinforcement Learning
RLlib: Abstractions for Distributed Reinforcement Learning
Eric Liang
Richard Liaw
Philipp Moritz
Robert Nishihara
Roy Fox
Ken Goldberg
Joseph E. Gonzalez
Michael I. Jordan
Ion Stoica
OffRLAI4CE
107
175
0
26 Dec 2017
Learning Multi-Level Hierarchies with Hindsight
Learning Multi-Level Hierarchies with Hindsight
Andrew Levy
George Konidaris
Robert Platt
Kate Saenko
120
79
0
04 Dec 2017
Previous
123...242526
Next