ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,267 papers shown
Title
Scaling Relationship on Learning Mathematical Reasoning with Large
  Language Models
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRMALM
127
205
0
03 Aug 2023
ETHER: Aligning Emergent Communication for Hindsight Experience Replay
ETHER: Aligning Emergent Communication for Hindsight Experience Replay
Kevin Denamganai
Daniel Hernández
Ozan Vardal
S. Missaoui
James Alfred Walker
51
0
0
28 Jul 2023
Contrastive Example-Based Control
Contrastive Example-Based Control
Kyle Hatch
Benjamin Eysenbach
Rafael Rafailov
Tianhe Yu
Ruslan Salakhutdinov
Sergey Levine
Chelsea Finn
OffRL
99
4
0
24 Jul 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement
  Learning via Latent Landmark Graphs
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
56
1
0
22 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
128
61
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
68
1
0
21 Jul 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from
  Human-in-the-Loop Feedback
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
90
8
0
20 Jul 2023
Goal-Conditioned Reinforcement Learning with Disentanglement-based
  Reachability Planning
Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Bin He
62
3
0
20 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
110
11
0
20 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement
  Learning
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
125
5
0
16 Jul 2023
The SocialAI School: Insights from Developmental Psychology Towards
  Artificial Socio-Cultural Agents
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
60
19
0
15 Jul 2023
Bi-Touch: Bimanual Tactile Manipulation with Sim-to-Real Deep
  Reinforcement Learning
Bi-Touch: Bimanual Tactile Manipulation with Sim-to-Real Deep Reinforcement Learning
Yijiong Lin
Alex Church
Max Yang
Haoran Li
John Lloyd
Dandan Zhang
Nathan Lepora
92
28
0
12 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
36
14
0
06 Jul 2023
Learning to Solve Tasks with Exploring Prior Behaviours
Learning to Solve Tasks with Exploring Prior Behaviours
Ruiqi Zhu
Siyuan Li
Tianhong Dai
Chongjie Zhang
Oya Celiktutan
109
4
0
06 Jul 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill
  Learning
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
62
5
0
06 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of
  Circular Cylinder with Sparse Surface Pressure Sensing
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
57
30
0
05 Jul 2023
Goal Representations for Instruction Following: A Semi-Supervised
  Language Interface to Control
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Vivek Myers
Andre Wang He
Kuan Fang
Homer Walke
Philippe Hansen-Estruch
Ching-An Cheng
Mihai Jalobeanu
Andrey Kolobov
Anca Dragan
Sergey Levine
LM&Ro
89
31
0
30 Jun 2023
HYDRA: Hybrid Robot Actions for Imitation Learning
HYDRA: Hybrid Robot Actions for Imitation Learning
Suneel Belkhale
Yuchen Cui
Dorsa Sadigh
112
41
0
29 Jun 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
96
3
0
29 Jun 2023
MRHER: Model-based Relay Hindsight Experience Replay for Sequential
  Object Manipulation Tasks with Sparse Rewards
MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards
Yuming Huang
Bin Ren
Ziming Xu
Lianghong Wu
OffRL
61
0
0
28 Jun 2023
CEIL: Generalized Contextual Imitation Learning
CEIL: Generalized Contextual Imitation Learning
Jinxin Liu
Li He
Yachen Kang
Zifeng Zhuang
Donglin Wang
Huazhe Xu
81
17
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
115
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning
  with Intermediate Targets
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
99
19
0
24 Jun 2023
Learning from Pixels with Expert Observations
Learning from Pixels with Expert Observations
M. Hoang
Long Dinh
Hai V. Nguyen
OffRL
113
2
0
24 Jun 2023
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot
  Policy Imitation
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Massimiliano Patacchiola
Mingfei Sun
Katja Hofmann
Richard Turner
OffRL
79
1
0
23 Jun 2023
Granger-Causal Hierarchical Skill Discovery
Granger-Causal Hierarchical Skill Discovery
Caleb Chuck
Kevin Black
Aditya Arjun
Yuke Zhu
S. Niekum
OffRL
132
1
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Deep Generative Models for Decision-Making and Control
Michael Janner
66
1
0
15 Jun 2023
Hierarchical Task Network Planning for Facilitating Cooperative
  Multi-Agent Reinforcement Learning
Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning
Xuechen Mu
H. Zhuo
Chong Chen
Kai Zhang
Chao Yu
Jianye Hao
82
1
0
14 Jun 2023
Curricular Subgoals for Inverse Reinforcement Learning
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
94
2
0
14 Jun 2023
Reinforcement Learning in Robotic Motion Planning by Combined
  Experience-based Planning and Self-Imitation Learning
Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning
Sha Luo
Lambert Schomaker
151
10
0
11 Jun 2023
PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical
  Reinforcement Learning
PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning
Utsav Singh
Vinay P. Namboodiri
OffRL
190
3
0
10 Jun 2023
The Role of Diverse Replay for Generalisation in Reinforcement Learning
The Role of Diverse Replay for Generalisation in Reinforcement Learning
Max Weltevrede
M. Spaan
Wendelin Bohmer
OffRL
66
1
0
09 Jun 2023
Instructed Diffuser with Temporal Condition Guidance for Offline
  Reinforcement Learning
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
Jifeng Hu
Yan Sun
Sili Huang
Siyuan Guo
Hechang Chen
Li Shen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
DiffMOffRL
74
14
0
08 Jun 2023
Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular
  Design
Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design
Julien Roy
Pierre-Luc Bacon
C. Pal
Emmanuel Bengio
AI4CE
75
18
0
07 Jun 2023
Learning with a Mole: Transferable latent spatial representations for
  navigation without reconstruction
Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction
G. Bono
L. Antsfeld
Assem Sadek
G. Monaci
Christian Wolf
SSL
89
5
0
06 Jun 2023
Efficient Multi-Task and Transfer Reinforcement Learning with
  Parameter-Compositional Framework
Efficient Multi-Task and Transfer Reinforcement Learning with Parameter-Compositional Framework
Lingfeng Sun
Haichao Zhang
Wei Xu
Masayoshi Tomizuka
119
9
0
02 Jun 2023
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Shalev Lifshitz
Keiran Paster
Harris Chan
Jimmy Ba
Sheila A. McIlraith
LM&Ro
139
76
0
01 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
146
28
0
01 Jun 2023
Adaptive and Explainable Deployment of Navigation Skills via
  Hierarchical Deep Reinforcement Learning
Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement Learning
Kyowoon Lee
Seongun Kim
Jaesik Choi
57
11
0
31 May 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
76
26
0
30 May 2023
Toward Fine Contact Interactions: Learning to Control Normal Contact
  Force with Limited Information
Toward Fine Contact Interactions: Learning to Control Normal Contact Force with Limited Information
Jinda Cui
Jiawei Xu
David Saldaña
J. Trinkle
39
2
0
29 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal
  Approach
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
107
18
0
28 May 2023
Visual Affordance Prediction for Guiding Robot Exploration
Visual Affordance Prediction for Guiding Robot Exploration
Homanga Bharadhwaj
Abhi Gupta
Shubham Tulsiani
119
15
0
28 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
89
1
0
28 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random
  Features
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kai Zhang
Abhishek Gupta
OffRLSSL
84
9
0
26 May 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
Future-conditioned Unsupervised Pretraining for Decision Transformer
Zhihui Xie
Zichuan Lin
Deheng Ye
Qiang Fu
Wei Yang
Shuai Li
OffRLOnRL
92
23
0
26 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu
Pieter Abbeel
OffRL
93
29
0
26 May 2023
Reward-Machine-Guided, Self-Paced Reinforcement Learning
Reward-Machine-Guided, Self-Paced Reinforcement Learning
Cevahir Köprülü
Ufuk Topcu
78
3
0
25 May 2023
Beyond Reward: Offline Preference-guided Policy Optimization
Beyond Reward: Offline Preference-guided Policy Optimization
Yachen Kang
Dingxu Shi
Jinxin Liu
Li He
Donglin Wang
OffRL
75
35
0
25 May 2023
ChemGymRL: An Interactive Framework for Reinforcement Learning for
  Digital Chemistry
ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry
Chris Beeler
Sriram Ganapathi Subramanian
Kyle Sprague
Nouha Chatti
C. Bellinger
...
Amanuel Dawit
Zihan Yang
Xinkai Li
Mark Crowley
Isaac Tamblyn
OffRL
77
6
0
23 May 2023
Previous
123...678...242526
Next