ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXivPDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,243 papers shown
Title
Towards a Research Community in Interpretable Reinforcement Learning:
  the InterpPol Workshop
Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop
Hector Kohler
Quentin Delfosse
Paul Festor
Philippe Preux
35
0
0
16 Apr 2024
A Survey on Deep Learning for Theorem Proving
A Survey on Deep Learning for Theorem Proving
Zhaoyu Li
Jialiang Sun
Logan Murphy
Qidong Su
Zenan Li
Xian Zhang
Kaiyu Yang
Xujie Si
LRM
56
21
0
15 Apr 2024
Provable Interactive Learning with Hindsight Instruction Feedback
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Kumar Misra
Aldo Pacchiano
Rob Schapire
44
1
0
14 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from
  Human Feedback for LLMs
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
29
34
0
12 Apr 2024
A Data Efficient Framework for Learning Local Heuristics
A Data Efficient Framework for Learning Local Heuristics
Rishi Veerapaneni
Jonathan Park
Muhammad Suhail Saleem
Maxim Likhachev
25
0
0
10 Apr 2024
Demonstration-Enhanced Adaptive Multi-Objective Robot Navigation
Demonstration-Enhanced Adaptive Multi-Objective Robot Navigation
Jorge de Heuvel
Tharun Sethuraman
Maren Bennewitz
46
0
0
07 Apr 2024
Rethinking Teacher-Student Curriculum Learning through the Cooperative
  Mechanics of Experience
Rethinking Teacher-Student Curriculum Learning through the Cooperative Mechanics of Experience
Manfred Diaz
Liam Paull
Andrea Tacchetti
47
0
0
03 Apr 2024
Is Exploration All You Need? Effective Exploration Characteristics for
  Transfer in Reinforcement Learning
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
Jonathan C. Balloch
Rishav Bhagat
Geigh Zollicoffer
Ruoran Jia
Julia Kim
Mark O. Riedl
OffRL
34
1
0
02 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from
  Pixels
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&Ro
OffRL
OCL
40
11
0
01 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
25
0
0
31 Mar 2024
Trajectory Planning of Robotic Manipulator in Dynamic Environment
  Exploiting DRL
Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL
Osama Ahmad
Zawar Hussain
Hammad Naeem
28
0
0
25 Mar 2024
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal
  Footstep Planning and Forecasting
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting
Clément Gaspard
G. Passault
Mélodie Daniel
Olivier Ly
16
1
0
19 Mar 2024
The Value of Reward Lookahead in Reinforcement Learning
The Value of Reward Lookahead in Reinforcement Learning
Nadav Merlis
Dorian Baudry
Vianney Perchet
29
0
0
18 Mar 2024
Phasic Diversity Optimization for Population-Based Reinforcement
  Learning
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
35
0
0
17 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
30
11
0
14 Mar 2024
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Shikhar Murty
Christopher D. Manning
Peter Shaw
Mandar Joshi
Kenton Lee
LM&Ro
LLMAG
34
14
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic
  Manipulations With Large Language Models
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
45
8
0
11 Mar 2024
Why Online Reinforcement Learning is Causal
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CML
OffRL
46
1
0
07 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
37
26
0
05 Mar 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement
  Learning with Diverse Tasks
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Ziping Xu
Zifan Xu
Runxuan Jiang
Peter Stone
Ambuj Tewari
48
1
0
03 Mar 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward
  Encodings
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
48
10
0
27 Feb 2024
Foundation Policies with Hilbert Representations
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
50
21
0
23 Feb 2024
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback
  and Dynamic Distance Constraint
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Xinglin Zhou
Yifu Yuan
Shaofu Yang
Jianye Hao
34
1
0
22 Feb 2024
Learning control strategy in soft robotics through a set of
  configuration spaces
Learning control strategy in soft robotics through a set of configuration spaces
Etienne Ménager
Christian Duriez
40
0
0
21 Feb 2024
Rewards-in-Context: Multi-objective Alignment of Foundation Models with
  Dynamic Preference Adjustment
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Rui Yang
Xiaoman Pan
Feng Luo
Shuang Qiu
Han Zhong
Dong Yu
Jianshu Chen
103
67
0
15 Feb 2024
Single-Reset Divide & Conquer Imitation Learning
Single-Reset Divide & Conquer Imitation Learning
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
40
0
0
14 Feb 2024
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic
  Shortest Path
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Qiwei Di
Jiafan He
Dongruo Zhou
Quanquan Gu
33
2
0
14 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
34
23
0
13 Feb 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for
  Goal-Conditioned Offline RL
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
46
6
0
11 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
27
12
0
08 Feb 2024
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Natasha Butt
Blazej Manczak
Auke Wiggers
Corrado Rainone
David W. Zhang
Michaël Defferrard
Taco S. Cohen
ReLM
LRM
46
17
0
07 Feb 2024
Logical Specifications-guided Dynamic Task Sampling for Reinforcement
  Learning Agents
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
Yash Shukla
Wenchang Gao
Vasanth Sarathy
Robert Wright
Alvaro Velasquez
Jivko Sinapov
21
0
0
06 Feb 2024
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised
  Environment Design
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
35
7
0
05 Feb 2024
Trust and ethical considerations in a multi-modal, explainable AI-driven
  chatbot tutoring system: The case of collaboratively solving Rubik's Cube
Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's Cube
Kausik Lakkaraju
Vedant Khandelwal
Biplav Srivastava
Forest Agostinelli
Hengtao Tang
Prathamjeet Singh
Dezhi Wu
Matthew Irvin
Ashish Kundu
27
0
0
30 Jan 2024
Zero-Shot Reinforcement Learning via Function Encoders
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
43
2
0
30 Jan 2024
Scilab-RL: A software framework for efficient reinforcement learning and
  cognitive modeling research
Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research
Jan Dohmen
Frank Röder
Manfred Eppe
OffRL
26
0
0
25 Jan 2024
Back-stepping Experience Replay with Application to Model-free
  Reinforcement Learning for a Soft Snake Robot
Back-stepping Experience Replay with Application to Model-free Reinforcement Learning for a Soft Snake Robot
Xinda Qi
Dong Chen
Zhao Li
Xiaobo Tan
32
1
0
21 Jan 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for
  Decision-Making Agents
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi
Shuo Chen
Yexin Li
Xiangyu Kong
Junqi Wang
...
Zhaowei Zhang
Nian Liu
Wei Wang
Yaodong Yang
Song-Chun Zhu
AI4CE
LRM
27
18
0
19 Jan 2024
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning
  and Motion Planning
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning
Hao Chen
Weiwei Wan
Masaki Matsushita
Takeyuki Kotaka
Kensuke Harada
35
1
0
18 Jan 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Carlo DÉramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
59
124
0
17 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language
  Model Critique in Text Generation
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL
ALM
27
4
0
14 Jan 2024
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Quentin Delfosse
Sebastian Sztwiertnia
M. Rothermel
Wolfgang Stammer
Kristian Kersting
55
18
0
11 Jan 2024
Towards Safe Load Balancing based on Control Barrier Functions and Deep
  Reinforcement Learning
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
22
1
0
10 Jan 2024
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
Chaitanya Kharyal
S. Gottipati
Tanmay Kumar Sinha
Srijita Das
Matthew E. Taylor
LLMAG
15
1
0
03 Jan 2024
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse
  Reward
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRL
VLM
33
0
0
25 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with
  Reinforcement Learning
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
38
1
0
23 Dec 2023
Open-Source Reinforcement Learning Environments Implemented in MuJoCo
  with Franka Manipulator
Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator
Zichun Xu
Yuntao Li
Xiaohang Yang
Zhiyuan Zhao
Zhuang Lei
Jingdong Zhao
43
2
0
21 Dec 2023
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via
  Stationary Distribution Correction Estimation
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain
Vaibhav Unhelkar
OffRL
27
4
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
Previous
12345...232425
Next