ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,267 papers shown
Title
Rewards-in-Context: Multi-objective Alignment of Foundation Models with
  Dynamic Preference Adjustment
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Rui Yang
Xiaoman Pan
Feng Luo
Shuang Qiu
Han Zhong
Dong Yu
Jianshu Chen
220
83
0
15 Feb 2024
Single-Reset Divide & Conquer Imitation Learning
Single-Reset Divide & Conquer Imitation Learning
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
69
0
0
14 Feb 2024
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic
  Shortest Path
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Qiwei Di
Jiafan He
Dongruo Zhou
Quanquan Gu
71
2
0
14 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
126
30
0
13 Feb 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for
  Goal-Conditioned Offline RL
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
103
9
0
11 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRLLRM
97
16
0
08 Feb 2024
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Natasha Butt
Blazej Manczak
Auke Wiggers
Corrado Rainone
David W. Zhang
Michaël Defferrard
Taco S. Cohen
ReLMLRM
86
20
0
07 Feb 2024
Logical Specifications-guided Dynamic Task Sampling for Reinforcement
  Learning Agents
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
Yash Shukla
Wenchang Gao
Vasanth Sarathy
Robert Wright
Alvaro Velasquez
Jivko Sinapov
68
0
0
06 Feb 2024
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised
  Environment Design
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
118
9
0
05 Feb 2024
Trust and ethical considerations in a multi-modal, explainable AI-driven
  chatbot tutoring system: The case of collaboratively solving Rubik's Cube
Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's Cube
Kausik Lakkaraju
Vedant Khandelwal
Biplav Srivastava
Forest Agostinelli
Hengtao Tang
Prathamjeet Singh
Dezhi Wu
Matthew Irvin
Ashish Kundu
55
0
0
30 Jan 2024
Zero-Shot Reinforcement Learning via Function Encoders
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
114
5
0
30 Jan 2024
Scilab-RL: A software framework for efficient reinforcement learning and
  cognitive modeling research
Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research
Jan Dohmen
Frank Röder
Manfred Eppe
OffRL
33
0
0
25 Jan 2024
Back-stepping Experience Replay with Application to Model-free
  Reinforcement Learning for a Soft Snake Robot
Back-stepping Experience Replay with Application to Model-free Reinforcement Learning for a Soft Snake Robot
Xinda Qi
Dong Chen
Zhao Li
Xiaobo Tan
65
2
0
21 Jan 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for
  Decision-Making Agents
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi
Shuo Chen
Yexin Li
Xiangyu Kong
Junqi Wang
...
Zhaowei Zhang
Nian Liu
Wei Wang
Yaodong Yang
Song-Chun Zhu
AI4CELRM
119
23
0
19 Jan 2024
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning
  and Motion Planning
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning
Hao Chen
Weiwei Wan
Masaki Matsushita
Takeyuki Kotaka
Kensuke Harada
79
2
0
18 Jan 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Carlo DÉramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
196
130
0
17 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language
  Model Critique in Text Generation
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRLALM
53
7
0
14 Jan 2024
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Quentin Delfosse
Sebastian Sztwiertnia
M. Rothermel
Wolfgang Stammer
Kristian Kersting
134
20
0
11 Jan 2024
Towards Safe Load Balancing based on Control Barrier Functions and Deep
  Reinforcement Learning
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
36
2
0
10 Jan 2024
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
Chaitanya Kharyal
S. Gottipati
Tanmay Kumar Sinha
Srijita Das
Matthew E. Taylor
LLMAG
56
1
0
03 Jan 2024
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse
  Reward
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRLVLM
73
0
0
25 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with
  Reinforcement Learning
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
72
1
0
23 Dec 2023
Open-Source Reinforcement Learning Environments Implemented in MuJoCo
  with Franka Manipulator
Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator
Zichun Xu
Yuntao Li
Xiaohang Yang
Zhiyuan Zhao
Zhuang Lei
Jingdong Zhao
100
2
0
21 Dec 2023
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via
  Stationary Distribution Correction Estimation
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain
Vaibhav Unhelkar
OffRL
70
7
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
80
10
0
15 Dec 2023
HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement
  Learning Agents
HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents
Dániel Horváth
Jesús Bujalance Martín
Ferenc Gàbor Erdos
Z. Istenes
Fabien Moutarde
OffRL
57
1
0
14 Dec 2023
Personalized Path Recourse for Reinforcement Learning Agents
Personalized Path Recourse for Reinforcement Learning Agents
Dat Hong
Tong Wang
48
0
0
14 Dec 2023
Learning adaptive planning representations with natural language
  guidance
Learning adaptive planning representations with natural language guidance
L. Wong
Jiayuan Mao
Pratyusha Sharma
Zachary S. Siegel
Jiahai Feng
Noa Korneev
Joshua B. Tenenbaum
Jacob Andreas
LM&Ro
92
28
0
13 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAGLM&Ro
98
1
0
12 Dec 2023
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement
  Learning
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement Learning
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
69
2
0
10 Dec 2023
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a
  High Replay Ratio and Regularization
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
84
1
0
10 Dec 2023
Backward Learning for Goal-Conditioned Policies
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
100
1
0
08 Dec 2023
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
Lili Chen
Shikhar Bahl
Deepak Pathak
75
44
0
07 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLMOffRLOnRL
120
7
0
06 Dec 2023
Diffused Task-Agnostic Milestone Planner
Diffused Task-Agnostic Milestone Planner
Mineui Hong
Minjae Kang
Songhwai Oh
113
6
0
06 Dec 2023
Understanding Representations Pretrained with Auxiliary Losses for
  Embodied Agent Planning
Understanding Representations Pretrained with Auxiliary Losses for Embodied Agent Planning
Samrudhdhi B. Rangrej
James J. Clark
SSL
65
0
0
06 Dec 2023
Contact Energy Based Hindsight Experience Prioritization
Contact Energy Based Hindsight Experience Prioritization
Erdi Sayar
Zhenshan Bing
Carlo DÉramo
Ozgur S. Oguz
Alois Knoll
92
3
0
05 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
92
2
0
05 Dec 2023
Working Backwards: Learning to Place by Picking
Working Backwards: Learning to Place by Picking
Oliver Limoyo
Abhisek Konar
Trevor Ablett
Jonathan Kelly
F. Hogan
Gregory Dudek
77
0
0
04 Dec 2023
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse
  Catalysts Design
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse Catalysts Design
Romain Lacombe
Lucas Hendren
Khalid El-Awady
38
2
0
04 Dec 2023
Modular Control Architecture for Safe Marine Navigation: Reinforcement
  Learning and Predictive Safety Filters
Modular Control Architecture for Safe Marine Navigation: Reinforcement Learning and Predictive Safety Filters
Aksel Vaaler
Svein Jostein Husa
Daniel Menges
T. N. Larsen
Adil Rasheed
61
2
0
04 Dec 2023
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement
  Learning
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
64
0
0
29 Nov 2023
Goal-conditioned Offline Planning from Curious Exploration
Goal-conditioned Offline Planning from Curious Exploration
Marco Bagatella
Georg Martius
OffRL
84
1
0
28 Nov 2023
Offline Skill Generalization via Task and Motion Planning
Offline Skill Generalization via Task and Motion Planning
Shin Watanabe
Geir Horn
J. Tørresen
K. Ellefsen
OffRL
80
0
0
24 Nov 2023
Multi-Objective Reinforcement Learning Based on Decomposition: A
  Taxonomy and Framework
Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework
Florian Felten
El-Ghazali Talbi
Grégoire Danoy
76
17
0
21 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM
  Contingency Management
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
49
3
0
17 Nov 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
101
2
0
09 Nov 2023
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for
  Deep Reinforcement Learning
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning
Junmin Zhong
Ruofan Wu
Jennie Si
OffRL
52
1
0
07 Nov 2023
PcLast: Discovering Plannable Continuous Latent States
PcLast: Discovering Plannable Continuous Latent States
Anurag Koul
Shivakanth Sujit
Shaoru Chen
Ben Evans
Lili Wu
...
Yonathan Efroni
Lekan Molu
Miro Dudik
John Langford
Alex Lamb
OffRLBDL
102
1
0
06 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
171
4
0
06 Nov 2023
Previous
123456...242526
Next