Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.01495
Cited By
v1
v2
v3 (latest)
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Hindsight Experience Replay"
50 / 1,267 papers shown
Title
Towards Learning Foundation Models for Heuristic Functions to Solve Pathfinding Problems
Vedant Khandelwal
Amit Sheth
Forest Agostinelli
81
2
0
01 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
67
1
0
30 May 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Zifan Song
Yudong Wang
Wenwei Zhang
Kuikun Liu
Chengqi Lyu
...
Qipeng Guo
Hang Yan
Dahua Lin
Kai-xiang Chen
Cairong Zhao
SyDa
80
2
0
29 May 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
64
5
0
29 May 2024
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Bangzheng Li
Ningshan Ma
Zifan Wang
44
0
1
26 May 2024
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
143
5
0
26 May 2024
RoboArm-NMP: a Learning Environment for Neural Motion Planning
Tom Jurgenson
Matan Sudry
Gal Avineri
Aviv Tamar
59
0
0
25 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
107
3
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
333
54
0
23 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
216
452
0
20 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
91
3
0
20 May 2024
Going into Orbit: Massively Parallelizing Episodic Reinforcement Learning
Jan Oberst
Johann Bonneau
37
0
0
19 May 2024
Generalized Multi-Objective Reinforcement Learning with Envelope Updates in URLLC-enabled Vehicular Networks
Zijiang Yan
Hina Tabassum
80
3
0
18 May 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
85
4
0
14 May 2024
CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning
Jingwen Wang
Dehui Du
Yida Li
Yiyang Li
Yikang Chen
AI4TS
CML
32
0
0
14 May 2024
AnyRotate: Gravity-Invariant In-Hand Object Rotation with Sim-to-Real Touch
Max Yang
Chenghua Lu
Alex Church
Yijiong Lin
Christopher J. Ford
Haoran Li
Efi Psomopoulou
David A.W. Barton
Nathan Lepora
134
17
0
12 May 2024
A Minimalist Prompt for Zero-Shot Policy Learning
Meng Song
Xuezhi Wang
Tanay Biradar
Yao Qin
Manmohan Chandraker
OffRL
66
1
0
09 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
107
4
0
06 May 2024
Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic Review
Harry Robertshaw
Lennart Karstensen
Benjamin Jackson
Hadi Sadati
K. Rhode
Sebastien Ourselin
Alejandro Granados
Thomas C Booth
59
14
0
06 May 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
77
4
0
06 May 2024
Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
Georgios Tzannetos
Parameswaran Kamalaruban
Adish Singla
58
4
0
03 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
68
2
0
01 May 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF
Han Zhong
Zikang Shan
Guhao Feng
Wei Xiong
Xinle Cheng
Li Zhao
Di He
Jiang Bian
Liwei Wang
155
72
0
29 Apr 2024
Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods
M. Shin
Su-Jeong Park
Seung-Keol Ryu
Heeyeon Kim
Han-Lim Choi
139
0
0
25 Apr 2024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh
Wesley A Suttle
Brian M Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
75
5
0
20 Apr 2024
Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop
Hector Kohler
Quentin Delfosse
Paul Festor
Philippe Preux
117
0
0
16 Apr 2024
A Survey on Deep Learning for Theorem Proving
Zhaoyu Li
Jialiang Sun
Logan Murphy
Qidong Su
Zenan Li
Xian Zhang
Kaiyu Yang
Xujie Si
LRM
123
32
0
15 Apr 2024
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Kumar Misra
Aldo Pacchiano
Rob Schapire
70
1
0
14 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
91
38
0
12 Apr 2024
A Data Efficient Framework for Learning Local Heuristics
Rishi Veerapaneni
Jonathan Park
Muhammad Suhail Saleem
Maxim Likhachev
39
0
0
10 Apr 2024
Demonstration-Enhanced Adaptive Multi-Objective Robot Navigation
Jorge de Heuvel
Tharun Sethuraman
Maren Bennewitz
111
0
0
07 Apr 2024
Rethinking Teacher-Student Curriculum Learning through the Cooperative Mechanics of Experience
Manfred Diaz
Liam Paull
Andrea Tacchetti
126
0
0
03 Apr 2024
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
Jonathan C. Balloch
Rishav Bhagat
Geigh Zollicoffer
Ruoran Jia
Julia Kim
Mark O. Riedl
OffRL
85
1
0
02 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&Ro
OffRL
OCL
88
14
0
01 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
90
0
0
31 Mar 2024
Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL
Osama Ahmad
Zawar Hussain
Hammad Naeem
51
0
0
25 Mar 2024
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting
Clément Gaspard
G. Passault
Mélodie Daniel
Olivier Ly
42
1
0
19 Mar 2024
The Value of Reward Lookahead in Reinforcement Learning
Nadav Merlis
Dorian Baudry
Vianney Perchet
62
1
0
18 Mar 2024
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
60
0
0
17 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
84
11
0
14 Mar 2024
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Shikhar Murty
Christopher D. Manning
Peter Shaw
Mandar Joshi
Kenton Lee
LM&Ro
LLMAG
70
20
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
105
12
0
11 Mar 2024
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CML
OffRL
82
1
0
07 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
99
26
0
05 Mar 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Ziping Xu
Zifan Xu
Runxuan Jiang
Peter Stone
Ambuj Tewari
83
1
0
03 Mar 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
88
4
0
29 Feb 2024
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
111
13
0
27 Feb 2024
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
106
30
0
23 Feb 2024
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Xinglin Zhou
Yifu Yuan
Shaofu Yang
Jianye Hao
77
2
0
22 Feb 2024
Learning control strategy in soft robotics through a set of configuration spaces
Etienne Ménager
Christian Duriez
87
0
0
21 Feb 2024
Previous
1
2
3
4
5
...
24
25
26
Next