1707.01495
Cited By
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,243 papers shown
Title
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm
Abdeslam Boularias
28
1
0
07 Jul 2024
Embracing Massive Medical Data
Yu-Cheng Chou
Zongwei Zhou
Alan Yuille
CLL
OOD
32
4
0
05 Jul 2024
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao
Shengjun Fang
Chenjun Xiao
Yang Yu
Zongzhang Zhang
OffRL
35
0
0
05 Jul 2024
EAGERx: Graph-Based Framework for Sim2real Robot Learning
B. V. D. Heijden
Jelle Luijkx
Laura Ferranti
Jens Kober
Robert Babuška
36
0
0
05 Jul 2024
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
39
1
0
30 Jun 2024
Learning Formal Mathematics From Intrinsic Motivation
Gabriel Poesia
David Broman
Nick Haber
Noah D. Goodman
LRM
41
10
0
30 Jun 2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
G. Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
27
2
0
29 Jun 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
32
0
0
26 Jun 2024
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Jannis Blüml
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&Ro
LRM
42
1
0
24 Jun 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
45
7
0
24 Jun 2024
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
33
1
0
22 Jun 2024
Learning telic-controllable state representations
Nadav Amir
Stas Tiomkin
Angela Langdon
41
0
0
20 Jun 2024
Metacognitive AI: Framework and the Case for a Neurosymbolic Approach
Hua Wei
Paulo Shakarian
Christian Lebiere
Bruce Draper
Nikhil Krishnaswamy
Sergei Nirenburg
LRM
27
5
0
17 Jun 2024
Large Reasoning Models for 3D Floorplanning in EDA: Learning from Imperfections
Fin Amin
N. Rouf
Tse-Han Pan
Md. Kamal Ibn Shafi
Paul D. Franzon
26
0
0
15 Jun 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
53
8
0
13 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman Serdar Kozat
Ozgur S. Oguz
20
1
0
13 Jun 2024
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
Zhenglong Luo
Zhiyong Chen
James Welsh
33
1
0
12 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
47
1
0
09 Jun 2024
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
Michał Zawalski
Gracjan Góral
Michał Tyrolski
Emilia Wisnios
Franciszek Budrowski
Marek Cygan
Łukasz Kuciński
Piotr Miłoś
47
0
0
05 Jun 2024
Multi-Agent Transfer Learning via Temporal Contrastive Learning
Weihao Zeng
Joseph Campbell
Simon Stepputtis
Katia P. Sycara
OffRL
49
2
0
03 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
31
1
0
03 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
89
6
0
03 Jun 2024
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning
Po-Shao Lin
Jia-Fong Yeh
Yi-Ting Chen
Winston H. Hsu
34
0
0
02 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
50
9
0
02 Jun 2024
Exploring the limits of Hierarchical World Models in Reinforcement Learning
Robin Schiewer
Anand Subramoney
Laurenz Wiskott
33
1
0
01 Jun 2024
Towards Learning Foundation Models for Heuristic Functions to Solve Pathfinding Problems
Vedant Khandelwal
Amit Sheth
Forest Agostinelli
47
2
0
01 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
43
1
0
30 May 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Zifan Song
Yudong Wang
Wenwei Zhang
Kuikun Liu
Chengqi Lyu
...
Qipeng Guo
Hang Yan
Dahua Lin
Kai-xiang Chen
Cairong Zhao
SyDa
46
2
0
29 May 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
35
5
0
29 May 2024
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Bangzheng Li
Ningshan Ma
Zifan Wang
36
0
1
26 May 2024
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
55
4
0
26 May 2024
RoboArm-NMP: a Learning Environment for Neural Motion Planning
Tom Jurgenson
Matan Sudry
Gal Avineri
Aviv Tamar
24
0
0
25 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
40
2
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
43
0
23 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
69
357
0
20 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
40
3
0
20 May 2024
Going into Orbit: Massively Parallelizing Episodic Reinforcement Learning
Jan Oberst
Johann Bonneau
21
0
0
19 May 2024
Generalized Multi-Objective Reinforcement Learning with Envelope Updates in URLLC-enabled Vehicular Networks
Zijiang Yan
Hina Tabassum
24
2
0
18 May 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
34
3
0
14 May 2024
CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning
Jingwen Wang
Dehui Du
Yida Li
Yiyang Li
Yikang Chen
AI4TS
CML
24
0
0
14 May 2024
AnyRotate: Gravity-Invariant In-Hand Object Rotation with Sim-to-Real Touch
Max Yang
Chenghua Lu
Alex Church
Yijiong Lin
Christopher J. Ford
Haoran Li
Efi Psomopoulou
David A.W. Barton
Nathan Lepora
62
15
0
12 May 2024
A Minimalist Prompt for Zero-Shot Policy Learning
Meng Song
Xuezhi Wang
Tanay Biradar
Yao Qin
Manmohan Chandraker
OffRL
35
1
0
09 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
46
2
0
06 May 2024
Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic Review
Harry Robertshaw
Lennart Karstensen
Benjamin Jackson
Hadi Sadati
K. Rhode
Sebastien Ourselin
Alejandro Granados
Thomas C Booth
34
13
0
06 May 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
46
4
0
06 May 2024
Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
Georgios Tzannetos
Parameswaran Kamalaruban
Adish Singla
34
4
0
03 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
28
2
0
01 May 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF
Han Zhong
Guhao Feng
Li Zhao
Di He
Jiang Bian
Liwei Wang
57
57
0
29 Apr 2024
Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods
M. Shin
Su-Jeong Park
Seung-Keol Ryu
Heeyeon Kim
Han-Lim Choi
54
0
0
25 Apr 2024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh
Wesley A Suttle
Brian M Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
30
4
0
20 Apr 2024