ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,267 papers shown
Title
Planning under Uncertainty to Goal Distributions
Planning under Uncertainty to Goal Distributions
Adam Conkey
Tucker Hermans
72
3
0
01 Jul 2025
Diffusion Models for Robotic Manipulation: A Survey
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
127
2
0
01 Jul 2025
BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning
BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning
Xuechen Zhang
Zijian Huang
Yingcong Li
Chenshun Ni
Jiasi Chen
Samet Oymak
OffRLMoELRM
27
0
0
20 Jun 2025
Energy-Based Transfer for Reinforcement Learning
Energy-Based Transfer for Reinforcement Learning
Zeyun Deng
Jasorsi Ghosh
Fiona Xie
Yuzhe Lu
Katia Sycara
Joseph Campbell
15
0
0
19 Jun 2025
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
29
0
0
18 Jun 2025
ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes
ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes
Zeyuan Chen
Qiyang Yan
Yuanpei Chen
Tianhao Wu
Jiyao Zhang
Zihan Ding
Jinzhou Li
Yaodong Yang
Hao Dong
15
0
0
17 Jun 2025
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
Mingkang Zhu
Xi Chen
Zhongdao Wang
Bei Yu
Hengshuang Zhao
Jiaya Jia
15
0
0
17 Jun 2025
DynaGuide: Steering Diffusion Polices with Active Dynamic Guidance
DynaGuide: Steering Diffusion Polices with Active Dynamic Guidance
Maximilian Du
Shuran Song
25
0
0
16 Jun 2025
Goal-based Self-Adaptive Generative Adversarial Imitation Learning (Goal-SAGAIL) for Multi-goal Robotic Manipulation Tasks
Goal-based Self-Adaptive Generative Adversarial Imitation Learning (Goal-SAGAIL) for Multi-goal Robotic Manipulation Tasks
Yingyi Kuang
Luis J. Manso
George Vogiatzis
17
0
0
15 Jun 2025
CIRO7.2: A Material Network with Circularity of -7.2 and Reinforcement-Learning-Controlled Robotic Disassembler
CIRO7.2: A Material Network with Circularity of -7.2 and Reinforcement-Learning-Controlled Robotic Disassembler
Federico Zocco
Monica Malvezzi
15
0
0
13 Jun 2025
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
Mido Assran
Adrien Bardes
David Fan
Q. Garrido
Russell Howes
...
Sarath Chandar
Franziska Meier
Yann LeCun
Michael G. Rabbat
Nicolas Ballas
71
0
0
11 Jun 2025
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
27
0
0
10 Jun 2025
Learning The Minimum Action Distance
Learning The Minimum Action Distance
Lorenzo Steccanella
Joshua B. Evans
Özgür Simsek
Anders Jonsson
23
0
0
10 Jun 2025
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
Seungho Baek
Taegeon Park
Jongchan Park
Seungjun Oh
Yusung Kim
OffRL
22
0
0
09 Jun 2025
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Junhong Shen
Hao Bai
Lunjun Zhang
Yifei Zhou
Amrith Rajagopal Setlur
...
Diego Caples
Nan Jiang
Tong Zhang
Ameet Talwalkar
Aviral Kumar
LLMAGLRM
23
0
0
09 Jun 2025
Reachability Weighted Offline Goal-conditioned Resampling
Reachability Weighted Offline Goal-conditioned Resampling
Wenyan Yang
Joni Pajarinen
OffRL
70
0
0
03 Jun 2025
SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning
SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning
Yihao Liu
Shuocheng Li
Lang Cao
Yuhang Xie
Mengyu Zhou
Haoyu Dong
Xiaojun Ma
Shi Han
Dongmei Zhang
OffRLReLMLRM
39
0
0
01 Jun 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
96
0
0
29 May 2025
Diffusion Guidance Is a Controllable Policy Improvement Operator
Diffusion Guidance Is a Controllable Policy Improvement Operator
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
70
0
0
29 May 2025
Can Large Reasoning Models Self-Train?
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLMOffRLLRM
76
2
0
27 May 2025
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
V. Wang
Tinghuai Wang
Joni Pajarinen
BDL
30
0
0
27 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
77
0
0
26 May 2025
Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning
Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning
Quentin Rouxel
Clemente Donoso
Fei Chen
S. Ivaldi
Jean-Baptiste Mouret
OffRL
128
0
0
26 May 2025
Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies
Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies
Kevin Li
Marinka Zitnik
OffRL
28
0
0
25 May 2025
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
Federico Zocco
Andrea Corti
Monica Malvezzi
AI4CE
35
0
0
24 May 2025
Flattening Hierarchies with Policy Bootstrapping
Flattening Hierarchies with Policy Bootstrapping
John L. Zhou
Jonathan C. Kao
OffRL
99
0
0
20 May 2025
Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
Hongjoon Ahn
Heewoong Choi
Jisu Han
Taesup Moon
OffRL
106
0
0
19 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
93
0
0
19 May 2025
Attention-Based Reward Shaping for Sparse and Delayed Rewards
Attention-Based Reward Shaping for Sparse and Delayed Rewards
Ian Holmes
Min Chi
OffRL
92
0
0
16 May 2025
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
L. Hanzo
91
1
0
15 May 2025
General Dynamic Goal Recognition
General Dynamic Goal Recognition
Osher Elhadad
Reuth Mirsky
AI4CE
43
1
0
14 May 2025
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Shuai Han
Mehdi Dastani
Shihan Wang
86
0
0
13 May 2025
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
Hanjung Kim
Jaehyun Kang
Hyolim Kang
Meedeum Cho
Seon Joo Kim
Youngwoon Lee
103
0
0
13 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
89
0
0
06 May 2025
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
Chenran Zhao
Dianxi Shi
Mengzhu Wang
Jianqiang Xia
Huanhuan Yang
Songchang Jin
Shaowu Yang
Chunping Qiu
62
0
0
04 May 2025
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
Bofei Liu
Dong Ye
Zunhao Yao
Zhaowei Sun
63
0
0
04 May 2025
CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation
CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation
Mazal Bethany
Nishant Vishwamitra
Cho-Yu Chiang
Peyman Najafirad
AAML
56
0
0
03 May 2025
Neuro-Symbolic Generation of Explanations for Robot Policies with Weighted Signal Temporal Logic
Neuro-Symbolic Generation of Explanations for Robot Policies with Weighted Signal Temporal Logic
Mikihisa Yuasa
R. Sreenivas
Huy T. Tran
76
0
0
30 Apr 2025
Planning with Diffusion Models for Target-Oriented Dialogue Systems
Planning with Diffusion Models for Target-Oriented Dialogue Systems
Hanwen Du
Bo Peng
Xia Ning
74
0
0
23 Apr 2025
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng
Ruixi Qiao
Lijun Li
Chao Guo
Jianmin Wang
Gang Xiong
Yisheng Lv
Fei-Yue Wang
LRM
462
5
0
21 Apr 2025
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Fikrican Özgür
René Zurbrugg
Suryansh Kumar
86
0
0
15 Apr 2025
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
Zhao Dong
Ka Chen
Zhaoyang Lv
Hong-Xing Yu
Yunzhi Zhang
...
Xiaqing Pan
Mingfei Yan
Jiajun Wu
Carl Ren
Richard Newcombe
119
3
0
11 Apr 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Yicheng Gao
Ning Yang
Stephen Xia
OffRL
133
0
0
08 Apr 2025
Solving Sokoban using Hierarchical Reinforcement Learning with Landmarks
Solving Sokoban using Hierarchical Reinforcement Learning with Landmarks
Sergey Pastukhov
67
0
0
06 Apr 2025
Outlook Towards Deployable Continual Learning for Particle Accelerators
Outlook Towards Deployable Continual Learning for Particle Accelerators
Kishansingh Rajput
Sen Lin
Auralee Edelen
Willem Blokland
Malachi Schram
71
0
0
04 Apr 2025
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
Younghwan Lee
Tung M. Luu
Donghoon Lee
Chang D. Yoo
3DVVLMOffRL
136
0
0
03 Apr 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
64
1
0
02 Apr 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
195
2
0
24 Mar 2025
Causally Aligned Curriculum Learning
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
102
4
0
21 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
74
0
0
20 Mar 2025
1234...242526
Next