ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.15503
  4. Cited By
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
v1v2 (latest)

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

27 June 2023
Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning"

30 / 30 papers shown
Title
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Abdullah Vanlioglu
147
0
0
28 Mar 2025
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRLOnRL
114
12
0
12 Jun 2023
The In-Sample Softmax for Offline Reinforcement Learning
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
85
26
0
28 Feb 2023
Boosting Offline Reinforcement Learning via Data Rebalancing
Boosting Offline Reinforcement Learning via Data Rebalancing
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
81
23
0
17 Oct 2022
CORL: Research-oriented Deep Offline Reinforcement Learning Library
CORL: Research-oriented Deep Offline Reinforcement Learning Library
Denis Tarasov
Alexander Nikulin
Dmitry Akimov
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
130
94
0
13 Oct 2022
Bootstrapped Transformer for Offline Reinforcement Learning
Bootstrapped Transformer for Offline Reinforcement Learning
Kerong Wang
Hanye Zhao
Xufang Luo
Kan Ren
Weinan Zhang
Dongsheng Li
OffRL
73
40
0
17 Jun 2022
PAnDR: Fast Adaptation to New Environments from Offline Experiences via
  Decoupling Policy and Environment Representations
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
Tong Sang
Hongyao Tang
Yi-An Ma
Jianye Hao
Yan Zheng
Zhaopeng Meng
Boyan Li
Zhen Wang
OffRL
67
5
0
06 Apr 2022
Topological Experience Replay
Topological Experience Replay
Zhang-Wei Hong
Tao Chen
Yen-Chen Lin
Joni Pajarinen
Pulkit Agrawal
75
16
0
29 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement
  Learning
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
106
141
0
23 Feb 2022
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
94
23
0
09 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
312
936
0
12 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
187
284
0
04 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
A Minimalist Approach to Offline Reinforcement Learning
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
145
834
0
12 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
213
1,671
0
02 Jun 2021
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
315
435
0
16 Feb 2021
Revisiting Prioritized Experience Replay: A Value Perspective
Revisiting Prioritized Experience Replay: A Value Perspective
Ang Li
Zongqing Lu
Chenglin Miao
66
9
0
05 Feb 2021
Understanding and Mitigating the Limitations of Prioritized Experience
  Replay
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Yangchen Pan
Jincheng Mei
Amir-massoud Farahmand
Martha White
Hengshuai Yao
Mohsen Rohani
Jun Luo
69
12
0
19 Jul 2020
Conservative Q-Learning for Offline Reinforcement Learning
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRLOnRL
152
1,841
0
08 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRLGP
586
2,052
0
04 May 2020
Reinforcement Learning with Augmented Data
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
168
661
0
30 Apr 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GPOffRL
253
1,387
0
15 Apr 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
369
1,710
0
02 Feb 2020
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Deheng Ye
Zhao Liu
Mingfei Sun
Bei Shi
P. Zhao
...
Tengfei Shi
Liang Wang
Qiang Fu
Wei Yang
Lanxiao Huang
73
324
0
20 Dec 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRLOnRL
156
1,070
0
03 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRLBDL
350
1,628
0
07 Dec 2018
Self-Imitation Learning
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
90
251
0
14 Jun 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward
  Update
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
79
75
0
31 May 2018
Rainbow: Combining Improvements in Deep Reinforcement Learning
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
116
2,283
0
06 Oct 2017
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
350
3,812
0
18 Nov 2015
1