Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.15503
Cited By
v1
v2 (latest)
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
27 June 2023
Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning"
30 / 30 papers shown
Title
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Abdullah Vanlioglu
147
0
0
28 Mar 2025
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
114
12
0
12 Jun 2023
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
85
26
0
28 Feb 2023
Boosting Offline Reinforcement Learning via Data Rebalancing
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
81
23
0
17 Oct 2022
CORL: Research-oriented Deep Offline Reinforcement Learning Library
Denis Tarasov
Alexander Nikulin
Dmitry Akimov
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
130
94
0
13 Oct 2022
Bootstrapped Transformer for Offline Reinforcement Learning
Kerong Wang
Hanye Zhao
Xufang Luo
Kan Ren
Weinan Zhang
Dongsheng Li
OffRL
73
40
0
17 Jun 2022
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
Tong Sang
Hongyao Tang
Yi-An Ma
Jianye Hao
Yan Zheng
Zhaopeng Meng
Boyan Li
Zhen Wang
OffRL
67
5
0
06 Apr 2022
Topological Experience Replay
Zhang-Wei Hong
Tao Chen
Yen-Chen Lin
Joni Pajarinen
Pulkit Agrawal
75
16
0
29 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
106
141
0
23 Feb 2022
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
94
23
0
09 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
312
936
0
12 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
187
284
0
04 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
145
834
0
12 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
213
1,671
0
02 Jun 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
315
435
0
16 Feb 2021
Revisiting Prioritized Experience Replay: A Value Perspective
Ang Li
Zongqing Lu
Chenglin Miao
66
9
0
05 Feb 2021
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Yangchen Pan
Jincheng Mei
Amir-massoud Farahmand
Martha White
Hengshuai Yao
Mohsen Rohani
Jun Luo
69
12
0
19 Jul 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
152
1,841
0
08 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
586
2,052
0
04 May 2020
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
168
661
0
30 Apr 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
253
1,387
0
15 Apr 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
369
1,710
0
02 Feb 2020
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Deheng Ye
Zhao Liu
Mingfei Sun
Bei Shi
P. Zhao
...
Tengfei Shi
Liang Wang
Qiang Fu
Wei Yang
Lanxiao Huang
73
324
0
20 Dec 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
156
1,070
0
03 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
350
1,628
0
07 Dec 2018
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
90
251
0
14 Jun 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
79
75
0
31 May 2018
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
116
2,283
0
06 Oct 2017
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
350
3,812
0
18 Nov 2015
1