Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.05952
Cited By
v1
v2
v3
v4 (latest)
Prioritized Experience Replay
18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Prioritized Experience Replay"
50 / 1,454 papers shown
Title
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Jayden Teoh
Wenjun Li
Pradeep Varakantham
113
2
0
08 Feb 2025
Learning from Active Human Involvement through Proxy Value Propagation
Zhenghao Peng
Wenjie Mo
Chenda Duan
Quanyi Li
Bolei Zhou
185
16
0
05 Feb 2025
Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks
Lars C.P.M. Quaedvlieg
105
0
0
31 Jan 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. DÓro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
99
6
0
28 Jan 2025
Episodic memory in AI agents poses risks that should be studied and mitigated
Chad DeChant
143
4
0
20 Jan 2025
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
80
2
0
17 Jan 2025
Pareto Set Learning for Multi-Objective Reinforcement Learning
Erlong Liu
Yu-Chang Wu
Xiaobin Huang
Chengrui Gao
Ren-Jian Wang
Ke Xue
Chao Qian
OffRL
235
2
0
12 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
90
0
0
08 Jan 2025
Highway Graph to Accelerate Reinforcement Learning
Zidu Yin
Zhen Zhang
Dong Gong
Stefano V. Albrecht
J. Q. Shi
OffRL
75
0
0
08 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
163
0
0
03 Jan 2025
High-fidelity social learning via shared episodic memories enhances collaborative foraging through mnemonic convergence
Ismael T. Freire
P. Verschure
58
1
0
31 Dec 2024
Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization
Shixuan Liu
Yanghe Feng
Keyu Wu
Guangquan Cheng
Jincai Huang
Zhong Liu
CML
84
7
0
27 Dec 2024
Contrastive Representation for Interactive Recommendation
Jingyu Li
Zhiyong Feng
Dongxiao He
Hongqi Chen
Qinghang Gao
Guoli Wu
83
0
0
24 Dec 2024
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
170
1
0
22 Dec 2024
Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency
Taisuke Kobayashi
Takumi Aotani
176
5
0
17 Dec 2024
Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning
Alejandro Mendoza Barrionuevo
S. Luis
Daniel Gutiérrez-Reina
S. T. Marín
92
1
0
03 Dec 2024
Playable Game Generation
Mingyu Yang
Junyou Li
Zhongbin Fang
Sheng Chen
Yangbin Yu
Qiang Fu
Wei Yang
Deheng Ye
VGen
123
10
0
01 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Jianming Hu
Dingyi Yao
112
0
0
30 Nov 2024
A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation
Yang Lv
Jinlong Lei
Peng Yi
113
1
0
29 Nov 2024
CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening
A. Kulkarni
Shangtong Zhang
Madhur Behl
AAML
108
1
0
26 Nov 2024
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Jinhong Lin
Cheng-En Wu
Huanran Li
Jifan Zhang
Yu Hen Hu
Pedro Morgado
117
0
0
16 Nov 2024
Act in Collusion: A Persistent Distributed Multi-Target Backdoor in Federated Learning
Tao Liu
Wu Yang
Chen Xu
Jiguang Lv
Huanran Wang
Yuhang Zhang
Shuchun Xu
Dapeng Man
AAML
FedML
70
0
0
06 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
158
1
0
06 Nov 2024
Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning
Yang Zhao
Zidong Nie
Kangsheng Dong
Qinghua Huang
Xiaochen Li
35
0
0
05 Nov 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
129
1
0
27 Oct 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
190
7
0
23 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
Hechang Chen
Yi Chang
Dacheng Tao
Lichao Sun
OffRL
89
0
0
21 Oct 2024
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Timofei Gritsaev
Nikita Morozov
S. Samsonov
D. Tiapkin
85
3
0
20 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
115
4
0
18 Oct 2024
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay
Yuyang Chen
Kaiyan Zhao
Yiming Wang
Ming Yang
Jian Zhang
Yan Li
157
1
0
16 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
83
1
0
15 Oct 2024
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
Yunho Kim
Jaehyun Park
Heejun Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
81
1
0
15 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
38
2
0
15 Oct 2024
PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion
Runsong Zhu
Shi Qiu
Qianyi Wu
Ka-Hei Hui
Pheng-Ann Heng
Chi-Wing Fu
60
4
0
14 Oct 2024
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
Bo Peng
Xia Ning
117
0
0
12 Oct 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
134
14
0
09 Oct 2024
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
74
0
0
08 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
106
1
0
07 Oct 2024
Adaptive teachers for amortized samplers
Minsu Kim
Sanghyeok Choi
Taeyoung Yun
Emmanuel Bengio
Leo Feng
Jarrid Rector-Brooks
Sungsoo Ahn
Jinkyoo Park
Nikolay Malkin
Yoshua Bengio
488
7
0
02 Oct 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training
Pihe Hu
Shaolong Li
Zhuoran Li
L. Pan
Longbo Huang
42
0
0
28 Sep 2024
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu
Qiayuan Liao
Zhongyu Li
Koushil Sreenath
Negar Mehr
Negar Mehr
LM&Ro
360
4
0
27 Sep 2024
DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation
Shiwei Feng
Xuan Chen
Zhiyuan Cheng
Zikang Xiong
Yifei Gao
Siyuan Cheng
Sayali Kate
Xiangyu Zhang
OffRL
73
0
0
17 Sep 2024
CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes
Zhenhuan Liu
Shuai Liu
Zhiwei Ning
Jie Yang
Wei Liu
3DV
3DGS
66
2
0
08 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
95
2
0
07 Sep 2024
Robust off-policy Reinforcement Learning via Soft Constrained Adversary
Kosuke Nakanishi
Akihiro Kubo
Yuji Yasui
Shin Ishii
81
0
0
31 Aug 2024
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
63
0
0
30 Aug 2024
Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning
Mohammadamin Banayeeanzade
Mahdi Soltanolkotabi
Mohammad Rostami
CLL
LRM
311
4
0
29 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
193
4
0
20 Aug 2024
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
181
0
0
19 Aug 2024
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
113
5
0
14 Aug 2024
Previous
1
2
3
4
5
...
28
29
30
Next