ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,441 papers shown
Title
Graph-attention-based Casual Discovery with Trust Region-navigated
  Clipping Policy Optimization
Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization
Shixuan Liu
Yanghe Feng
Keyu Wu
Guangquan Cheng
Jincai Huang
Zhong Liu
CML
67
7
0
27 Dec 2024
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
109
1
0
22 Dec 2024
Design of Restricted Normalizing Flow towards Arbitrary Stochastic
  Policy with Computational Efficiency
Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency
Taisuke Kobayashi
Takumi Aotani
145
5
0
17 Dec 2024
Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous
  Autonomous Surface Vehicles with Deep Reinforcement Learning
Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning
Alejandro Mendoza Barrionuevo
S. Luis
Daniel Gutiérrez-Reina
S. T. Marín
71
1
0
03 Dec 2024
Playable Game Generation
Playable Game Generation
Mingyu Yang
Junyou Li
Zhongbin Fang
Sheng Chen
Yangbin Yu
Qiang Fu
Wei Yang
Deheng Ye
VGen
76
9
0
01 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Jianming Hu
Dingyi Yao
75
0
0
30 Nov 2024
A Local Information Aggregation based Multi-Agent Reinforcement Learning
  for Robot Swarm Dynamic Task Allocation
A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation
Yang Lv
Jinlong Lei
Peng Yi
59
1
0
29 Nov 2024
CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios
  For Safety Hardening
CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening
A. Kulkarni
Shangtong Zhang
Madhur Behl
AAML
63
0
0
26 Nov 2024
From Prototypes to General Distributions: An Efficient Curriculum for
  Masked Image Modeling
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Jinhong Lin
Cheng-En Wu
Huanran Li
Jifan Zhang
Yu Hen Hu
Pedro Morgado
43
0
0
16 Nov 2024
Act in Collusion: A Persistent Distributed Multi-Target Backdoor in
  Federated Learning
Act in Collusion: A Persistent Distributed Multi-Target Backdoor in Federated Learning
Tao Liu
Wu Yang
Chen Xu
Jiguang Lv
Huanran Wang
Yuhang Zhang
Shuchun Xu
Dapeng Man
AAML
FedML
35
0
0
06 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
40
0
0
06 Nov 2024
Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with
  Reinforcement Learning
Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning
Yang Zhao
Zidong Nie
Kangsheng Dong
Qinghua Huang
Xiaochen Li
25
0
0
05 Nov 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
44
1
0
27 Oct 2024
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
119
2
0
23 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on
  Aligned Spaces
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
Hechang Chen
Yi Chang
Dacheng Tao
Lichao Sun
OffRL
44
0
0
21 Oct 2024
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Timofei Gritsaev
Nikita Morozov
S. Samsonov
D. Tiapkin
26
0
0
20 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling
  IoT Applications in Edge and Cloud Computing Environments
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
37
3
0
18 Oct 2024
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay
Yuyang Chen
Kaiyan Zhao
Yiming Wang
Ming Yang
Jian Zhang
Yan Li
47
1
0
16 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive
  Revaluation
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
39
1
0
15 Oct 2024
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC
  Task
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
Yunho Kim
Jaehyun Park
Heejun Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
40
1
0
15 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous
  State and Action Space
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
25
2
0
15 Oct 2024
PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion
PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion
Runsong Zhu
Shi Qiu
Qianyi Wu
Ka-Hei Hui
Pheng-Ann Heng
Chi-Wing Fu
36
3
0
14 Oct 2024
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
B. Peng
Xia Ning
38
0
0
12 Oct 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge
  with Curriculum Preference Learning
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
60
11
0
09 Oct 2024
Learning in complex action spaces without policy gradients
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
34
0
0
08 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
45
1
0
07 Oct 2024
Adaptive teachers for amortized samplers
Adaptive teachers for amortized samplers
Minsu Kim
Sanghyeok Choi
Taeyoung Yun
Emmanuel Bengio
Leo Feng
Jarrid Rector-Brooks
Sungsoo Ahn
Jinkyoo Park
Nikolay Malkin
Yoshua Bengio
258
4
0
02 Oct 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse
  Training
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training
Pihe Hu
Shaolong Li
Zhuoran Li
L. Pan
Longbo Huang
29
0
0
28 Sep 2024
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu
Qiayuan Liao
Zhongyu Li
Koushil Sreenath
Negar Mehr
Negar Mehr
LM&Ro
210
3
0
27 Sep 2024
DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement
  Learning based Meta-Planner in Robot Navigation
DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation
Shiwei Feng
Xuan Chen
Zhiyuan Cheng
Zikang Xiong
Yifei Gao
Siyuan Cheng
Sayali Kate
Xiangyu Zhang
OffRL
30
0
0
17 Sep 2024
CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes
CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes
Zhenhuan Liu
Shuai Liu
Zhiwei Ning
Jie Yang
Wei Liu
3DV
3DGS
37
2
0
08 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of
  Value and Policy Churn
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
50
1
0
07 Sep 2024
Robust off-policy Reinforcement Learning via Soft Constrained Adversary
Robust off-policy Reinforcement Learning via Soft Constrained Adversary
Kosuke Nakanishi
Akihiro Kubo
Yuji Yasui
Shin Ishii
53
0
0
31 Aug 2024
A Tighter Convergence Proof of Reverse Experience Replay
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
21
0
0
30 Aug 2024
Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning
Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning
Mohammadamin Banayeeanzade
Mahdi Soltanolkotabi
Mohammad Rostami
CLL
LRM
103
1
0
29 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
95
3
0
20 Aug 2024
Enhancing Reinforcement Learning Through Guided Search
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
99
0
0
19 Aug 2024
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
49
5
0
14 Aug 2024
Parallel Distributional Deep Reinforcement Learning for Mapless
  Navigation of Terrestrial Mobile Robots
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
38
3
0
11 Aug 2024
Review of Cloud Service Composition for Intelligent Manufacturing
Review of Cloud Service Composition for Intelligent Manufacturing
Cuixia Li
Liqiang Liu
Li Shi
29
0
0
03 Aug 2024
Adaptive traffic signal safety and efficiency improvement by multi
  objective deep reinforcement learning approach
Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
Shahin Mirbakhsh
Mahdi Azizi
23
2
0
01 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
34
1
0
30 Jul 2024
Collision Probability Distribution Estimation via Temporal Difference
  Learning
Collision Probability Distribution Estimation via Temporal Difference Learning
Thomas Steinecker
Thorsten Luettel
Mirko Maehlisch
19
0
0
29 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement
  Learning
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
26
2
0
26 Jul 2024
Principal-Agent Reinforcement Learning
Principal-Agent Reinforcement Learning
Dima Ivanov
Paul Dutting
Inbal Talgam-Cohen
Tonghan Wang
David C. Parkes
42
3
0
25 Jul 2024
Reinforcement Learning-based Adaptive Mitigation of Uncorrected DRAM
  Errors in the Field
Reinforcement Learning-based Adaptive Mitigation of Uncorrected DRAM Errors in the Field
Isaac Boixaderas
Sergi Moré
Javier Bartolome
David Vicente
Petar Radojković
Paul M. Carpenter
Eduard Ayguadé
27
1
0
23 Jul 2024
Multiple Importance Sampling for Stochastic Gradient Estimation
Multiple Importance Sampling for Stochastic Gradient Estimation
Corentin Salaün
Xingchang Huang
Iliyan Georgiev
Niloy J. Mitra
Gurprit Singh
32
1
0
22 Jul 2024
Investigating the Interplay of Prioritized Replay and Generalization
Investigating the Interplay of Prioritized Replay and Generalization
Parham Mohammad Panahi
Andrew Patterson
Martha White
Adam White
58
0
0
12 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
34
0
0
08 Jul 2024
Embracing Massive Medical Data
Embracing Massive Medical Data
Yu-Cheng Chou
Zongwei Zhou
Alan Yuille
CLL
OOD
40
4
0
05 Jul 2024
Previous
12345...272829
Next