ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay
v1v2v3v4 (latest)

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,454 papers shown
Title
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
33
0
0
18 Jun 2025
Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning
Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning
Stella C. Dong
James R. Finlay
22
0
0
16 Jun 2025
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
37
0
0
10 Jun 2025
Contextual Experience Replay for Self-Improvement of Language Agents
Contextual Experience Replay for Self-Improvement of Language Agents
Yitao Liu
Chenglei Si
Karthik Narasimhan
Shunyu Yao
LLMAG
39
0
0
07 Jun 2025
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
Yifan Sun
Jingyan Shen
Yibin Wang
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
Huan Zhang
97
0
0
05 Jun 2025
Adaptive Destruction Processes for Diffusion Samplers
Adaptive Destruction Processes for Diffusion Samplers
Timofei Gritsaev
Nikita Morozov
Kirill Tamogashev
D. Tiapkin
S. Samsonov
A. Naumov
Dmitry Vetrov
Nikolay Malkin
64
0
0
02 Jun 2025
Hybrid Cross-domain Robust Reinforcement Learning
Hybrid Cross-domain Robust Reinforcement Learning
Linh Le Pham Van
Minh Hoang Nguyen
Hung Le
H. Tran
Sunil R. Gupta
OffRL
46
0
0
29 May 2025
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
Yu-Heng Hung
Kai-Jie Lin
Yu-Heng Lin
Chien-Yi Wang
Cheng Sun
Ping-Chun Hsieh
73
1
0
28 May 2025
Proxy-Free GFlowNet
Proxy-Free GFlowNet
Ruishuo Chen
Xun Wang
Rui Hu
Zhuoran Li
Longbo Huang
74
0
0
26 May 2025
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
Yuzheng Hu
Fan Wu
Haotian Ye
David A. Forsyth
James Y. Zou
Nan Jiang
Jiaqi W. Ma
Han Zhao
OffRL
79
0
0
25 May 2025
Self-Evolving Curriculum for LLM Reasoning
Self-Evolving Curriculum for LLM Reasoning
Xiaoyin Chen
Jiarui Lu
Minsu Kim
Dinghuai Zhang
Jian Tang
Alexandre Piché
Nicolas Angelard-Gontier
Yoshua Bengio
Ehsan Kamalloo
ReLMLRM
124
0
0
20 May 2025
Learning to Play Like Humans: A Framework for LLM Adaptation in Interactive Fiction Games
Learning to Play Like Humans: A Framework for LLM Adaptation in Interactive Fiction Games
Jinming Zhang
Yunfei Long
LLMAG
57
0
0
18 May 2025
Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents
Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents
Diksha Goel
Kristen Moore
Jeff Wang
Minjune Kim
Thanh Thi Nguyen
AAML
36
0
0
16 May 2025
Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs
Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs
Zijian An
Lifeng Zhou
55
0
0
08 May 2025
Understand the Effect of Importance Weighting in Deep Learning on Dataset Shift
Understand the Effect of Importance Weighting in Deep Learning on Dataset Shift
Thien Nhan Vo
96
0
0
06 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
91
0
0
06 May 2025
Unraveling the Rainbow: can value-based methods schedule?
Unraveling the Rainbow: can value-based methods schedule?
Arthur Corrêa
Alexandre Jesus
Cristóvão Silva
Samuel Moniz
OffRL
78
0
0
06 May 2025
Enhancing New-item Fairness in Dynamic Recommender Systems
Enhancing New-item Fairness in Dynamic Recommender Systems
Huizhong Guo
Zhu Sun
Donghai Hong
Tianjun Wei
Jinfeng Li
Jie Zhang
66
0
0
30 Apr 2025
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Alkis Sygkounas
Ioannis Athanasiadis
Andreas Persson
Michael Felsberg
Amy Loutfi
OffRL
103
0
0
28 Apr 2025
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Lynn Cherif
Flemming Kondrup
David Venuto
Ankit Anand
Doina Precup
Khimya Khetarpal
LM&Ro
199
0
0
24 Apr 2025
Noise-Tolerant Coreset-Based Class Incremental Continual Learning
Noise-Tolerant Coreset-Based Class Incremental Continual Learning
Edison Mucllari
Aswin Raghavan
Z. Daniels
CLLNoLa
430
0
0
23 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRLLRM
77
2
0
19 Apr 2025
Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
Yixuan Even Xu
Yash Savani
Fei Fang
Zico Kolter
OffRL
115
12
0
18 Apr 2025
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Fikrican Özgür
René Zurbrugg
Suryansh Kumar
86
0
0
15 Apr 2025
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
Haozhe Wang
Chao Qu
Zuming Huang
Wei Chu
Fangzhen Lin
Wenhu Chen
OffRLReLMSyDaLRMVLM
168
40
0
10 Apr 2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
Xinze Wang
Zhiyong Yang
Chao Feng
Hongjin Lu
Linjie Li
Chung-Ching Lin
Kevin Qinghong Lin
Furong Huang
Lijuan Wang
OODDReLMLRMVLM
224
19
0
10 Apr 2025
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
Dolton Fernandes
Pramod Kaushik
Harsh Shukla
Bapi Raju Surampudi
53
0
0
08 Apr 2025
AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems
AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems
Zhuoli Zhuang
Cheng-You Lu
Yu-Cheng Chang
Yu-Kai Wang
T. Do
Chin-Teng Lin
115
0
0
08 Apr 2025
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Anja Surina
Amin Mansouri
Lars Quaedvlieg
Amal Seddas
Maryna Viazovska
Emmanuel Abbe
Çağlar Gülçehre
116
3
0
07 Apr 2025
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Manuel Sage
Martin Staniszewski
Yaoyao Fiona Zhao
89
2
0
06 Apr 2025
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
Xuerui Su
Shufang Xie
Guoqing Liu
Yingce Xia
Renqian Luo
Peiran Jin
Zhiming Ma
Yue Wang
Zun Wang
Yuting Liu
LRM
97
5
0
06 Apr 2025
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices
Zhe Wang
Yifei Zhu
100
0
0
04 Apr 2025
Exploration-Driven Generative Interactive Environments
Exploration-Driven Generative Interactive Environments
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen3DV
117
1
0
03 Apr 2025
FastFlow: Early Yet Robust Network Flow Classification using the Minimal Number of Time-Series Packets
FastFlow: Early Yet Robust Network Flow Classification using the Minimal Number of Time-Series Packets
Rushi Jayeshkumar Babaria
Minzhao Lyu
Gustavo E. A. P. A. Batista
V. Sivaraman
AI4TS
58
0
0
02 Apr 2025
MAER-Nav: Bidirectional Motion Learning Through Mirror-Augmented Experience Replay for Robot Navigation
MAER-Nav: Bidirectional Motion Learning Through Mirror-Augmented Experience Replay for Robot Navigation
Shanze Wang
Mingao Tan
Zhiyong Yang
Biao Huang
Xiaoyu Shen
Hailong Huang
Wei Zhang
59
0
0
31 Mar 2025
Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound
Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound
Yuhao Huang
Ao Chang
Haoran Dou
X. Tao
Xinrui Zhou
...
Ruobing Huang
Alejandro F Frangi
Lingyun Bao
Xin Yang
Dong Ni
123
1
0
26 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
76
0
0
20 Mar 2025
A Generalist Hanabi Agent
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
481
0
0
17 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
132
0
0
17 Mar 2025
Deep Learning Agents Trained For Avoidance Behave Like Hawks And Doves
Aryaman Reddi
Glenn Vinnicombe
80
0
0
14 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCVBDL
507
2
0
14 Mar 2025
PER-DPP Sampling Framework and Its Application in Path Planning
Junzhe Wang
65
0
0
10 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
102
1
0
07 Mar 2025
Reinforcement learning with combinatorial actions for coupled restless bandits
Reinforcement learning with combinatorial actions for coupled restless bandits
Lily Xu
Bryan Wilder
Elias B. Khalil
Milind Tambe
107
1
0
01 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
168
0
0
27 Feb 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
117
0
0
26 Feb 2025
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Taiyi Wang
Zhihao Wu
Jianheng Liu
Jianye Hao
Jun Wang
Kun Shao
OffRL
122
29
0
24 Feb 2025
Enhancing PPO with Trajectory-Aware Hybrid Policies
Qisai Liu
Zhanhong Jiang
Hsin-Jung Yang
Mahsa Khosravi
Joshua R. Waite
Soumik Sarkar
112
0
0
21 Feb 2025
Multi-Objective Reinforcement Learning for Critical Scenario Generation of Autonomous Vehicles
Jiahui Wu
Chengjie Lu
Aitor Arrieta
Shaukat Ali
80
0
0
18 Feb 2025
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen
Anas Badr
Ali Abohendy
Hatem Alsaadawy
Nadine Alsayad
143
2
0
14 Feb 2025
1234...282930
Next