ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.05866
  4. Cited By
A Brief Survey of Deep Reinforcement Learning

A Brief Survey of Deep Reinforcement Learning

19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
    OffRL
ArXivPDFHTML

Papers citing "A Brief Survey of Deep Reinforcement Learning"

50 / 596 papers shown
Title
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Donghoon Lee
Tung M. Luu
Younghwan Lee
Chang D. Yoo
OffRL
VLM
9
0
0
16 May 2025
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
David Ramos
Lucas Lacasa
E. Valero
G. Rubio
AI4CE
27
0
0
05 May 2025
A Generalised and Adaptable Reinforcement Learning Stopping Method
A Generalised and Adaptable Reinforcement Learning Stopping Method
Reem Bin-Hezam
Mark Stevenson
24
0
0
03 May 2025
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
Sahil Tomar
Shamshe Alam
Sandeep Kumar
Amit Mathur
46
0
0
29 Apr 2025
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Koki Inami
Masashi Konosu
Koki Yamane
Nozomu Masuya
Yunhan Li
Yu-Han Shu
Hiroshi Sato
Shinnosuke Homma
S. Sakaino
49
0
0
28 Apr 2025
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
Dixiao Wei
Peng Yi
Jinlong Lei
Yiguang Hong
Yuchuan Du
134
0
0
28 Apr 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRL
AI4TS
48
0
0
27 Apr 2025
A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs
A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs
Jalal Arabneydi
Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Matthew E. Taylor
Matthew J. Guzdial
Antoine Fagette
Younes Zerouali
21
0
0
23 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
32
0
0
19 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
41
0
0
14 Apr 2025
Efficient Transformed Gaussian Process State-Space Models for Non-Stationary High-Dimensional Dynamical Systems
Efficient Transformed Gaussian Process State-Space Models for Non-Stationary High-Dimensional Dynamical Systems
Zhidi Lin
Ying Li
Feng Yin
Juan Maroñas
Alexandre Thiéry
54
0
0
24 Mar 2025
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark
Bingchen Miao
Y. Wu
Minghe Gao
Qifan Yu
Wendong Bu
Wenqiao Zhang
Yunfei Li
Siliang Tang
Tat-Seng Chua
Juncheng Billy Li
LLMAG
LRM
56
0
0
24 Mar 2025
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Zhaoxin Li
Zhang Xi-Jia
Batuhan Altundas
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
OffRL
41
0
0
20 Mar 2025
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Minori Narita
Ryo Kuroiwa
J. Christopher Beck
42
0
0
20 Mar 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt
OffRL
70
0
0
19 Mar 2025
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control
Jeff Jewett
Sandhya Saisubramanian
OffRL
48
0
0
19 Mar 2025
Deep Belief Markov Models for POMDP Inference
Deep Belief Markov Models for POMDP Inference
Giacomo Arcieri
K. Papakonstantinou
D. Štraub
Eleni Chatzi
43
0
0
17 Mar 2025
Context-aware Constrained Reinforcement Learning Based Energy-Efficient Power Scheduling for Non-stationary XR Data Traffic
Kexuan Wang
An Liu
44
0
0
13 Mar 2025
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization
Jiajun Yu
Y. Zheng
Huan Yee Koh
Shirui Pan
Tianyue Wang
Haishuai Wang
64
0
0
05 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
Jingyang Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
148
0
0
05 Mar 2025
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Lixing Lyu
Jiashuo Jiang
Wang Chi Cheung
42
1
0
24 Feb 2025
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Yudong Xu
Wenhao Li
Scott Sanner
Elias Boutros Khalil
44
0
0
18 Feb 2025
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen
Anas Badr
Ali Abohendy
Hatem Alsaadawy
Nadine Alsayad
59
0
0
14 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
89
1
0
13 Feb 2025
Reinforced Lifelong Editing for Language Models
Reinforced Lifelong Editing for Language Models
Zherui Li
Houcheng Jiang
Hao Chen
Baolong Bi
Zhenhong Zhou
Fei Sun
Fan Zhang
Qing Guo
KELM
51
5
0
09 Feb 2025
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Ding Hu
Pengxiang Hua
Zhen Huang
88
0
0
09 Feb 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
48
5
0
24 Jan 2025
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Wessel Ledder
Yuzhen Qin
Kiki van der Heijden
99
0
0
20 Jan 2025
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Animesh Singh Basnet
M. C. Ghanem
Dipo Dunsin
Wiktor Sowinski-Mydlarz
AAML
37
0
0
08 Jan 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
141
0
0
03 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
H. Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
W. Yu
Dong Yu
LLMAG
61
3
0
03 Jan 2025
ReinFog: A DRL Empowered Framework for Resource Management in Edge and
  Cloud Computing Environments
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
72
0
0
20 Nov 2024
Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms
Haizhou Ge
Ruixiang Wang
Zhu-ang Xu
Hongrui Zhu
Ruichen Deng
Yuhang Dong
Zeyu Pang
Guyue Zhou
Junyu Zhang
Lu Shi
78
1
0
18 Nov 2024
Joint Age-State Belief is All You Need: Minimizing AoII via Pull-Based
  Remote Estimation
Joint Age-State Belief is All You Need: Minimizing AoII via Pull-Based Remote Estimation
Ismail Cosandal
S. Ulukus
Nail Akar
21
3
0
11 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
22
2
0
08 Nov 2024
Hypercube Policy Regularization Framework for Offline Reinforcement
  Learning
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
OffRL
28
0
0
07 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental
  Adaptation
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
43
0
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced
  Resuscitation Techniques
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
31
0
0
03 Nov 2024
Multi-Agent Deep Q-Network with Layer-based Communication Channel for
  Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Mohammad Feizabadi
Arman Hosseini
Zakaria Yahouni
35
0
0
01 Nov 2024
PrefPaint: Aligning Image Inpainting Diffusion Model with Human
  Preference
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
Kendong Liu
Zhiyu Zhu
Chuanhao Li
Hui Liu
H. Zeng
Junhui Hou
EGVM
38
2
0
29 Oct 2024
Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and
  Replenishment
Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment
Yi Zheng
Zehao Li
Peng Jiang
Yijie Peng
22
0
0
28 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
36
1
0
27 Oct 2024
Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for
  Reinforcement Learning
Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Yuting Tang
Xin-Qiang Cai
Jing-Cheng Pang
Qiyu Wu
Yao-Xiang Ding
Masashi Sugiyama
OffRL
26
0
0
26 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous
  State and Action Space
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
25
2
0
15 Oct 2024
A Scalable Communication Protocol for Networks of Large Language Models
A Scalable Communication Protocol for Networks of Large Language Models
Samuele Marro
Emanuele La Malfa
Jesse Wright
Bernard Ghanem
Nigel Shadbolt
Michael Wooldridge
Philip H. S. Torr
GNN
AIFin
40
8
0
14 Oct 2024
Transfer Learning for a Class of Cascade Dynamical Systems
Transfer Learning for a Class of Cascade Dynamical Systems
Shima Rabiei
Sandipan Mishra
Santiago Paternain
18
0
0
09 Oct 2024
Urban Computing for Climate and Environmental Justice: Early
  Perspectives From Two Research Initiatives
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
37
0
0
06 Oct 2024
TemporalPaD: a reinforcement-learning framework for temporal feature
  representation and dimension reduction
TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction
Xuechen Mu
Zhenyu Huang
Kewei Li
Haotian Zhang
Xiuli Wang
Yusi Fan
Kai Zhang
Fengfeng Zhou
AI4TS
OffRL
18
0
0
27 Sep 2024
Artificial Intelligence for Secured Information Systems in Smart Cities: Collaborative IoT Computing with Deep Reinforcement Learning and Blockchain
Artificial Intelligence for Secured Information Systems in Smart Cities: Collaborative IoT Computing with Deep Reinforcement Learning and Blockchain
Amin Zakaie Far
Mohammad Zakaie Far
Sonia Gharibzadeh
Shiva Zangeneh
Leila Amini
Morteza Rahimi
Morteza Rahimi
Saeed Asadi
26
5
0
24 Sep 2024
Semifactual Explanations for Reinforcement Learning
Semifactual Explanations for Reinforcement Learning
Jasmina Gajcin
Jovan Jeromela
Ivana Dusparic
OffRL
35
0
0
09 Sep 2024
1234...101112
Next