ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.05866
  4. Cited By
A Brief Survey of Deep Reinforcement Learning
v1v2 (latest)

A Brief Survey of Deep Reinforcement Learning

19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
    OffRL
ArXiv (abs)PDFHTML

Papers citing "A Brief Survey of Deep Reinforcement Learning"

50 / 604 papers shown
Title
Energy-Based Transfer for Reinforcement Learning
Energy-Based Transfer for Reinforcement Learning
Zeyun Deng
Jasorsi Ghosh
Fiona Xie
Yuzhe Lu
Katia Sycara
Joseph Campbell
15
0
0
19 Jun 2025
PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning
PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning
Liangyan Li
Yangyi Liu
Yimo Ning
Stefano Rini
Jun Chen
FedML
19
0
0
18 Jun 2025
Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework
Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework
Hemjyoti Das
Minh Nhat Vu
Christian Ott
16
0
0
16 Jun 2025
Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator
Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator
Alberto Bazán-Guillén
Carlos Beis-Penedo
Diego Cajaraville-Aboy
Pablo Barbecho-Bautista
R. Redondo
Luis J. de la Cruz Llopis
Ana Fernández-Vilas
Mónica Aguilar Igartua
M. Fernández-Veiga
AI4TS
19
0
0
09 Jun 2025
CARoL: Context-aware Adaptation for Robot Learning
CARoL: Context-aware Adaptation for Robot Learning
Zechen Hu
Tong Xu
Xuesu Xiao
Xuan Wang
27
0
0
08 Jun 2025
Enhancing Efficiency and Propulsion in Bio-mimetic Robotic Fish through End-to-End Deep Reinforcement Learning
Xinyu Cui
Boai Sun
Yi Zhu
Ning Yang
Haifeng Zhang
Weicheng Cui
D. Fan
Jun Wang
159
9
0
05 Jun 2025
Autoencoding Random Forests
Autoencoding Random Forests
Binh Duc Vu
Jan Kapar
Marvin N. Wright
David S. Watson
154
0
0
27 May 2025
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
Yuzheng Hu
Fan Wu
Haotian Ye
David A. Forsyth
James Y. Zou
Nan Jiang
Jiaqi W. Ma
Han Zhao
OffRL
74
0
0
25 May 2025
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Jingtong Gao
Ling Pan
Yejing Wang
Rui Zhong
Chi Lu
Qingpeng Cai
Peng Jiang
Xiangyu Zhao
LRM
101
1
0
23 May 2025
LLM-Powered AI Agent Systems and Their Applications in Industry
LLM-Powered AI Agent Systems and Their Applications in Industry
Guannan Liang
Qianqian Tong
LLMAGLM&Ro
83
3
0
22 May 2025
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Donghoon Lee
Tung M. Luu
Younghwan Lee
Chang D. Yoo
OffRLVLM
72
0
0
16 May 2025
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
David Ramos
Lucas Lacasa
E. Valero
G. Rubio
AI4CE
111
0
0
05 May 2025
A Generalised and Adaptable Reinforcement Learning Stopping Method
A Generalised and Adaptable Reinforcement Learning Stopping Method
Reem Bin-Hezam
Mark Stevenson
57
0
0
03 May 2025
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
Sahil Tomar
Shamshe Alam
Sandeep Kumar
Amit Mathur
112
1
0
29 Apr 2025
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
Dixiao Wei
Peng Yi
Jinlong Lei
Yiguang Hong
Yuchuan Du
432
0
0
28 Apr 2025
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Koki Inami
Masashi Konosu
Koki Yamane
Nozomu Masuya
Yunhan Li
Yu-Han Shu
Hiroshi Sato
Shinnosuke Homma
S. Sakaino
234
0
0
28 Apr 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRLAI4TS
80
0
0
27 Apr 2025
A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs
A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs
Jalal Arabneydi
Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Matthew E. Taylor
Matthew J. Guzdial
Antoine Fagette
Younes Zerouali
37
0
0
23 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRLLRM
77
2
0
19 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
78
0
0
14 Apr 2025
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
Bingchen Miao
Y. Wu
Minghe Gao
Qifan Yu
Wendong Bu
Wenqiao Zhang
Yunfei Li
Siliang Tang
Tat-Seng Chua
Juncheng Billy Li
LLMAGLRM
133
1
0
24 Mar 2025
Efficient Transformed Gaussian Process State-Space Models for Non-Stationary High-Dimensional Dynamical Systems
Efficient Transformed Gaussian Process State-Space Models for Non-Stationary High-Dimensional Dynamical Systems
Zhidi Lin
Ying Li
Feng Yin
Juan Maroñas
Alexandre Thiéry
164
0
0
24 Mar 2025
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Minori Narita
Ryo Kuroiwa
J. Christopher Beck
78
0
0
20 Mar 2025
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Zhaoxin Li
Zhang Xi-Jia
Batuhan Altundas
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
OffRL
76
0
0
20 Mar 2025
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control
Jeff Jewett
Sandhya Saisubramanian
OffRL
117
0
0
19 Mar 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAttOffRL
168
1
0
19 Mar 2025
Deep Belief Markov Models for POMDP Inference
Deep Belief Markov Models for POMDP Inference
Giacomo Arcieri
K. Papakonstantinou
D. Štraub
Eleni Chatzi
86
0
0
17 Mar 2025
Context-aware Constrained Reinforcement Learning Based Energy-Efficient Power Scheduling for Non-stationary XR Data Traffic
Kexuan Wang
An Liu
83
0
0
13 Mar 2025
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization
Jiajun Yu
Y. Zheng
Huan Yee Koh
Xiaojun Jia
Tianyue Wang
Haishuai Wang
119
2
0
05 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
Jing Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
338
0
0
05 Mar 2025
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Lixing Lyu
Jiashuo Jiang
Wang Chi Cheung
87
1
0
24 Feb 2025
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Yudong Xu
Wenhao Li
Scott Sanner
Elias Boutros Khalil
99
0
0
18 Feb 2025
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen
Anas Badr
Ali Abohendy
Hatem Alsaadawy
Nadine Alsayad
143
2
0
14 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
168
2
0
13 Feb 2025
Reinforced Lifelong Editing for Language Models
Reinforced Lifelong Editing for Language Models
Zherui Li
Houcheng Jiang
Hao Chen
Baolong Bi
Zhenhong Zhou
Fei Sun
Sihang Li
Xinze Wang
KELM
157
8
0
09 Feb 2025
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Ding Hu
Pengxiang Hua
Zhen Huang
261
0
0
09 Feb 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
117
6
0
24 Jan 2025
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Wessel Ledder
Yuzhen Qin
Kiki van der Heijden
207
1
0
20 Jan 2025
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Animesh Singh Basnet
M. C. Ghanem
Dipo Dunsin
Wiktor Sowinski-Mydlarz
AAML
63
0
0
08 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
146
4
0
03 Jan 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
464
0
0
03 Jan 2025
ReinFog: A DRL Empowered Framework for Resource Management in Edge and
  Cloud Computing Environments
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
106
1
0
20 Nov 2024
Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms
Haizhou Ge
Ruixiang Wang
Zhu-ang Xu
Hongrui Zhu
Ruichen Deng
Yuhang Dong
Zeyu Pang
Guyue Zhou
Junyu Zhang
Lu Shi
140
1
0
18 Nov 2024
Joint Age-State Belief is All You Need: Minimizing AoII via Pull-Based
  Remote Estimation
Joint Age-State Belief is All You Need: Minimizing AoII via Pull-Based Remote Estimation
Ismail Cosandal
S. Ulukus
Nail Akar
50
3
0
11 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
98
6
0
08 Nov 2024
Hypercube Policy Regularization Framework for Offline Reinforcement
  Learning
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
OffRL
60
0
0
07 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental
  Adaptation
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
79
1
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced
  Resuscitation Techniques
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
92
0
0
03 Nov 2024
Multi-Agent Deep Q-Network with Layer-based Communication Channel for
  Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Mohammad Feizabadi
Arman Hosseini
Zakaria Yahouni
94
0
0
01 Nov 2024
PrefPaint: Aligning Image Inpainting Diffusion Model with Human
  Preference
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
Kendong Liu
Zhiyu Zhu
Chuanhao Li
Hui Liu
H. Zeng
Junhui Hou
EGVM
76
4
0
29 Oct 2024
1234...111213
Next