A Brief Survey of Deep Reinforcement Learning

19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
    OffRL

Papers citing "A Brief Survey of Deep Reinforcement Learning"

50 / 604 papers shown
Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment
Yi Zheng
Zehao Li
Peng Jiang
Yijie Peng
57
0
0
28 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
127
1
0
27 Oct 2024
Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Yuting Tang
Xin-Qiang Cai
Jing-Cheng Pang
Qiyu Wu
Yao-Xiang Ding
Masashi Sugiyama
OffRL
59
0
0
26 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
38
2
0
15 Oct 2024
A Scalable Communication Protocol for Networks of Large Language Models
Samuele Marro
Emanuele La Malfa
Jesse Wright
Ge Li
Nigel Shadbolt
Michael Wooldridge
Philip Torr
GNN AIFin
83
15
0
14 Oct 2024
Transfer Learning for a Class of Cascade Dynamical Systems
Shima Rabiei
Sandipan Mishra
Santiago Paternain
53
0
0
09 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
84
0
0
06 Oct 2024
TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction
Xuechen Mu
Zhenyu Huang
Kewei Li
Haotian Zhang
Xiuli Wang
Yusi Fan
Kai Zhang
Fengfeng Zhou
AI4TS OffRL
28
0
0
27 Sep 2024
Artificial Intelligence for Secured Information Systems in Smart Cities: Collaborative IoT Computing with Deep Reinforcement Learning and Blockchain
Amin Zakaie Far
Mohammad Zakaie Far
Sonia Gharibzadeh
Shiva Zangeneh
Leila Amini
Morteza Rahimi
Saeed Asadi
127
6
0
24 Sep 2024
Semifactual Explanations for Reinforcement Learning
Jasmina Gajcin
Jovan Jeromela
Ivana Dusparic
OffRL
65
1
0
09 Sep 2024
A Comprehensive Survey on Evidential Deep Learning and Its Applications
Junyu Gao
Mengyuan Chen
Liangyu Xiang
Changsheng Xu
EDL BDL UQCV
131
6
0
07 Sep 2024
Formal Verification and Control with Conformal Prediction
Lars Lindemann
Yiqi Zhao
Xinyi Yu
George J. Pappas
Jyotirmoy Deshmukh
731
17
0
31 Aug 2024
Statistical QoS Provision in Business-Centric Networks
Chang Wu
Yuang Chen
Hancheng Lu
101
0
0
28 Aug 2024
cc-DRL: a Convex Combined Deep Reinforcement Learning Flight Control Design for a Morphing Quadrotor
Tao Yang
Huai-Ning Wu
Jun-Wei Wang
154
0
0
23 Aug 2024
Robust Iterative Value Conversion: Deep Reinforcement Learning for Neurochip-driven Edge Robots
Y. Kadokawa
Tomohito Kodera
Yoshihisa Tsurumine
Shinya Nishimura
Takamitsu Matsubara
91
2
0
23 Aug 2024
Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems
Shaozhuang Bai
Zhenzhen Gao
Xuewen Liao
49
0
0
22 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
134
15
0
19 Aug 2024
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
Yinhuai Wang
Qihan Zhao
Runyi Yu
Ailing Zeng
Jing Lin
...
Xiu Li
Qifeng Chen
Jian Zhang
Lei Zhang
Ping Tan
112
2
0
12 Aug 2024
DeepAir: A Multi-Agent Deep Reinforcement Learning Based Scheme for an Unknown User Location Problem
Baris Yamansavascilar
Atay Ozgovde
Cem Ersoy
30
0
0
11 Aug 2024
Deep Reinforcement Learning for the Design of Metamaterial Mechanisms with Functional Compliance Control
Yejun Choi
Yeoneung Kim
Keun Park
35
0
0
08 Aug 2024
Real-time Dexterous Telemanipulation with an End-Effect-Oriented Learning-based Approach
Haoyang Wang
He Bai
Xiaoli Zhang
Yunsik Jung
Michel Bowman
Lingfeng Tao
37
4
0
01 Aug 2024
Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
Shahin Mirbakhsh
Mahdi Azizi
49
3
0
01 Aug 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang
Erdun Gao
Fan Feng
Xinyue Wang
Shikui Tu
Lei Xu
CML OOD TTA
108
1
0
30 Jul 2024
A Method for Fast Autonomy Transfer in Reinforcement Learning
D. Sahabandu
Bhaskar Ramasubramanian
M. Alexiou
J. S. Mertoguno
L. Bushnell
Radha Poovendran
62
0
0
29 Jul 2024
Dataset Distillation for Offline Reinforcement Learning
Jonathan Light
Yuanzhe Liu
Ziniu Hu
DD
91
3
0
29 Jul 2024
Explainable Post hoc Portfolio Management Financial Policy of a Deep Reinforcement Learning agent
Alejandra de la Rica Escudero
E.C. Garrido-Merchán
Maria Coronado Vaca
AIFin
97
3
0
19 Jul 2024
Maintenance Strategies for Sewer Pipes with Multi-State Degradation and Deep Reinforcement Learning
L. A. Jimenez-Roa
T. D. Simão
Zaharah Bukhsh
T. Tinga
Hajo Molegraaf
Nils Jansen
Marielle Stoelinga
AI4CE
45
3
0
17 Jul 2024
Revolutionizing Bridge Operation and maintenance with LLM-based Agents: An Overview of Applications and Insights
Xinyu-Chen
Lianzhen-Zhang
LLMAG AI4CE
106
4
0
14 Jul 2024
Model-free Distortion Canceling and Control of Quantum Devices
A. F. Fouad
A. Youssry
A. El-Rafei
Sherif Hammad
66
2
0
13 Jul 2024
Evaluating Front-end & Back-end of Human Automation Interaction Applications A Hypothetical Benchmark
Gonçalo Hora de Carvalho
58
0
0
12 Jul 2024
Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control
Sicong Jiang
Seongjin Choi
Lijun Sun
80
1
0
12 Jul 2024
Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks
Pu Feng
Junkang Liang
Size Wang
Xin Yu
Xin Ji
Yiting Chen
Kui Zhang
Rongye Shi
Wenjun Wu
126
7
0
11 Jul 2024
AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review
Rui Jin
Derun Li
Dehui Xiang
Lei Zhang
Hailing Zhou
Fei Shi
Weifang Zhu
Jing Cai
Tao Peng
Xinjian Chen
79
0
0
09 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
100
1
0
05 Jul 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
150
0
0
01 Jul 2024
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
101
2
0
30 Jun 2024
A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphs
Zhicheng Liang
Yifan Yang
Xiangyu Ke
Xiaokui Xiao
Yunjun Gao
134
0
0
20 Jun 2024
Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks
Yuhao Pan
Xiucheng Wang
Nan Cheng
Qi Qiu
88
0
0
19 Jun 2024
An Internal Model Principle For Robots
Vadim Weinstein
Tamara Alshammari
K. Timperi
Mehdi Bennis
Steven M. Lavalle
65
3
0
17 Jun 2024
Optimizing Deep Reinforcement Learning for Adaptive Robotic Arm Control
Jonaid Shianifar
Michael Schukat
Karl Mason
42
3
0
12 Jun 2024
DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach
Zhang Liu
Hongyang Du
Junzhe Lin
Zhibin Gao
Lianfen Huang
Seyyedali Hosseinalipour
Dusit Niyato
61
10
0
11 Jun 2024
Online Adaptation for Enhancing Imitation Learning Policies
Federico Malato
Ville Hautamaki
OnRL
49
2
0
07 Jun 2024
Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning
Zhongzheng Wang
Yuntian Chen
Guodong Chen
Dongxiao Zhang
AI4CE
81
1
0
07 Jun 2024
Poisoning Attacks and Defenses in Recommender Systems: A Survey
Zongwei Wang
Junliang Yu
Min Gao
Wei Yuan
Guanhua Ye
S. Sadiq
Hongzhi Yin
AAML
82
6
0
03 Jun 2024
Deep Reinforcement Learning for Sim-to-Real Policy Transfer of VTOL-UAVs Offshore Docking Operations
A. M. Ali
Aryaman Gupta
Hashim A. Hashim
OffRL
65
7
0
02 Jun 2024
Statistical Context Detection for Deep Lifelong Reinforcement Learning
Jeffery Dick
Saptarshi Nath
Christos Peridis
Eseoghene Ben-Iwhiwhu
Soheil Kolouri
Andrea Soltoggio
OffRL
83
2
0
29 May 2024
Matrix Low-Rank Approximation For Policy Gradient Methods
Sergio Rozada
A. Marques
64
2
0
27 May 2024
Matrix Low-Rank Trust Region Policy Optimization
Sergio Rozada
Antonio G. Marques
87
0
0
27 May 2024
Survey of Graph Neural Network for Internet of Things and NextG Networks
Sabarish Krishna Moorthy
Jithin Jagannath
76
2
0
27 May 2024
Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation
Hamid Taheri
Seyed Rasoul Hosseini
37
9
0
25 May 2024