Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.05866
Cited By
v1
v2 (latest)
A Brief Survey of Deep Reinforcement Learning
19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Brief Survey of Deep Reinforcement Learning"
50 / 604 papers shown
Title
Fusing Blockchain and AI with Metaverse: A Survey
Qinglin Yang
Yetong Zhao
Huawei Huang
Zehui Xiong
Jiawen Kang
Zibin Zheng
98
320
0
10 Jan 2022
3DPG: Distributed Deep Deterministic Policy Gradient Algorithms for Networked Multi-Agent Systems
Adrian Redder
Arunselvan Ramaswamy
Holger Karl
OffRL
43
2
0
03 Jan 2022
Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments
A. Iyer
Karan Grewal
Akash Velu
Lucas O. Souza
Jérémy Forest
Subutai Ahmad
AI4CE
103
46
0
31 Dec 2021
Data-Free Knowledge Transfer: A Survey
Yuang Liu
Wei Zhang
Jun Wang
Jianyong Wang
119
48
0
31 Dec 2021
Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach
Aly Sabri Abdalla
Ali Behfarnia
Vuk Marojevic
45
7
0
21 Dec 2021
Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units
Pegah Rokhforoz
Olga Fink
23
8
0
20 Dec 2021
An Online Data-Driven Emergency-Response Method for Autonomous Agents in Unforeseen Situations
Glenn Maguire
Nicholas A. Ketz
Praveen K. Pilly
Jean-Baptiste Mouret
47
1
0
17 Dec 2021
Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Jasmina Gajcin
Rahul Nair
Tejaswini Pedapati
Radu Marinescu
Elizabeth M. Daly
Ivana Dusparic
OffRL
48
11
0
17 Dec 2021
Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks
A. Mazayev
F. Al-Tam
N. Correia
77
5
0
07 Dec 2021
Energy-Efficient Design for a NOMA assisted STAR-RIS Network with Deep Reinforcement Learning
Yiyu Guo
Fang Fang
Donghong Cai
Z. Ding
30
40
0
30 Nov 2021
Edge Artificial Intelligence for 6G: Vision, Enabling Technologies, and Applications
Khaled B. Letaief
Yuanming Shi
Jianmin Lu
Jianhua Lu
99
434
0
24 Nov 2021
Blockchain-based Recommender Systems: Applications, Challenges and Future Opportunities
Yassine Himeur
A. Sayed
A. Alsalemi
F. Bensaali
Abbes Amira
Iraklis Varlamis
Magdalini Eirinaki
Christos Sardianos
G. Dimitrakopoulos
73
86
0
22 Nov 2021
Low Precision Decentralized Distributed Training over IID and non-IID Data
Sai Aparna Aketi
Sangamesh Kodge
Kaushik Roy
MQ
26
9
0
17 Nov 2021
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
Qiyue Yin
Jun Yang
Kaiqi Huang
Meijing Zhao
Wancheng Ni
Bin Liang
Yan Huang
Shu Wu
Liangsheng Wang
61
21
0
15 Nov 2021
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Ahmed Khalil
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
AI4TS
70
38
0
12 Nov 2021
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks
Giacomo Arcieri
David Wölfle
Eleni Chatzi
OffRL
104
5
0
25 Oct 2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
90
49
0
20 Oct 2021
Safe Reinforcement Learning Using Robust Control Barrier Functions
Y. Emam
Gennaro Notomista
Paul Glotfelter
Z. Kira
M. Egerstedt
OffRL
71
42
0
11 Oct 2021
Deep Reinforcement Learning for Decentralized Multi-Robot Exploration With Macro Actions
A. H. Tan
Federico Pizarro Bejarano
Yuhan Zhu
Richard Ren
G. Nejat
97
33
0
05 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
135
60
0
28 Sep 2021
Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes
Satpreet H. Singh
F. V. Breugel
Rajesh P. N. Rao
Bingni W. Brunton
103
4
0
25 Sep 2021
ACReL: Adversarial Conditional value-at-risk Reinforcement Learning
Mathieu Godbout
M. Heuillet
Sharath Chandra
R. Bhati
Audrey Durand
64
1
0
20 Sep 2021
OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework
Dengsheng Chen
Vince Tan
Zhi-Wei Lu
Jie Hu
FedML
64
32
0
16 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
189
60
0
13 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
Changyin Sun
Zicheng He
Chunwei Song
Changyin Sun
108
59
0
31 Aug 2021
Communication-Computation Efficient Device-Edge Co-Inference via AutoML
Xinjie Zhang
Jiawei Shao
Yuyi Mao
Jun Zhang
66
8
0
30 Aug 2021
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems
Raphael Lamprecht
Ferdinand Wurst
Marco F. Huber
42
3
0
27 Aug 2021
Self-optimizing adaptive optics control with Reinforcement Learning for high-contrast imaging
Rico Landman
S. Haffert
V. M. Radhakrishnan
C. Keller
54
28
0
24 Aug 2021
Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning
Sooyoung Jang
Hyungil Kim
55
5
0
24 Aug 2021
MimicBot: Combining Imitation and Reinforcement Learning to win in Bot Bowl
Nicola Pezzotti
63
1
0
21 Aug 2021
Explainable Deep Reinforcement Learning Using Introspection in a Non-episodic Task
Angel Ayala
Francisco Cruz
Bruno José Torres Fernandes
Richard Dazeley
47
6
0
18 Aug 2021
Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay
Tianhong Dai
Hengyan Liu
Kai Arulkumaran
Guangyu Ren
Anil Anthony Bharath
70
11
0
17 Aug 2021
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Lukas Brunke
Melissa Greeff
Adam W. Hall
Zhaocong Yuan
Siqi Zhou
Jacopo Panerati
Angela P. Schoellig
OffRL
70
637
0
13 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
57
3
0
10 Aug 2021
Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey
A. Abdellatif
N. Mhaisen
Z. Chkirbene
Amr M. Mohamed
A. Erbad
Mohsen Guizani
OffRL
AI4TS
65
23
0
05 Aug 2021
Advances in Trajectory Optimization for Space Vehicle Control
Danylo Malyuta
Yue Yu
Purnanand Elango
Behçet Açikmese
103
123
0
05 Aug 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
59
9
0
04 Aug 2021
A Reinforcement Learning Approach for Scheduling in mmWave Networks
M. Dogan
Yahya H. Ezzeldin
Christina Fragouli
Addison W. Bohannon
44
10
0
01 Aug 2021
A reinforcement learning approach to resource allocation in genomic selection
Saba Moeinizade
Guiping Hu
Lizhi Wang
60
15
0
22 Jul 2021
Constrained Policy Gradient Method for Safe and Fast Reinforcement Learning: a Neural Tangent Kernel Based Approach
B. Varga
Balázs Kulcsár
M. Chehreghani
77
1
0
19 Jul 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
116
97
0
29 Jun 2021
A Comprehensive Survey of Incentive Mechanism for Federated Learning
Rongfei Zeng
Chaobing Zeng
Xingwei Wang
Yue Liu
Xiaowen Chu
FedML
80
101
0
27 Jun 2021
A Survey on Human-aware Robot Navigation
Ronja Möller
Antonino Furnari
Sebastiano Battiato
Aki Härmä
G. Farinella
134
89
0
22 Jun 2021
Learning Knowledge Graph-based World Models of Textual Environments
Prithviraj Ammanabrolu
Mark O. Riedl
3DV
102
32
0
17 Jun 2021
Modeling Worlds in Text
Prithviraj Ammanabrolu
Mark O. Riedl
VGen
LM&Ro
63
14
0
17 Jun 2021
Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number
Francesco Borra
Luca Biferale
M. Cencini
A. Celani
119
22
0
16 Jun 2021
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning
Yonggan Fu
Yongan Zhang
Chaojian Li
Zhongzhi Yu
Yingyan Lin
52
6
0
11 Jun 2021
An overview of deep learning techniques for epileptic seizures detection and prediction based on neuroimaging modalities: Methods, challenges, and future works
A. Shoeibi
Parisa Moridian
Marjane Khodatars
Navid Ghassemi
M. Jafari
...
Juan M Gorriz
Javier Ramírez
Abbas Khosravi
S. Nahavandi
U. Acharya
87
54
0
29 May 2021
Previous
1
2
3
...
7
8
9
...
11
12
13
Next