Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.05866
Cited By
v1
v2 (latest)
A Brief Survey of Deep Reinforcement Learning
19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Brief Survey of Deep Reinforcement Learning"
50 / 604 papers shown
Title
A Survey of Learning on Small Data: Generalization, Optimization, and Challenge
Xiaofeng Cao
Weixin Bu
Sheng-Jun Huang
Minling Zhang
Ivor W. Tsang
Yew-Soon Ong
James T. Kwok
99
1
0
29 Jul 2022
Driver Dojo: A Benchmark for Generalizable Reinforcement Learning for Autonomous Driving
Sebastian Rietsch
S. Huang
G. Kontes
Axel Plinge
Christopher Mutschler
OOD
OffRL
78
5
0
23 Jul 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
85
14
0
22 Jul 2022
Controllable Data Generation by Deep Learning: A Review
Shiyu Wang
Yuanqi Du
Xiaojie Guo
Bo Pan
Zhaohui Qin
Liang Zhao
99
28
0
19 Jul 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
73
3
0
18 Jul 2022
Scaling up ML-based Black-box Planning with Partial STRIPS Models
M. Greco
Álvaro Torralba
Jorge A. Baier
Héctor Palacios
OffRL
27
0
0
10 Jul 2022
A Deep Model for Partial Multi-Label Image Classification with Curriculum Based Disambiguation
Feng Sun
Ming-Kun Xie
Sheng-Jun Huang
51
7
0
06 Jul 2022
Tackling Real-World Autonomous Driving using Deep Reinforcement Learning
Paolo Maramotti
Alessandro Paolo Capasso
Giulio Bacchiani
A. Broggi
60
11
0
05 Jul 2022
Action-modulated midbrain dopamine activity arises from distributed control policies
Jack W Lindsey
Ashok Litwin-Kumar
MLAU
51
12
0
01 Jul 2022
A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers
Julio A. Placed
Jared Strader
Henry Carrillo
Nikolay Atanasov
Vadim Indelman
Luca Carlone
J. A. Castellanos
132
191
0
01 Jul 2022
Traffic Management of Autonomous Vehicles using Policy Based Deep Reinforcement Learning and Intelligent Routing
Anum Mushtaq
I. Haq
M. A. Sarwar
Asifullah Khan
Omair Shafiq
24
4
0
28 Jun 2022
An Energy and Carbon Footprint Analysis of Distributed and Federated Learning
S. Savazzi
V. Rampa
Sanaz Kianoush
M. Bennis
69
46
0
21 Jun 2022
Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated Information
Yuanjie Yan
Suorong Yang
Yan Wang
Jian Zhao
S. Furao
57
0
0
21 Jun 2022
Reinforcement Learning for Economic Policy: A New Frontier?
C. Tilbury
OffRL
82
3
0
16 Jun 2022
FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks
Kaituo Feng
Changsheng Li
Ye Yuan
Guoren Wang
107
35
0
14 Jun 2022
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
Qinbo Bai
Amrit Singh Bedi
Vaneet Aggarwal
100
24
0
12 Jun 2022
Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy
Hongzhi Hua
Kaigui Wu
Guixuan Wen
33
0
0
10 Jun 2022
Dap-FL: Federated Learning flourishes by adaptive tuning and secure aggregation
Qian Chen
Zilong Wang
Jiawei Chen
Haonan Yan
Xiaodong Lin
FedML
53
17
0
08 Jun 2022
Deep Reinforcement Learning for Cybersecurity Threat Detection and Protection: A Review
Mohit Sewak
S. K. Sahay
Hemant Rathore
AAML
51
27
0
06 Jun 2022
Neuro-Nav: A Library for Neurally-Plausible Reinforcement Learning
Arthur Juliani
Samuel A. Barnett
Brandon Davis
Margaret E. Sereno
Ida Momennejad
OffRL
65
10
0
06 Jun 2022
Learning Generalized Wireless MAC Communication Protocols via Abstraction
Luciano Miuccio
Salvatore Riolo
S. Samarakoon
D. Panno
M. Bennis
58
17
0
06 Jun 2022
Optimizing Objective Functions from Trained ReLU Neural Networks via Sampling
Georgia Perakis
Asterios Tsiourvas
67
11
0
27 May 2022
Meta Policy Learning for Cold-Start Conversational Recommendation
Zhendong Chu
Hongning Wang
Yun Xiao
Bo Long
Lingfei Wu
OffRL
96
35
0
24 May 2022
Improving Short Text Classification With Augmented Data Using GPT-3
Salvador Balkus
Donghui Yan
61
37
0
23 May 2022
Modeling Human Behavior Part I -- Learning and Belief Approaches
Andrew Fuchs
A. Passarella
M. Conti
83
7
0
13 May 2022
VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation
Yuanwei Bi
Zhongliang Jiang
Yuan Gao
T. Wendler
A. Karlas
Nassir Navab
48
35
0
10 May 2022
Disturbance-Injected Robust Imitation Learning with Task Achievement
Hirotaka Tahara
Hikaru Sasaki
Hanbit Oh
B. Michael
Takamitsu Matsubara
85
9
0
09 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
80
1
0
06 May 2022
Using Deep Reinforcement Learning to solve Optimal Power Flow problem with generator failures
Muhammad Awais
36
0
0
04 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
97
365
0
02 May 2022
HyperNCA: Growing Developmental Networks with Neural Cellular Automata
Elias Najarro
Shyam Sudhakaran
Claire Glanois
S. Risi
90
16
0
25 Apr 2022
Deep Reinforcement Learning Using a Low-Dimensional Observation Filter for Visual Complex Video Game Playing
V. A. Kich
J. C. Jesus
Ricardo B. Grando
A. H. Kolling
Gabriel V. Heisler
R. S. Guerra
OffRL
37
2
0
24 Apr 2022
Adaptive Online Value Function Approximation with Wavelets
Michael Beukman
Michael Mitchley
Dean S. Wookey
Steven D. James
George Konidaris
49
1
0
22 Apr 2022
Efficient and practical quantum compiler towards multi-qubit systems with deep reinforcement learning
Qiuhao Chen
Yuxuan Du
Qi Zhao
Yuliang Jiao
Xiliang Lu
Xingyao Wu
59
13
0
14 Apr 2022
Reinforcement learning on graphs: A survey
Mingshuo Nie
Dongming Chen
Dongqi Wang
110
51
0
13 Apr 2022
On Improving Cross-dataset Generalization of Deepfake Detectors
Aakash Varma Nadimpalli
A. Rattani
CVBM
63
44
0
08 Apr 2022
Distributed Reinforcement Learning for Robot Teams: A Review
Yutong Wang
Mehul Damani
Pamela Wang
Yuhong Cao
Guillaume Sartoretti
118
22
0
07 Apr 2022
Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval
A. Bhunia
Subhadeep Koley
Abdullah Faiz Ur Rahman Khilji
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
AAML
80
44
0
28 Mar 2022
Deep reinforcement learning guided graph neural networks for brain network analysis
Xusheng Zhao
Hongzhi Zhang
Hao Peng
Amin Beheshti
Jessica J. M. Monaghan
...
Mark Dras
Qiong Dai
Yangyang Li
Philip S. Yu
Lifang He
GNN
107
47
0
18 Mar 2022
GAC: A Deep Reinforcement Learning Model Toward User Incentivization in Unknown Social Networks
Shiqing Wu
Weihua Li
Quan-wei Bai
GNN
63
11
0
17 Mar 2022
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems
Rafael Figueiredo Prudencio
Marcos R. O. A. Máximo
Esther Luna Colombini
OffRL
111
244
0
02 Mar 2022
Efficient Task Allocation in Smart Warehouses with Multi-delivery Stations and Heterogeneous Robots
George S. Oliveira
J. Röning
P. Plentz
J. Carvalho
21
7
0
28 Feb 2022
Multi-fidelity reinforcement learning framework for shape optimization
Sahil Bhola
Suraj Pawar
Prasanna Balaprakash
R. Maulik
AI4CE
71
24
0
22 Feb 2022
A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles
Jingyi Xu
Zirui Li
Li Gao
Junyi Ma
Qi Liu
Yanan Zhao
45
14
0
22 Feb 2022
Cooperative Artificial Intelligence
T. Baumann
34
0
0
20 Feb 2022
Communication-Efficient Consensus Mechanism for Federated Reinforcement Learning
Xing Xu
Rongpeng Li
Zhifeng Zhao
Honggang Zhang
FedML
64
6
0
30 Jan 2022
Hyperparameter Tuning for Deep Reinforcement Learning Applications
M. Kiran
Melis Ozyildirim
141
22
0
26 Jan 2022
Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching
Hanchen Wang
Ying Zhang
Lu Qin
Wei Wang
Weinan Zhang
Xuemin Lin
67
18
0
25 Jan 2022
Scientific Machine Learning through Physics-Informed Neural Networks: Where we are and What's next
S. Cuomo
Vincenzo Schiano Di Cola
F. Giampaolo
G. Rozza
Maizar Raissi
F. Piccialli
PINN
138
1,306
0
14 Jan 2022
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics
Swagat Kumar
Hayden Sampson
Ardhendu Behera
38
0
0
11 Jan 2022
Previous
1
2
3
...
6
7
8
...
11
12
13
Next