Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.05866
Cited By
A Brief Survey of Deep Reinforcement Learning
19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Brief Survey of Deep Reinforcement Learning"
50 / 596 papers shown
Title
Multi-market Energy Optimization with Renewables via Reinforcement Learning
Lucien Werner
Peeyush Kumar
16
5
0
13 Jun 2023
UAV Trajectory and Multi-User Beamforming Optimization for Clustered Users Against Passive Eavesdropping Attacks With Unknown CSI
Aly Sabri Abdalla
Ali Behfarnia
Vuk Marojevic
8
8
0
11 Jun 2023
A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence
Kexuan Wang
An Liu
Baishuo Liu
18
1
0
10 Jun 2023
Safety of autonomous vehicles: A survey on Model-based vs. AI-based approaches
Dimia Iberraken
Lounis Adouane
19
1
0
29 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
63
757
0
25 May 2023
Automatic Design Method of Building Pipeline Layout Based on Deep Reinforcement Learning
Chen Yang
Zhe Zheng
Jiali Lin
AI4CE
13
1
0
18 May 2023
S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning
R. Dutta
Qinchen Wang
Ankur Singh
Dhruv Kumarjiguda
Xiaoli Li
Senthilnath Jayavelu
22
2
0
12 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
D. Meger
Doina Precup
30
2
0
09 May 2023
Adaptive Learning Path Navigation Based on Knowledge Tracing and Reinforcement Learning
Jyun-Yi Chen
Saeed Saeedvand
I-Wei Lai
21
2
0
08 May 2023
Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Alexander Herzog
Kanishka Rao
Karol Hausman
Yao Lu
Paul Wohlhart
...
Noah Brown
Mrinal Kalakrishnan
Julian Ibarz
P. Pastor
Sergey Levine
OffRL
41
23
0
05 May 2023
Data-driven Knowledge Fusion for Deep Multi-instance Learning
Yu-Xuan Zhang
Zhengchun Zhou
Xingxing He
A. R. Adhikary
Bapi Dutta
33
1
0
24 Apr 2023
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
Chao Yu
Xuejing Zheng
H. Zhuo
OffRL
LRM
55
7
0
24 Apr 2023
Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation
Edgardo Solano-Carrillo
Jannis Stoppe
18
0
0
19 Apr 2023
Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach
Rishi Hazra
Luc de Raedt
NAI
29
9
0
17 Apr 2023
Integration of Reinforcement Learning Based Behavior Planning With Sampling Based Motion Planning for Automated Driving
Marvin Klimke
Benjamin Völz
M. Buchholz
26
5
0
17 Apr 2023
MLOps Spanning Whole Machine Learning Life Cycle: A Survey
Fang Zhengxin
Yuan Yi
Zhang Jingyu
Liu Yue
Mu Yuechen
...
Xu Xiwei
Wang Jeff
Wang Chen
Zhang Shuai
Chen Shiping
24
4
0
13 Apr 2023
Neural Network Algorithm for Intercepting Targets Moving Along Known Trajectories by a Dubins' Car
Ivan Nasonov
A. Galyaev
Andrey V. Medvedev
28
0
0
12 Apr 2023
Tracker: Model-based Reinforcement Learning for Tracking Control of Human Finger Attached with Thin McKibben Muscles
Daichi Saito
Eri Nagatomo
Jefferson Pardomuan
Hideki Koike
19
0
0
01 Apr 2023
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization
M. Chadi
H. Mousannif
OffRL
21
4
0
31 Mar 2023
Quantum Deep Hedging
El Amine Cherrat
S. Raj
Iordanis Kerenidis
Abhishek Shekhar
Ben Wood
...
Pierre Minssen
Ruslan Shaydulin
Yue Sun
Romina Yalovetzky
Marco Pistoia
26
25
0
29 Mar 2023
Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems
Sihan Zeng
Thinh T. Doan
Justin Romberg
OffRL
27
3
0
23 Mar 2023
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning
Qiming Ye
Yuxiang Feng
Jose Javier Escribano Macias
M. Stettler
Panagiotis Angeloudis
16
4
0
22 Mar 2023
Multi-modal reward for visual relationships-based image captioning
Ali Abedi
Hossein Karshenas
Peyman Adibi
44
2
0
19 Mar 2023
Energy-Efficient Cellular-Connected UAV Swarm Control Optimization
Yang Su
Hui Zhou
Yansha Deng
Mischa Dohler
6
5
0
18 Mar 2023
Deep incremental learning models for financial temporal tabular datasets with distribution shifts
Thomas Wong
Mauricio Barahona
OOD
AIFin
AI4TS
18
0
0
14 Mar 2023
Enhancing MAP-Elites with Multiple Parallel Evolution Strategies
Manon Flageat
Bryan Lim
Antoine Cully
27
2
0
10 Mar 2023
RACCER: Towards Reachable and Certain Counterfactual Explanations for Reinforcement Learning
Jasmina Gajcin
Ivana Dusparic
CML
32
3
0
08 Mar 2023
A Strategy-Oriented Bayesian Soft Actor-Critic Model
Qin Yang
Ramviyas Parasuraman
11
8
0
07 Mar 2023
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
46
39
0
05 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement
Petr Bobák
Ladislav Čmolík
Martin Cadík
OffRL
32
3
0
02 Mar 2023
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments
Tung Thai
Mingyu Shen
M. Garg
Ayush Kalani
Nakul Vaidya
...
Neeraj Varshney
Chitta Baral
Subbarao Kambhampati
Jivko Sinapov
matthias. scheutz
34
0
0
28 Feb 2023
Diffusion Model-Augmented Behavioral Cloning
Shangcheng Chen
Hsiang-Chun Wang
Ming-Hao Hsu
Chun-Mao Lai
Shao-Hua Sun
DiffM
55
31
0
26 Feb 2023
Hierarchical Needs-driven Agent Learning Systems: From Deep Reinforcement Learning To Diverse Strategies
Qin Yang
17
2
0
25 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
37
4
0
17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
33
2
0
17 Feb 2023
Data Driven Reward Initialization for Preference based Reinforcement Learning
Mudit Verma
Subbarao Kambhampati
35
1
0
17 Feb 2023
Revisiting Bellman Errors for Offline Model Selection
Joshua P. Zitovsky
Daniel de Marchi
Rishabh Agarwal
Michael R. Kosorok University of North Carolina at Chapel Hill
OffRL
32
5
0
31 Jan 2023
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
34
1
0
31 Jan 2023
AI Tool for Exploring How Economic Activities Impact Local Ecosystems
Claes Strannegaard
Niklas Engsner
Rasmus Lindgren
Simon Olsson
J. Endler
6
0
0
25 Jan 2023
Evolution of MAC Protocols in the Machine Learning Decade: A Comprehensive Survey
Mostafa Hussien
I. Taj-Eddin
Mohammed F. A. Ahmed
Ali Ranjha
K. Nguyen
M. Cheriet
AI4TS
12
8
0
24 Jan 2023
RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm
Hamid Gharagozlou
J. Mohammadzadeh
A. Bastanfard
S. S. Ghidary
8
34
0
07 Jan 2023
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion Detection
Caroline Strickland
Chandrika Saha
Muhammad Zakar
Sareh Nejad
Noshin Tasnim
D. Lizotte
Anwar Haque
26
10
0
05 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health
Yuhe Gao
C. Shi
R. Song
32
0
0
03 Jan 2023
New Challenges in Reinforcement Learning: A Survey of Security and Privacy
Yunjiao Lei
Dayong Ye
Sheng Shen
Yulei Sui
Tianqing Zhu
Wanlei Zhou
33
18
0
31 Dec 2022
A Mapping of Assurance Techniques for Learning Enabled Autonomous Systems to the Systems Engineering Lifecycle
Christian Ellis
Maggie B. Wigness
L. Fiondella
40
1
0
30 Dec 2022
Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games
Chun Kai Ling
J. Zico Kolter
Fei Fang
35
0
0
29 Dec 2022
Discovering Efficient Periodic Behaviours in Mechanical Systems via Neural Approximators
Yannik P. Wotte
Sven Dummer
N. Botteghi
C. Brune
Stefano Stramigioli
Federico Califano
36
5
0
29 Dec 2022
Investigation of reinforcement learning for shape optimization of profile extrusion dies
C. Fricke
D. Wolff
Marco Kemmerling
S. Elgeti
OffRL
6
4
0
23 Dec 2022
Robust Path Selection in Software-defined WANs using Deep Reinforcement Learning
Shahrooz Pouryousef
Lixin Gao
Don Towsley
17
1
0
21 Dec 2022
A Memetic Algorithm with Reinforcement Learning for Sociotechnical Production Scheduling
Felix Grumbach
Nour Eldin Alaa Badr
Pascal Reusch
Sebastian Trojahn
6
6
0
21 Dec 2022
Previous
1
2
3
4
5
6
...
10
11
12
Next