Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.05866
Cited By
v1
v2 (latest)
A Brief Survey of Deep Reinforcement Learning
19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Brief Survey of Deep Reinforcement Learning"
50 / 604 papers shown
Title
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
109
5
0
03 Aug 2023
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation
Siao Liu
Zhaoyu Chen
Yang Liu
Yuzheng Wang
Dingkang Yang
...
Ziqing Zhou
Xie Yi
Wei Li
Wenqiang Zhang
Zhongxue Gan
118
24
0
02 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
64
20
0
31 Jul 2023
Using Implicit Behavior Cloning and Dynamic Movement Primitive to Facilitate Reinforcement Learning for Robot Motion Planning
Zengjie Zhang
Jayden Hong
Amir M. Soufi Enayati
Homayoun Najjaran
78
3
0
29 Jul 2023
Improvable Gap Balancing for Multi-Task Learning
Yanqi Dai
Nanyi Fei
Zhiwu Lu
77
5
0
28 Jul 2023
Adaptive Control of Resource Flow to Optimize Construction Work and Cash Flow via Online Deep Reinforcement Learning
Can Jiang
Xin Li
Jianpeng Lin
Ming-Yuan Liu
Zhiliang Ma
AI4CE
33
20
0
20 Jul 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
108
21
0
12 Jul 2023
Loss Dynamics of Temporal Difference Reinforcement Learning
Blake Bordelon
P. Masset
Henry Kuo
Cengiz Pehlevan
AI4CE
60
0
0
10 Jul 2023
Shared Growth of Graph Neural Networks via Prompted Free-direction Knowledge Distillation
Kaituo Feng
Yikun Miao
Changsheng Li
Ye Yuan
Guoren Wang
121
0
0
02 Jul 2023
Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery
Xiao Zhang
Hai Zhang
Hongtu Zhou
Chang Huang
Di Zhang
Chen Ye
Junqiao Zhao
OffRL
85
5
0
24 Jun 2023
Multi-market Energy Optimization with Renewables via Reinforcement Learning
Lucien Werner
Peeyush Kumar
35
5
0
13 Jun 2023
UAV Trajectory and Multi-User Beamforming Optimization for Clustered Users Against Passive Eavesdropping Attacks With Unknown CSI
Aly Sabri Abdalla
Ali Behfarnia
Vuk Marojevic
43
9
0
11 Jun 2023
A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence
Kexuan Wang
An Liu
Baishuo Liu
58
1
0
10 Jun 2023
Safety of autonomous vehicles: A survey on Model-based vs. AI-based approaches
Dimia Iberraken
Lounis Adouane
80
2
0
29 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
173
844
0
25 May 2023
Automatic Design Method of Building Pipeline Layout Based on Deep Reinforcement Learning
Chen Yang
Zhe Zheng
Jiali Lin
AI4CE
23
1
0
18 May 2023
S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning
R. Dutta
Qinchen Wang
Ankur Singh
Dhruv Kumarjiguda
Xiaoli Li
Senthilnath Jayavelu
39
2
0
12 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
74
4
0
09 May 2023
Adaptive Learning Path Navigation Based on Knowledge Tracing and Reinforcement Learning
Jyun-Yi Chen
Saeed Saeedvand
I-Wei Lai
46
2
0
08 May 2023
Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Alexander Herzog
Kanishka Rao
Karol Hausman
Yao Lu
Paul Wohlhart
...
Noah Brown
Mrinal Kalakrishnan
Julian Ibarz
P. Pastor
Sergey Levine
OffRL
93
27
0
05 May 2023
Data-driven Knowledge Fusion for Deep Multi-instance Learning
Yu-Xuan Zhang
Zhengchun Zhou
Xingxing He
A. R. Adhikary
Bapi Dutta
60
1
0
24 Apr 2023
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
Chao Yu
Xuejing Zheng
H. Zhuo
OffRL
LRM
134
8
0
24 Apr 2023
Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation
Edgardo Solano-Carrillo
Jannis Stoppe
51
0
0
19 Apr 2023
Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach
Rishi Hazra
Luc de Raedt
NAI
73
9
0
17 Apr 2023
Integration of Reinforcement Learning Based Behavior Planning With Sampling Based Motion Planning for Automated Driving
Marvin Klimke
Benjamin Völz
M. Buchholz
48
6
0
17 Apr 2023
MLOps Spanning Whole Machine Learning Life Cycle: A Survey
Fang Zhengxin
Yuan Yi
Zhang Jingyu
Liu Yue
Mu Yuechen
...
Xu Xiwei
Wang Jeff
Wang Chen
Zhang Shuai
Chen Shiping
54
4
0
13 Apr 2023
Neural Network Algorithm for Intercepting Targets Moving Along Known Trajectories by a Dubins' Car
Ivan Nasonov
A. Galyaev
Andrey V. Medvedev
57
0
0
12 Apr 2023
Tracker: Model-based Reinforcement Learning for Tracking Control of Human Finger Attached with Thin McKibben Muscles
Daichi Saito
Eri Nagatomo
Jefferson Pardomuan
Hideki Koike
48
0
0
01 Apr 2023
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization
M. Chadi
H. Mousannif
OffRL
43
4
0
31 Mar 2023
Quantum Deep Hedging
El Amine Cherrat
S. Raj
Iordanis Kerenidis
Abhishek Shekhar
Ben Wood
...
Pierre Minssen
Ruslan Shaydulin
Yue Sun
Romina Yalovetzky
Marco Pistoia
78
26
0
29 Mar 2023
Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems
Sihan Zeng
Thinh T. Doan
Justin Romberg
OffRL
60
3
0
23 Mar 2023
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning
Qiming Ye
Yuxiang Feng
Jose Javier Escribano Macias
M. Stettler
Panagiotis Angeloudis
44
4
0
22 Mar 2023
Multi-modal reward for visual relationships-based image captioning
Ali Abedi
Hossein Karshenas
Peyman Adibi
131
2
0
19 Mar 2023
Energy-Efficient Cellular-Connected UAV Swarm Control Optimization
Yang Su
Hui Zhou
Yansha Deng
Mischa Dohler
28
5
0
18 Mar 2023
Deep incremental learning models for financial temporal tabular datasets with distribution shifts
Thomas Wong
Mauricio Barahona
OOD
AIFin
AI4TS
151
0
0
14 Mar 2023
Enhancing MAP-Elites with Multiple Parallel Evolution Strategies
Manon Flageat
Bryan Lim
Antoine Cully
67
2
0
10 Mar 2023
RACCER: Towards Reachable and Certain Counterfactual Explanations for Reinforcement Learning
Jasmina Gajcin
Ivana Dusparic
CML
48
4
0
08 Mar 2023
A Strategy-Oriented Bayesian Soft Actor-Critic Model
Qin Yang
Ramviyas Parasuraman
73
8
0
07 Mar 2023
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
91
41
0
05 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement
Petr Bobák
Ladislav Čmolík
Martin Cadík
OffRL
68
3
0
02 Mar 2023
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments
Tung Thai
Mingyu Shen
M. Garg
Ayush Kalani
Nakul Vaidya
...
Neeraj Varshney
Chitta Baral
Subbarao Kambhampati
Jivko Sinapov
matthias. scheutz
77
0
0
28 Feb 2023
Diffusion Model-Augmented Behavioral Cloning
Shangcheng Chen
Hsiang-Chun Wang
Ming-Hao Hsu
Chun-Mao Lai
Shao-Hua Sun
DiffM
150
31
0
26 Feb 2023
Hierarchical Needs-driven Agent Learning Systems: From Deep Reinforcement Learning To Diverse Strategies
Qin Yang
36
2
0
25 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
69
5
0
17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
50
2
0
17 Feb 2023
Data Driven Reward Initialization for Preference based Reinforcement Learning
Mudit Verma
Subbarao Kambhampati
58
1
0
17 Feb 2023
Revisiting Bellman Errors for Offline Model Selection
Joshua P. Zitovsky
Daniel de Marchi
Rishabh Agarwal
Michael R. Kosorok University of North Carolina at Chapel Hill
OffRL
82
5
0
31 Jan 2023
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
64
1
0
31 Jan 2023
AI Tool for Exploring How Economic Activities Impact Local Ecosystems
Claes Strannegaard
Niklas Engsner
Rasmus Lindgren
Simon Olsson
J. Endler
13
1
0
25 Jan 2023
Evolution of MAC Protocols in the Machine Learning Decade: A Comprehensive Survey
Mostafa Hussien
I. Taj-Eddin
Mohammed F. A. Ahmed
Ali Ranjha
K. Nguyen
M. Cheriet
AI4TS
46
9
0
24 Jan 2023
Previous
1
2
3
4
5
6
...
11
12
13
Next