ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.05866
  4. Cited By
A Brief Survey of Deep Reinforcement Learning
v1v2 (latest)

A Brief Survey of Deep Reinforcement Learning

19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
    OffRL
ArXiv (abs)PDFHTML

Papers citing "A Brief Survey of Deep Reinforcement Learning"

50 / 604 papers shown
Title
Semi-supervised learning via DQN for log anomaly detection
Semi-supervised learning via DQN for log anomaly detection
Yingying He
Xiaobing Pei
Lihong Shen
70
1
0
06 Jan 2024
Deep Reinforcement Learning for Local Path Following of an Autonomous
  Formula SAE Vehicle
Deep Reinforcement Learning for Local Path Following of an Autonomous Formula SAE Vehicle
Harvey Merton
Thomas Delamore
Karl Stol
Henry Williams
45
0
0
05 Jan 2024
Learning-based agricultural management in partially observable
  environments subject to climate variability
Learning-based agricultural management in partially observable environments subject to climate variability
Zhaoan Wang
Shaoping Xiao
Junchao Li
Jun Wang
25
3
0
02 Jan 2024
On the Burstiness of Distributed Machine Learning Traffic
On the Burstiness of Distributed Machine Learning Traffic
Natchanon Luangsomboon
Fahimeh Fazel
Jorg Liebeherr
A. Sobhani
Shichao Guan
Xingjun Chu
68
2
0
30 Dec 2023
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic
  Tensor Selection
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection
Kai Huang
Boyuan Yang
Wei Gao
93
21
0
21 Dec 2023
OpenRL: A Unified Reinforcement Learning Framework
OpenRL: A Unified Reinforcement Learning Framework
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
81
3
0
20 Dec 2023
LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic
  Memory Enhancement
LDM2^22: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Xingjin Wang
Linjing Li
D. Zeng
55
0
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRLOOD
188
5
0
13 Dec 2023
Ensemble Kalman Filtering Meets Gaussian Process SSM for Non-Mean-Field
  and Online Inference
Ensemble Kalman Filtering Meets Gaussian Process SSM for Non-Mean-Field and Online Inference
Zhidi Lin
Yiyong Sun
Feng Yin
Alexandre Thiéry
84
4
0
10 Dec 2023
Creative Agents: Empowering Agents with Imagination for Creative Tasks
Creative Agents: Empowering Agents with Imagination for Creative Tasks
Chi Zhang
Penglin Cai
Yuhui Fu
Haoqi Yuan
Zongqing Lu
LM&RoLLMAG
129
24
0
05 Dec 2023
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse
  Catalysts Design
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse Catalysts Design
Romain Lacombe
Lucas Hendren
Khalid El-Awady
38
2
0
04 Dec 2023
Atlas: Hybrid Cloud Migration Advisor for Interactive Microservices
Atlas: Hybrid Cloud Migration Advisor for Interactive Microservices
Ka-Ho Chow
Umesh Deshpande
Veera Deenadayalan
S. Seshadri
Ling Liu
37
3
0
12 Nov 2023
Model-assisted Reinforcement Learning of a Quadrotor
Model-assisted Reinforcement Learning of a Quadrotor
Arshad Javeed
77
0
0
12 Nov 2023
An Intelligent Social Learning-based Optimization Strategy for Black-box
  Robotic Control with Reinforcement Learning
An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning
Xubo Yang
Jian Gao
Ting Wang
Yaozhen He
55
0
0
11 Nov 2023
FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts
FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts
Yichen Gong
Delong Ran
Jinyuan Liu
Conglei Wang
Tianshuo Cong
Anyu Wang
Sisi Duan
Xiaoyun Wang
MLLM
235
161
0
09 Nov 2023
QOCO: A QoE-Oriented Computation Offloading Algorithm based on Deep
  Reinforcement Learning for Mobile Edge Computing
QOCO: A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing
Iman Rahmati
Hamed Shah-Mansouri
Ali Movaghar
23
1
0
04 Nov 2023
Accelerating Reinforcement Learning of Robotic Manipulations via
  Feedback from Large Language Models
Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models
Kun-Mo Chu
Xufeng Zhao
C. Weber
Mengdi Li
Stefan Wermter
LLMAGLM&Ro
87
15
0
04 Nov 2023
Agent-based Modelling of Credit Card Promotions
Agent-based Modelling of Credit Card Promotions
Conor B. Hamill
Raad Khraishi
Simona Gherghel
Jerrard Lawrence
Salvatore Mercuri
Ramin Okhrati
Greig A. Cowan
58
1
0
03 Nov 2023
Diffusion Models for Reinforcement Learning: A Survey
Diffusion Models for Reinforcement Learning: A Survey
Zhengbang Zhu
Hanye Zhao
Haoran He
Yichao Zhong
Shenyu Zhang
Haoquan Guo
Tingting Chen
Weinan Zhang
160
68
0
02 Nov 2023
EconAgent: Large Language Model-Empowered Agents for Simulating
  Macroeconomic Activities
EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities
Nian Li
Chen Gao
Mingyu Li
Yong Li
Qingmin Liao
LLMAGAI4CE
121
82
0
16 Oct 2023
Leveraging Knowledge Distillation for Efficient Deep Reinforcement
  Learning in Resource-Constrained Environments
Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained Environments
Guanlin Meng
40
1
0
16 Oct 2023
Deep Reinforcement Learning with Explicit Context Representation
Deep Reinforcement Learning with Explicit Context Representation
Francisco Munguia-Galeano
Ah-Hwee Tan
Ze Ji
OffRL
75
2
0
15 Oct 2023
PAGE: Equilibrate Personalization and Generalization in Federated
  Learning
PAGE: Equilibrate Personalization and Generalization in Federated Learning
Qian Chen
Zilong Wang
Jiaqi Hu
Haonan Yan
Jianying Zhou
Xiao-La Lin
FedML
84
4
0
13 Oct 2023
Deep reinforcement learning for machine scheduling: Methodology, the
  state-of-the-art, and future directions
Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi
Todd Charter
Marjan Yaghoubi
Masoud Jalayer
Maryam Ahang
Ardeshir Shojaeinasab
Homayoun Najjaran
73
12
0
04 Oct 2023
Algebras of actions in an agent's representations of the world
Algebras of actions in an agent's representations of the world
Alexander Dean
Eduardo Alonso
Esther Mondragón
65
0
0
02 Oct 2023
Consistency Models as a Rich and Efficient Policy Class for
  Reinforcement Learning
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Daoce Wang
Chi Jin
OffRLDiffM
94
35
0
29 Sep 2023
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Mikihisa Yuasa
Huy T. Tran
R. Sreenivas
FAttLRM
151
1
0
29 Sep 2023
Raijū: Reinforcement Learning-Guided Post-Exploitation for Automating
  Security Assessment of Network Systems
Raijū: Reinforcement Learning-Guided Post-Exploitation for Automating Security Assessment of Network Systems
V. Pham
Hien Do Hoang
Phan Thanh Trung
Van Dinh Quoc
T. To
Phan The Duy
43
0
0
27 Sep 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence
  Agents
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
99
24
0
23 Sep 2023
A Machine Learning-oriented Survey on Tiny Machine Learning
A Machine Learning-oriented Survey on Tiny Machine Learning
Luigi Capogrosso
Federico Cunico
D. Cheng
Franco Fummi
Marco Cristani
SyDaMU
106
45
0
21 Sep 2023
Multicopy Reinforcement Learning Agents
Multicopy Reinforcement Learning Agents
Alicia P. Wolfe
Oliver Diamond
Brigitte Goeler-Slough
Remi Feuerman
Magdalena Kisielinska
Victoria Manfredi
127
0
0
19 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
103
9
0
18 Sep 2023
Stable In-hand Manipulation with Finger Specific Multi-agent Shadow
  Reward
Stable In-hand Manipulation with Finger Specific Multi-agent Shadow Reward
Lingfeng Tao
Jiucai Zhang
Xiaoli Zhang
56
0
0
13 Sep 2023
Compositional Learning of Visually-Grounded Concepts Using Reinforcement
Compositional Learning of Visually-Grounded Concepts Using Reinforcement
Zijun Lin
Haidi Azaman
M Ganesh Kumar
Cheston Tan
CoGeOffRL
74
3
0
08 Sep 2023
Ensemble DNN for Age-of-Information Minimization in UAV-assisted
  Networks
Ensemble DNN for Age-of-Information Minimization in UAV-assisted Networks
M. Ndiaye
El Houcine Bergou
Hajar Elhammouti
51
1
0
06 Sep 2023
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
Ophir M. Carmel
Guy Katz
67
0
0
06 Sep 2023
Efficient RL via Disentangled Environment and Agent Representations
Efficient RL via Disentangled Environment and Agent Representations
Kevin Gmelin
Shikhar Bahl
Russell Mendonca
Deepak Pathak
DRL
71
9
0
05 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAIOffRL
90
17
0
02 Sep 2023
Deep Reinforcement Learning in Surgical Robotics: Enhancing the
  Automation Level
Deep Reinforcement Learning in Surgical Robotics: Enhancing the Automation Level
Cheng Qian
Hongliang Ren
83
4
0
02 Sep 2023
The AI Revolution: Opportunities and Challenges for the Finance Sector
The AI Revolution: Opportunities and Challenges for the Finance Sector
Carsten Maple
Lukasz Szpruch
Gregory Epiphaniou
Kalina S. Staykova
Simran Singh
William Penwarden
Yisi Wen
Zijian Wang
Jagdish Hariharan
Pavle Avramović
AIFin
108
36
0
31 Aug 2023
Iterative Reward Shaping using Human Feedback for Correcting Reward
  Misspecification
Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification
Jasmina Gajcin
J. McCarthy
Rahul Nair
Radu Marinescu
Elizabeth M. Daly
Ivana Dusparic
92
3
0
30 Aug 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous
  Robotics
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
115
7
0
29 Aug 2023
Traffic Light Control with Reinforcement Learning
Traffic Light Control with Reinforcement Learning
Tao Pan
76
3
0
28 Aug 2023
Racing Towards Reinforcement Learning based control of an Autonomous
  Formula SAE Car
Racing Towards Reinforcement Learning based control of an Autonomous Formula SAE Car
Aakaash Salvaji
Harry Taylor
David Valencia
Trevor Gee
Henry Williams
35
3
0
24 Aug 2023
An Intentional Forgetting-Driven Self-Healing Method For Deep
  Reinforcement Learning Systems
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems
Ahmed Haj Yahmed
Rached Bouchoucha
Houssem Ben Braiek
Foutse Khomh
CLLAI4CE
66
0
0
23 Aug 2023
Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges
Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges
Ahmed Haj Yahmed
Altaf Allah Abbassi
Amin Nikanjam
Heng Li
Foutse Khomh
OffRL
72
5
0
23 Aug 2023
Prompt-Based Length Controlled Generation with Reinforcement Learning
Prompt-Based Length Controlled Generation with Reinforcement Learning
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
80
10
0
23 Aug 2023
Artificial Intelligence for Smart Transportation
Artificial Intelligence for Smart Transportation
Michael Wilbur
Amutheezan Sivagnanam
Afiya Ayman
Samitha Samaranayeke
Abhishek Dubey
Aron Laszka
AI4TS
53
2
0
14 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture:
  Basics, Opportunities, and Challenges
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
130
3
0
13 Aug 2023
Learning Team-Based Navigation: A Review of Deep Reinforcement Learning
  Techniques for Multi-Agent Pathfinding
Learning Team-Based Navigation: A Review of Deep Reinforcement Learning Techniques for Multi-Agent Pathfinding
Jaeho Chung
Jamil Fayyad
Younes Al Younes
Homayoun Najjaran
78
17
0
11 Aug 2023
Previous
12345...111213
Next