1708.05866
A Brief Survey of Deep Reinforcement Learning
19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
Papers citing
"A Brief Survey of Deep Reinforcement Learning"
50 / 604 papers shown
Title
Energy-Based Transfer for Reinforcement Learning
Zeyun Deng
Jasorsi Ghosh
Fiona Xie
Yuzhe Lu
Katia Sycara
Joseph Campbell
15
0
0
19 Jun 2025
PNCS: Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning
Liangyan Li
Yangyi Liu
Yimo Ning
Stefano Rini
Jun Chen
FedML
19
0
0
18 Jun 2025
Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework
Hemjyoti Das
Minh Nhat Vu
Christian Ott
16
0
0
16 Jun 2025
Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator
Alberto Bazán-Guillén
Carlos Beis-Penedo
Diego Cajaraville-Aboy
Pablo Barbecho-Bautista
R. Redondo
Luis J. de la Cruz Llopis
Ana Fernández-Vilas
Mónica Aguilar Igartua
M. Fernández-Veiga
AI4TS
19
0
0
09 Jun 2025
CARoL: Context-aware Adaptation for Robot Learning
Zechen Hu
Tong Xu
Xuesu Xiao
Xuan Wang
27
0
0
08 Jun 2025
Enhancing Efficiency and Propulsion in Bio-mimetic Robotic Fish through End-to-End Deep Reinforcement Learning
Xinyu Cui
Boai Sun
Yi Zhu
Ning Yang
Haifeng Zhang
Weicheng Cui
D. Fan
Jun Wang
159
9
0
05 Jun 2025
Autoencoding Random Forests
Binh Duc Vu
Jan Kapar
Marvin N. Wright
David S. Watson
154
0
0
27 May 2025
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
Yuzheng Hu
Fan Wu
Haotian Ye
David A. Forsyth
James Y. Zou
Nan Jiang
Jiaqi W. Ma
Han Zhao
OffRL
74
0
0
25 May 2025
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Jingtong Gao
Ling Pan
Yejing Wang
Rui Zhong
Chi Lu
Qingpeng Cai
Peng Jiang
Xiangyu Zhao
LRM
101
1
0
23 May 2025
LLM-Powered AI Agent Systems and Their Applications in Industry
Guannan Liang
Qianqian Tong
LLMAG
LM&Ro
83
3
0
22 May 2025
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Donghoon Lee
Tung M. Luu
Younghwan Lee
Chang D. Yoo
OffRL
VLM
72
0
0
16 May 2025
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
David Ramos
Lucas Lacasa
E. Valero
G. Rubio
AI4CE
111
0
0
05 May 2025
A Generalised and Adaptable Reinforcement Learning Stopping Method
Reem Bin-Hezam
Mark Stevenson
57
0
0
03 May 2025
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
Sahil Tomar
Shamshe Alam
Sandeep Kumar
Amit Mathur
112
1
0
29 Apr 2025
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
Dixiao Wei
Peng Yi
Jinlong Lei
Yiguang Hong
Yuchuan Du
432
0
0
28 Apr 2025
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Koki Inami
Masashi Konosu
Koki Yamane
Nozomu Masuya
Yunhan Li
Yu-Han Shu
Hiroshi Sato
Shinnosuke Homma
S. Sakaino
234
0
0
28 Apr 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRL
AI4TS
80
0
0
27 Apr 2025
A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs
Jalal Arabneydi
Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Matthew E. Taylor
Matthew J. Guzdial
Antoine Fagette
Younes Zerouali
37
0
0
23 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
77
2
0
19 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
78
0
0
14 Apr 2025
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
Bingchen Miao
Y. Wu
Minghe Gao
Qifan Yu
Wendong Bu
Wenqiao Zhang
Yunfei Li
Siliang Tang
Tat-Seng Chua
Juncheng Billy Li
LLMAG
LRM
133
1
0
24 Mar 2025
Efficient Transformed Gaussian Process State-Space Models for Non-Stationary High-Dimensional Dynamical Systems
Zhidi Lin
Ying Li
Feng Yin
Juan Maroñas
Alexandre Thiéry
164
0
0
24 Mar 2025
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Minori Narita
Ryo Kuroiwa
J. Christopher Beck
78
0
0
20 Mar 2025
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Zhaoxin Li
Zhang Xi-Jia
Batuhan Altundas
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
OffRL
76
0
0
20 Mar 2025
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control
Jeff Jewett
Sandhya Saisubramanian
OffRL
117
0
0
19 Mar 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt
OffRL
168
1
0
19 Mar 2025
Deep Belief Markov Models for POMDP Inference
Giacomo Arcieri
K. Papakonstantinou
D. Štraub
Eleni Chatzi
86
0
0
17 Mar 2025
Context-aware Constrained Reinforcement Learning Based Energy-Efficient Power Scheduling for Non-stationary XR Data Traffic
Kexuan Wang
An Liu
83
0
0
13 Mar 2025
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization
Jiajun Yu
Y. Zheng
Huan Yee Koh
Xiaojun Jia
Tianyue Wang
Haishuai Wang
119
2
0
05 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
Jing Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
338
0
0
05 Mar 2025
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Lixing Lyu
Jiashuo Jiang
Wang Chi Cheung
87
1
0
24 Feb 2025
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Yudong Xu
Wenhao Li
Scott Sanner
Elias Boutros Khalil
99
0
0
18 Feb 2025
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen
Anas Badr
Ali Abohendy
Hatem Alsaadawy
Nadine Alsayad
143
2
0
14 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
168
2
0
13 Feb 2025
Reinforced Lifelong Editing for Language Models
Zherui Li
Houcheng Jiang
Hao Chen
Baolong Bi
Zhenhong Zhou
Fei Sun
Sihang Li
Xinze Wang
KELM
157
8
0
09 Feb 2025
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Ding Hu
Pengxiang Hua
Zhen Huang
261
0
0
09 Feb 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
117
6
0
24 Jan 2025
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Wessel Ledder
Yuzhen Qin
Kiki van der Heijden
207
1
0
20 Jan 2025
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Animesh Singh Basnet
M. C. Ghanem
Dipo Dunsin
Wiktor Sowinski-Mydlarz
AAML
63
0
0
08 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
146
4
0
03 Jan 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
464
0
0
03 Jan 2025
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
106
1
0
20 Nov 2024
Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms
Haizhou Ge
Ruixiang Wang
Zhu-ang Xu
Hongrui Zhu
Ruichen Deng
Yuhang Dong
Zeyu Pang
Guyue Zhou
Junyu Zhang
Lu Shi
140
1
0
18 Nov 2024
Joint Age-State Belief is All You Need: Minimizing AoII via Pull-Based Remote Estimation
Ismail Cosandal
S. Ulukus
Nail Akar
50
3
0
11 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
98
6
0
08 Nov 2024
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
OffRL
60
0
0
07 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
79
1
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
92
0
0
03 Nov 2024
Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Mohammad Feizabadi
Arman Hosseini
Zakaria Yahouni
94
0
0
01 Nov 2024
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
Kendong Liu
Zhiyu Zhu
Chuanhao Li
Hui Liu
H. Zeng
Junhui Hou
EGVM
76
4
0
29 Oct 2024