Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.05866
Cited By
A Brief Survey of Deep Reinforcement Learning
19 August 2017
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Brief Survey of Deep Reinforcement Learning"
50 / 596 papers shown
Title
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Donghoon Lee
Tung M. Luu
Younghwan Lee
Chang D. Yoo
OffRL
VLM
9
0
0
16 May 2025
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
David Ramos
Lucas Lacasa
E. Valero
G. Rubio
AI4CE
27
0
0
05 May 2025
A Generalised and Adaptable Reinforcement Learning Stopping Method
Reem Bin-Hezam
Mark Stevenson
24
0
0
03 May 2025
Quantum-Enhanced Hybrid Reinforcement Learning Framework for Dynamic Path Planning in Autonomous Systems
Sahil Tomar
Shamshe Alam
Sandeep Kumar
Amit Mathur
46
0
0
29 Apr 2025
Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken
Koki Inami
Masashi Konosu
Koki Yamane
Nozomu Masuya
Yunhan Li
Yu-Han Shu
Hiroshi Sato
Shinnosuke Homma
S. Sakaino
49
0
0
28 Apr 2025
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
Dixiao Wei
Peng Yi
Jinlong Lei
Yiguang Hong
Yuchuan Du
134
0
0
28 Apr 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRL
AI4TS
48
0
0
27 Apr 2025
A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs
Jalal Arabneydi
Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Matthew E. Taylor
Matthew J. Guzdial
Antoine Fagette
Younes Zerouali
21
0
0
23 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
32
0
0
19 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
41
0
0
14 Apr 2025
Efficient Transformed Gaussian Process State-Space Models for Non-Stationary High-Dimensional Dynamical Systems
Zhidi Lin
Ying Li
Feng Yin
Juan Maroñas
Alexandre Thiéry
54
0
0
24 Mar 2025
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark
Bingchen Miao
Y. Wu
Minghe Gao
Qifan Yu
Wendong Bu
Wenqiao Zhang
Yunfei Li
Siliang Tang
Tat-Seng Chua
Juncheng Billy Li
LLMAG
LRM
56
0
0
24 Mar 2025
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Zhaoxin Li
Zhang Xi-Jia
Batuhan Altundas
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
OffRL
41
0
0
20 Mar 2025
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Minori Narita
Ryo Kuroiwa
J. Christopher Beck
42
0
0
20 Mar 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt
OffRL
70
0
0
19 Mar 2025
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control
Jeff Jewett
Sandhya Saisubramanian
OffRL
48
0
0
19 Mar 2025
Deep Belief Markov Models for POMDP Inference
Giacomo Arcieri
K. Papakonstantinou
D. Štraub
Eleni Chatzi
43
0
0
17 Mar 2025
Context-aware Constrained Reinforcement Learning Based Energy-Efficient Power Scheduling for Non-stationary XR Data Traffic
Kexuan Wang
An Liu
44
0
0
13 Mar 2025
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization
Jiajun Yu
Y. Zheng
Huan Yee Koh
Shirui Pan
Tianyue Wang
Haishuai Wang
64
0
0
05 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
Jingyang Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
148
0
0
05 Mar 2025
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Lixing Lyu
Jiashuo Jiang
Wang Chi Cheung
42
1
0
24 Feb 2025
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Yudong Xu
Wenhao Li
Scott Sanner
Elias Boutros Khalil
44
0
0
18 Feb 2025
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen
Anas Badr
Ali Abohendy
Hatem Alsaadawy
Nadine Alsayad
59
0
0
14 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
89
1
0
13 Feb 2025
Reinforced Lifelong Editing for Language Models
Zherui Li
Houcheng Jiang
Hao Chen
Baolong Bi
Zhenhong Zhou
Fei Sun
Fan Zhang
Qing Guo
KELM
51
5
0
09 Feb 2025
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Ding Hu
Pengxiang Hua
Zhen Huang
88
0
0
09 Feb 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
48
5
0
24 Jan 2025
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Wessel Ledder
Yuzhen Qin
Kiki van der Heijden
99
0
0
20 Jan 2025
Advanced Persistent Threats (APT) Attribution Using Deep Reinforcement Learning
Animesh Singh Basnet
M. C. Ghanem
Dipo Dunsin
Wiktor Sowinski-Mydlarz
AAML
37
0
0
08 Jan 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
141
0
0
03 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
H. Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
W. Yu
Dong Yu
LLMAG
61
3
0
03 Jan 2025
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
72
0
0
20 Nov 2024
Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms
Haizhou Ge
Ruixiang Wang
Zhu-ang Xu
Hongrui Zhu
Ruichen Deng
Yuhang Dong
Zeyu Pang
Guyue Zhou
Junyu Zhang
Lu Shi
78
1
0
18 Nov 2024
Joint Age-State Belief is All You Need: Minimizing AoII via Pull-Based Remote Estimation
Ismail Cosandal
S. Ulukus
Nail Akar
21
3
0
11 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
22
2
0
08 Nov 2024
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
OffRL
28
0
0
07 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
43
0
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
31
0
0
03 Nov 2024
Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Mohammad Feizabadi
Arman Hosseini
Zakaria Yahouni
35
0
0
01 Nov 2024
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
Kendong Liu
Zhiyu Zhu
Chuanhao Li
Hui Liu
H. Zeng
Junhui Hou
EGVM
38
2
0
29 Oct 2024
Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment
Yi Zheng
Zehao Li
Peng Jiang
Yijie Peng
22
0
0
28 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
36
1
0
27 Oct 2024
Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Yuting Tang
Xin-Qiang Cai
Jing-Cheng Pang
Qiyu Wu
Yao-Xiang Ding
Masashi Sugiyama
OffRL
26
0
0
26 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
25
2
0
15 Oct 2024
A Scalable Communication Protocol for Networks of Large Language Models
Samuele Marro
Emanuele La Malfa
Jesse Wright
Bernard Ghanem
Nigel Shadbolt
Michael Wooldridge
Philip H. S. Torr
GNN
AIFin
40
8
0
14 Oct 2024
Transfer Learning for a Class of Cascade Dynamical Systems
Shima Rabiei
Sandipan Mishra
Santiago Paternain
18
0
0
09 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
37
0
0
06 Oct 2024
TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction
Xuechen Mu
Zhenyu Huang
Kewei Li
Haotian Zhang
Xiuli Wang
Yusi Fan
Kai Zhang
Fengfeng Zhou
AI4TS
OffRL
18
0
0
27 Sep 2024
Artificial Intelligence for Secured Information Systems in Smart Cities: Collaborative IoT Computing with Deep Reinforcement Learning and Blockchain
Amin Zakaie Far
Mohammad Zakaie Far
Sonia Gharibzadeh
Shiva Zangeneh
Leila Amini
Morteza Rahimi
Morteza Rahimi
Saeed Asadi
26
5
0
24 Sep 2024
Semifactual Explanations for Reinforcement Learning
Jasmina Gajcin
Jovan Jeromela
Ivana Dusparic
OffRL
35
0
0
09 Sep 2024
1
2
3
4
...
10
11
12
Next