1701.07274
Cited By
Versions: v1, v2, v3, v4, v5, v6 (latest)
Deep Reinforcement Learning: An Overview
25 January 2017
Yuxi Li
OffRL
VLM
Papers citing
"Deep Reinforcement Learning: An Overview"
50 / 417 papers shown
Title
Learning Dexterous Object Handover
Daniel Frau-Alfaro
Julio Castaño-Amorós
S. T. Puente
Pablo Gil
Roberto Calandra
12
0
0
20 Jun 2025
Energy-Based Transfer for Reinforcement Learning
Zeyun Deng
Jasorsi Ghosh
Fiona Xie
Yuzhe Lu
Katia Sycara
Joseph Campbell
10
0
0
19 Jun 2025
Federated Neuroevolution O-RAN: Enhancing the Robustness of Deep Reinforcement Learning xApps
M. Kouchaki
Aly Sabri Abdalla
Vuk Marojevic
10
0
0
15 Jun 2025
Evolutionary Developmental Biology Can Serve as the Conceptual Foundation for a New Design Paradigm in Artificial Intelligence
Zeki Doruk Erden
Boi Faltings
14
0
0
15 Jun 2025
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
Yihong Guo
Yu Yang
Pan Xu
Anqi Liu
OffRL
31
0
0
10 Jun 2025
GPS Spoofing Attacks on AI-based Navigation Systems with Obstacle Avoidance in UAV
Ji Hyuk Jung
Mi Yeon Hong
Ji Won Yoon
AAML
27
0
0
10 Jun 2025
Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness
Yanwei Gong
Xiaolin Chang
13
0
0
10 Jun 2025
Evaluation of LLMs for mathematical problem solving
Ruonan Wang
Runxi Wang
Yunwen Shen
Chengfeng Wu
Qinglin Zhou
Rohitash Chandra
ELM
LRM
30
0
0
30 May 2025
AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models
Conor Heins
Toon Van de Maele
Alexander Tschantz
Hampus Linander
Dimitrije Marković
...
Magnus T. Koudahl
Marco Perin
Karl J. Friston
Tim Verbelen
Christopher L. Buckley
OCL
42
0
0
30 May 2025
When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy
Jirui Qi
Shan Chen
Zidi Xiong
Raquel Fernández
Danielle S. Bitterman
Arianna Bisazza
LRM
92
0
0
28 May 2025
PPO-BR: Dual-Signal Entropy-Reward Adaptation for Trust Region Policy Optimization
Ben Rahman
73
0
0
23 May 2025
On the Parallels Between Evolutionary Theory and the State of AI
Zeki Doruk Erden
Boi Faltings
23
0
0
13 May 2025
Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition
Zeki Doruk Erden
Donia Gasmi
Boi Faltings
CLL
74
1
0
13 May 2025
Adaptive Bias Generalized Rollout Policy Adaptation on the Flexible Job-Shop Scheduling Problem
Lotfi Kobrosly
Marc-Emmanuel Coupvent des Graviers
Christophe Guettier
Tristan Cazenave
107
0
0
13 May 2025
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
Chris Child
Lam Ngo
56
0
0
29 Apr 2025
ARMOR: Adaptive Meshing with Reinforcement Optimization for Real-time 3D Monitoring in Unexposed Scenes
Yizhe Zhang
Jianping Li
Xin Zhao
Fuxun Liang
Z. Dong
Bisheng Yang
AI4CE
119
0
0
28 Apr 2025
Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks
Tran Thuy Nga Truong
Jooyong Kim
99
0
0
24 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
75
2
0
19 Apr 2025
Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems
Xian Chen
R. Qu
Jing Dong
Ruibin Bai
Yaochu Jin
OffRL
65
0
0
10 Apr 2025
Anomaly Detection in Time Series Data Using Reinforcement Learning, Variational Autoencoder, and Active Learning
Bahareh Golchin
Banafsheh Rekabdar
AI4TS
163
2
0
03 Apr 2025
A Theory of Machine Understanding via the Minimum Description Length Principle
Canlin Zhang
Xiuwen Liu
127
0
0
01 Apr 2025
Reinforcement Learning for Active Matter
Wenjie Cai
Gongyi Wang
Yu Zhang
X. Qu
Zihan Huang
AI4CE
74
1
0
30 Mar 2025
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
111
0
0
10 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
166
0
0
27 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
166
2
0
13 Feb 2025
A transformer-based deep q learning approach for dynamic load balancing in software-defined networks
Evans Tetteh Owusu
Kwame Agyemang-Prempeh Agyekum
Marinah Benneh
Pius Ayorna
Justice Owusu Agyemang
George Nii Martey Colley
James Dzisi Gazde
73
0
0
28 Jan 2025
Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy
Chen Wang
Kaiyi Ji
Junyi Geng
Zhongqiang Ren
Taimeng Fu
...
Yi Du
Qihang Li
Yue Yang
Xiao Lin
Zhipeng Zhao
SSL
160
10
0
28 Jan 2025
Multi-Modality Collaborative Learning for Sentiment Analysis
Shanmin Wang
Chengguang Liu
Qingshan Liu
73
0
0
21 Jan 2025
Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance
Raúl Arranz
David Carramiñana
Gonzalo de Miguel
Juan A. Besada
Ana M. Bernardos
72
11
0
15 Jan 2025
Amortized Bayesian Experimental Design for Decision-Making
Daolang Huang
Yujia Guo
Luigi Acerbi
Samuel Kaski
119
3
0
03 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
157
18
0
03 Jan 2025
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Yan Gu
Zhaoze Liu
Shuhong Dai
Cong Liu
Ying Wang
Shen Wang
Georgios Theodoropoulos
Long Cheng
79
2
0
03 Jan 2025
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
100
0
0
28 Nov 2024
From Laws to Motivation: Guiding Exploration through Law-Based Reasoning and Rewards
Ziyu Chen
Zhiqing Xiao
Xinbei Jiang
Junbo Zhao
105
0
0
24 Nov 2024
TrojanRobot: Physical-World Backdoor Attacks Against VLM-based Robotic Manipulation
Xiaobei Wang
Hewen Pan
Hangtao Zhang
Minghui Li
Shengshan Hu
...
Peijin Guo
Yichen Wang
Wei Wan
Aishan Liu
L. Zhang
AAML
178
2
0
18 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
96
6
0
08 Nov 2024
Opportunities of Reinforcement Learning in South Africa's Just Transition
Claude Formanek
C. Tilbury
Jonathan P. Shock
137
0
0
06 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
79
1
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
82
0
0
03 Nov 2024
α-TCVAE: On the relationship between Disentanglement and Diversity
Cristian Meo
Louis Mahon
Anirudh Goyal
Justin Dauwels
DRL
151
8
0
01 Nov 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter
Yansong Li
Zeyu Dong
Ertai Luo
Yu Wu
Shuo Wu
Shuo Han
34
2
0
16 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
31
2
0
15 Oct 2024
Whole-Body Dynamic Throwing with Legged Manipulators
Humphrey Munn
Brendan Tidd
Peter Böhm
M. Gallagher
David Howard
98
3
0
08 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
76
1
0
06 Oct 2024
Distribution Guided Active Feature Acquisition
Yang Li
Junier Oliva
71
0
0
04 Oct 2024
AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking
Michelle S. Lam
Fred Hohman
Dominik Moritz
Jeffrey P. Bigham
Kenneth Holstein
Mary Beth Kery
73
1
0
26 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
110
3
0
25 Sep 2024
Fair Reinforcement Learning Algorithm for PV Active Control in LV Distribution Networks
Maurizio Vassallo
A. Benzerga
Alireza Bahmanyar
Damien Ernst
78
2
0
09 Sep 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
107
0
0
13 Aug 2024
Faster Model Predictive Control via Self-Supervised Initialization Learning
Zhaoxin Li
Letian Chen
Rohan R. Paleja
S. Nageshrao
Matthew C. Gombolay
359
2
0
06 Aug 2024