Deep Reinforcement Learning: An Overview

25 January 2017
Yuxi Li
OffRL, VLM

Papers citing "Deep Reinforcement Learning: An Overview"

50 / 417 papers shown
Learning Dexterous Object Handover
Daniel Frau-Alfaro
Julio Castaño-Amorós
S. T. Puente
Pablo Gil
Roberto Calandra
12
0
0
20 Jun 2025
Energy-Based Transfer for Reinforcement Learning
Zeyun Deng
Jasorsi Ghosh
Fiona Xie
Yuzhe Lu
Katia Sycara
Joseph Campbell
10
0
0
19 Jun 2025
Federated Neuroevolution O-RAN: Enhancing the Robustness of Deep Reinforcement Learning xApps
M. Kouchaki
Aly Sabri Abdalla
Vuk Marojevic
10
0
0
15 Jun 2025
Evolutionary Developmental Biology Can Serve as the Conceptual Foundation for a New Design Paradigm in Artificial Intelligence
Zeki Doruk Erden
Boi Faltings
14
0
0
15 Jun 2025
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
Yihong Guo
Yu Yang
Pan Xu
Anqi Liu
OffRL
31
0
0
10 Jun 2025
GPS Spoofing Attacks on AI-based Navigation Systems with Obstacle Avoidance in UAV
Ji Hyuk Jung
Mi Yeon Hong
Ji Won Yoon
AAML
27
0
0
10 Jun 2025
Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness
Yanwei Gong
Xiaolin Chang
13
0
0
10 Jun 2025
Evaluation of LLMs for mathematical problem solving
Ruonan Wang
Runxi Wang
Yunwen Shen
Chengfeng Wu
Qinglin Zhou
Rohitash Chandra
ELM, LRM
30
0
0
30 May 2025
AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models
Conor Heins
Toon Van de Maele
Alexander Tschantz
Hampus Linander
Dimitrije Marković
...
Magnus T. Koudahl
Marco Perin
Karl J. Friston
Tim Verbelen
Christopher L. Buckley
OCL
42
0
0
30 May 2025
When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy
Jirui Qi
Shan Chen
Zidi Xiong
Raquel Fernández
Danielle S. Bitterman
Arianna Bisazza
LRM
92
0
0
28 May 2025
PPO-BR: Dual-Signal Entropy-Reward Adaptation for Trust Region Policy Optimization
Ben Rahman
73
0
0
23 May 2025
On the Parallels Between Evolutionary Theory and the State of AI
Zeki Doruk Erden
Boi Faltings
23
0
0
13 May 2025
Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition
Zeki Doruk Erden
Donia Gasmi
Boi Faltings
CLL
74
1
0
13 May 2025
Adaptive Bias Generalized Rollout Policy Adaptation on the Flexible Job-Shop Scheduling Problem
Lotfi Kobrosly
Marc-Emmanuel Coupvent des Graviers
Christophe Guettier
Tristan Cazenave
107
0
0
13 May 2025
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
Chris Child
Lam Ngo
56
0
0
29 Apr 2025
ARMOR: Adaptive Meshing with Reinforcement Optimization for Real-time 3D Monitoring in Unexposed Scenes
Yizhe Zhang
Jianping Li
Xin Zhao
Fuxun Liang
Z. Dong
Bisheng Yang
AI4CE
119
0
0
28 Apr 2025
Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks
Tran Thuy Nga Truong
Jooyong Kim
99
0
0
24 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL, LRM
75
2
0
19 Apr 2025
Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems
Xian Chen
R. Qu
Jing Dong
Ruibin Bai
Yaochu Jin
OffRL
65
0
0
10 Apr 2025
Anomaly Detection in Time Series Data Using Reinforcement Learning, Variational Autoencoder, and Active Learning
Bahareh Golchin
Banafsheh Rekabdar
AI4TS
163
2
0
03 Apr 2025
A Theory of Machine Understanding via the Minimum Description Length Principle
Canlin Zhang
Xiuwen Liu
127
0
0
01 Apr 2025
Reinforcement Learning for Active Matter
Wenjie Cai
Gongyi Wang
Yu Zhang
X. Qu
Zihan Huang
AI4CE
74
1
0
30 Mar 2025
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
111
0
0
10 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
166
0
0
27 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
166
2
0
13 Feb 2025
A transformer-based deep q learning approach for dynamic load balancing in software-defined networks
Evans Tetteh Owusu
Kwame Agyemang-Prempeh Agyekum
Marinah Benneh
Pius Ayorna
Justice Owusu Agyemang
George Nii Martey Colley
James Dzisi Gazde
73
0
0
28 Jan 2025
Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy
Chen Wang
Kaiyi Ji
Junyi Geng
Zhongqiang Ren
Taimeng Fu
...
Yi Du
Qihang Li
Yue Yang
Xiao Lin
Zhipeng Zhao
SSL
160
10
0
28 Jan 2025
Multi-Modality Collaborative Learning for Sentiment Analysis
Shanmin Wang
Chengguang Liu
Qingshan Liu
73
0
0
21 Jan 2025
Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance
Raúl Arranz
David Carramiñana
Gonzalo de Miguel
Juan A. Besada
Ana M. Bernardos
72
11
0
15 Jan 2025
Amortized Bayesian Experimental Design for Decision-Making
Daolang Huang
Yujia Guo
Luigi Acerbi
Samuel Kaski
119
3
0
03 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
157
18
0
03 Jan 2025
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Yan Gu
Zhaoze Liu
Shuhong Dai
Cong Liu
Ying Wang
Shen Wang
Georgios Theodoropoulos
Long Cheng
79
2
0
03 Jan 2025
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
100
0
0
28 Nov 2024
From Laws to Motivation: Guiding Exploration through Law-Based Reasoning and Rewards
Ziyu Chen
Zhiqing Xiao
Xinbei Jiang
Junbo Zhao
105
0
0
24 Nov 2024
TrojanRobot: Physical-World Backdoor Attacks Against VLM-based Robotic Manipulation
Xiaobei Wang
Hewen Pan
Hangtao Zhang
Minghui Li
Shengshan Hu
...
Peijin Guo
Yichen Wang
Wei Wan
Aishan Liu
L. Zhang
AAML
178
2
0
18 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
96
6
0
08 Nov 2024
Opportunities of Reinforcement Learning in South Africa's Just Transition
Claude Formanek
C. Tilbury
Jonathan P. Shock
137
0
0
06 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
79
1
0
04 Nov 2024
Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques
Saidul Islam
Gaith Rjoub
Hanae Elmekki
Jamal Bentahar
Witold Pedrycz
R. Cohen
82
0
0
03 Nov 2024
$α$-TCVAE: On the relationship between Disentanglement and Diversity
Cristian Meo
Louis Mahon
Anirudh Goyal
Justin Dauwels
DRL
151
8
0
01 Nov 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter
Yansong Li
Zeyu Dong
Ertai Luo
Yu Wu
Shuo Wu
Shuo Han
34
2
0
16 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
31
2
0
15 Oct 2024
Whole-Body Dynamic Throwing with Legged Manipulators
Humphrey Munn
Brendan Tidd
Peter Böhm
M. Gallagher
David Howard
98
3
0
08 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
76
1
0
06 Oct 2024
Distribution Guided Active Feature Acquisition
Yang Li
Junier Oliva
71
0
0
04 Oct 2024
AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking
Michelle S. Lam
Fred Hohman
Dominik Moritz
Jeffrey P. Bigham
Kenneth Holstein
Mary Beth Kery
73
1
0
26 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
110
3
0
25 Sep 2024
Fair Reinforcement Learning Algorithm for PV Active Control in LV Distribution Networks
Maurizio Vassallo
A. Benzerga
Alireza Bahmanyar
Damien Ernst
78
2
0
09 Sep 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
107
0
0
13 Aug 2024
Faster Model Predictive Control via Self-Supervised Initialization Learning
Zhaoxin Li
Letian Chen
Rohan R. Paleja
S. Nageshrao
Matthew C. Gombolay
359
2
0
06 Aug 2024