ResearchTrend.AI
Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra

Papers citing "Continuous control with deep reinforcement learning"

50 / 3,416 papers shown
CaRoSaC: A Reinforcement Learning-Based Kinematic Control of Cable-Driven Parallel Robots by Addressing Cable Sag through Simulation
Rohit Dhakate
Thomas Jantos
Eren Allak
Stephan Weiss
J. Steinbrener
51
0
0
22 Apr 2025
Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback
Rohit Dhakate
Christian Brommer
C. Böhm
Stephan Weiss
J. Steinbrener
38
5
0
22 Apr 2025
Accelerating Visual Reinforcement Learning with Separate Primitive Policy for Peg-in-Hole Tasks
Zichun Xu
Zhaomin Wang
Yuntao Li
Lei Zhuang
Zhiyuan Zhao
Guocai Yang
Jingdong Zhao
33
0
0
21 Apr 2025
Learning to Reason under Off-Policy Guidance
Jianhao Yan
Yafu Li
Zican Hu
Zhi Wang
Ganqu Cui
Xiaoye Qu
Yu Cheng
Yue Zhang
OffRL
LRM
44
7
0
21 Apr 2025
Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments?
Xinglei Dou
Lei Liu
Limin Xiao
VLM
45
0
0
21 Apr 2025
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
Ahsan Bilal
Muhammad Ahmed Mohsin
Muhammad Umer
Muhammad Awais Khan Bangash
Muhammad Ali Jamshed
LLMAG
LRM
AI4CE
86
1
0
20 Apr 2025
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
284
0
0
19 Apr 2025
High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion
Libo Zhang
Yongsheng Yu
Jiali Yao
Heng Fan
52
0
0
17 Apr 2025
Modelling Mean-Field Games with Neural Ordinary Differential Equations
Anna C. M. Thöni
Yoram Bachrach
Tal Kachman
40
0
0
17 Apr 2025
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Zhouyang Jiang
Bin Zhang
Airong Wei
Zhiwei Xu
OffRL
67
0
0
17 Apr 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
59
0
0
16 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
48
0
0
14 Apr 2025
GRAIN: Multi-Granular and Implicit Information Aggregation Graph Neural Network for Heterophilous Graphs
Songwei Zhao
Yuan Jiang
Zijing Zhang
Yang Yu
Hechang Chen
39
0
0
09 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
47
0
0
09 Apr 2025
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
Yang Cao
Changhao Zhang
Xiaoshuang Chen
Kaiqiao Zhan
Ben Wang
41
1
0
08 Apr 2025
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Manuel Sage
Martin Staniszewski
Yaoyao Fiona Zhao
41
2
0
06 Apr 2025
Optimistic Learning for Communication Networks
George Iosifidis
N. Mhaisen
D. Leith
OffRL
67
0
0
04 Apr 2025
FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation
Tan Shu
Li Shen
53
0
0
04 Apr 2025
An Extended Symbolic-Arithmetic Model for Teaching Double-Black Removal with Rotation in Red-Black Trees
Kennedy E. Ehimwenma
Hongyu Zhou
Jinqiao Wang
Ze Zheng
35
0
0
04 Apr 2025
MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning
Luca Furieri
Sucheth Shenoy
Danilo Saccani
Andrea Martin
Giancarlo Ferrari-Trecate
36
0
0
03 Apr 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
41
1
0
02 Apr 2025
FastFlow: Early Yet Robust Network Flow Classification using the Minimal Number of Time-Series Packets
Rushi Jayeshkumar Babaria
Minzhao Lyu
Gustavo E. A. P. A. Batista
V. Sivaraman
AI4TS
48
0
0
02 Apr 2025
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles
Sorin Grigorescu
Mihai V. Zaha
AI4CE
64
0
0
02 Apr 2025
Value Iteration for Learning Concurrently Executable Robotic Control Tasks
Sheikh A. Tahmid
Gennaro Notomista
OffRL
56
0
0
01 Apr 2025
Personality-Driven Decision-Making in LLM-Based Autonomous Agents
Lewis Newsham
Daniel Prince
LLMAG
AI4CE
63
1
0
01 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Zhiwen Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
Ilya Kolmanovsky
Dimitar Filev
62
0
0
31 Mar 2025
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
Rajdeep Singh Hundal
Yan Xiao
Xiaochun Cao
Jin Song Dong
Manuel Rigger
76
0
0
28 Mar 2025
Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks
Heng Zhang
Gokhan Solak
Arash Ajoudani
46
0
0
27 Mar 2025
Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation
Lewis Newsham
Ryan Hyland
Daniel Prince
70
1
0
25 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
134
2
0
24 Mar 2025
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning
Chak Lam Shek
Pratap Tokekar
56
0
0
24 Mar 2025
AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models
Le Qiu
Zelai Xu
Qixin Tan
Wenhao Tang
Chao Yu
Yu Wang
AAML
71
0
0
24 Mar 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
55
0
0
24 Mar 2025
A Survey of Large Language Model Agents for Question Answering
Murong Yue
LLMAG
LM&MA
ELM
71
4
0
24 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
72
0
0
23 Mar 2025
Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Jong-Hyun Jeong
Hongki Jo
Qiang Zhou
Tahsin Afroz Hoque Nishat
Lang Wu
45
1
0
20 Mar 2025
Energy-Efficient Federated Learning and Migration in Digital Twin Edge Networks
Yuzhi Zhou
Yaru Fu
Zheng Shi
Howard H. Yang
Kevin Hung
Yanzhe Zhang
62
0
0
20 Mar 2025
Distributed Learning over Arbitrary Topology: Linear Speed-Up with Polynomial Transient Time
Runze You
Shi Pu
46
0
0
20 Mar 2025
Design of Reward Function on Reinforcement Learning for Automated Driving
Takeru Goto
Yuki Kizumi
Shun Iwasaki
49
4
0
20 Mar 2025
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs
Wei-Ting Hung
Shao-Hua Sun
Ping-Chun Hsieh
55
0
0
17 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
74
0
0
17 Mar 2025
Dense Policy: Bidirectional Autoregressive Learning of Actions
Yue Su
Xinyu Zhan
Hongjie Fang
Han Xue
Hao-Shu Fang
Yongqian Li
Cewu Lu
Lixin Yang
VGen
62
3
0
17 Mar 2025
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
70
0
0
15 Mar 2025
Generative Modeling of Adversarial Lane-Change Scenario
Chuancheng Zhang
Zhenhao Wang
Jiangcheng Wang
Kun Su
Qiang Lv
Bin Jiang
Kunkun Hao
Wenyu Wang
46
0
0
15 Mar 2025
Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning
Peter Böhm
Archie C. Chapman
Pauline Pounds
87
0
0
14 Mar 2025
Safe exploration in reproducing kernel Hilbert spaces
Abdullah Tokmak
Kiran G. Krishnan
Thomas B. Schon
Dominik Baumann
49
0
0
13 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
58
0
0
10 Mar 2025
Precise Insulin Delivery for Artificial Pancreas: A Reinforcement Learning Optimized Adaptive Fuzzy Control Approach
Omar Mameche
Abdelhadi Abedou
Taqwa Mezaache
Mohamed Tadjine
71
0
0
09 Mar 2025
Variational Stochastic Games
Zhiyu Zhao
Haifeng Zhang
62
0
0
08 Mar 2025
THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks
Chaoran Xiong
Litao Wei
Kehui Ma
Zhen Sun
Yan Xiang
Zihan Nan
Trieu-Kien Truong
Ling Pei
50
5
0
07 Mar 2025