Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.02971
Cited By
Continuous control with deep reinforcement learning
9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Continuous control with deep reinforcement learning"
50 / 3,416 papers shown
Title
CaRoSaC: A Reinforcement Learning-Based Kinematic Control of Cable-Driven Parallel Robots by Addressing Cable Sag through Simulation
Rohit Dhakate
Thomas Jantos
Eren Allak
Stephan Weiss
J. Steinbrener
51
0
0
22 Apr 2025
Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback
Rohit Dhakate
Christian Brommer
C. Böhm
Stephan Weiss
J. Steinbrener
38
5
0
22 Apr 2025
Accelerating Visual Reinforcement Learning with Separate Primitive Policy for Peg-in-Hole Tasks
Zichun Xu
Zhaomin Wang
Yuntao Li
Lei Zhuang
Zhiyuan Zhao
Guocai Yang
Jingdong Zhao
33
0
0
21 Apr 2025
Learning to Reason under Off-Policy Guidance
Jianhao Yan
Yafu Li
Zican Hu
Zhi Wang
Ganqu Cui
Xiaoye Qu
Yu Cheng
Yue Zhang
OffRL
LRM
44
7
0
21 Apr 2025
Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments?
Xinglei Dou
Lei Liu
Limin Xiao
VLM
45
0
0
21 Apr 2025
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
Ahsan Bilal
Muhammad Ahmed Mohsin
Muhammad Umer
Muhammad Awais Khan Bangash
Muhammad Ali Jamshed
LLMAG
LRM
AI4CE
86
1
0
20 Apr 2025
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
284
0
0
19 Apr 2025
High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion
Libo Zhang
Yongsheng Yu
Jiali Yao
Heng Fan
52
0
0
17 Apr 2025
Modelling Mean-Field Games with Neural Ordinary Differential Equations
Anna C. M. Thöni
Yoram Bachrach
Tal Kachman
40
0
0
17 Apr 2025
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Zhouyang Jiang
Bin Zhang
Airong Wei
Zhiwei Xu
OffRL
67
0
0
17 Apr 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
59
0
0
16 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
48
0
0
14 Apr 2025
GRAIN: Multi-Granular and Implicit Information Aggregation Graph Neural Network for Heterophilous Graphs
Songwei Zhao
Yuan Jiang
Zijing Zhang
Yang Yu
Hechang Chen
39
0
0
09 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
47
0
0
09 Apr 2025
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
Yang Cao
Changhao Zhang
Xiaoshuang Chen
Kaiqiao Zhan
Ben Wang
41
1
0
08 Apr 2025
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Manuel Sage
Martin Staniszewski
Yaoyao Fiona Zhao
41
2
0
06 Apr 2025
Optimistic Learning for Communication Networks
George Iosifidis
N. Mhaisen
D. Leith
OffRL
67
0
0
04 Apr 2025
FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation
Tan Shu
Li Shen
53
0
0
04 Apr 2025
An Extended Symbolic-Arithmetic Model for Teaching Double-Black Removal with Rotation in Red-Black Trees
Kennedy E. Ehimwenma
Hongyu Zhou
Jinqiao Wang
Ze Zheng
35
0
0
04 Apr 2025
MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning
Luca Furieri
Sucheth Shenoy
Danilo Saccani
Andrea Martin
Giancarlo Ferrari-Trecate
36
0
0
03 Apr 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
41
1
0
02 Apr 2025
FastFlow: Early Yet Robust Network Flow Classification using the Minimal Number of Time-Series Packets
Rushi Jayeshkumar Babaria
Minzhao Lyu
Gustavo E. A. P. A. Batista
V. Sivaraman
AI4TS
48
0
0
02 Apr 2025
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles
Sorin Grigorescu
Mihai V. Zaha
AI4CE
64
0
0
02 Apr 2025
Value Iteration for Learning Concurrently Executable Robotic Control Tasks
Sheikh A. Tahmid
Gennaro Notomista
OffRL
56
0
0
01 Apr 2025
Personality-Driven Decision-Making in LLM-Based Autonomous Agents
Lewis Newsham
Daniel Prince
LLMAG
AI4CE
63
1
0
01 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Zhiwen Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
Ilya Kolmanovsky
Dimitar Filev
62
0
0
31 Mar 2025
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
Rajdeep Singh Hundal
Yan Xiao
Xiaochun Cao
Jin Song Dong
Manuel Rigger
76
0
0
28 Mar 2025
Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks
Heng Zhang
Gokhan Solak
Arash Ajoudani
46
0
0
27 Mar 2025
Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation
Lewis Newsham
Ryan Hyland
Daniel Prince
70
1
0
25 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
134
2
0
24 Mar 2025
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning
Chak Lam Shek
Pratap Tokekar
56
0
0
24 Mar 2025
AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models
Le Qiu
Zelai Xu
Qixin Tan
Wenhao Tang
Chao Yu
Yu Wang
AAML
71
0
0
24 Mar 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
55
0
0
24 Mar 2025
A Survey of Large Language Model Agents for Question Answering
Murong Yue
LLMAG
LM&MA
ELM
71
4
0
24 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
72
0
0
23 Mar 2025
Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Jong-Hyun Jeonga
Hongki Jo
Qiang Zhou
Tahsin Afroz Hoque Nishat
Lang Wu
45
1
0
20 Mar 2025
Energy-Efficient Federated Learning and Migration in Digital Twin Edge Networks
Yuzhi Zhou
Yaru Fu
Zheng Shi
Howard H. Yang
Kevin Hung
Yanzhe Zhang
62
0
0
20 Mar 2025
Distributed Learning over Arbitrary Topology: Linear Speed-Up with Polynomial Transient Time
Runze You
Shi Pu
46
0
0
20 Mar 2025
Design of Reward Function on Reinforcement Learning for Automated Driving
Takeru Goto
Yuki Kizumi
Shun Iwasaki
49
4
0
20 Mar 2025
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs
Wei-Ting Hung
Shao-Hua Sun
Ping-Chun Hsieh
55
0
0
17 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
74
0
0
17 Mar 2025
Dense Policy: Bidirectional Autoregressive Learning of Actions
Yue Su
Xinyu Zhan
Hongjie Fang
Han Xue
Hao-Shu Fang
Yongqian Li
Cewu Lu
Lixin Yang
VGen
62
3
0
17 Mar 2025
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
70
0
0
15 Mar 2025
Generative Modeling of Adversarial Lane-Change Scenario
Chuancheng Zhang
Zhenhao Wang
Jiangcheng Wang
Kun Su
Qiang Lv
Bin Jiang
Kunkun Hao
Wenyu Wang
46
0
0
15 Mar 2025
Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning
Peter Böhm
Archie C. Chapman
Pauline Pounds
87
0
0
14 Mar 2025
Safe exploration in reproducing kernel Hilbert spaces
Abdullah Tokmak
Kiran G. Krishnan
Thomas B. Schon
Dominik Baumann
49
0
0
13 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
58
0
0
10 Mar 2025
Precise Insulin Delivery for Artificial Pancreas: A Reinforcement Learning Optimized Adaptive Fuzzy Control Approach
Omar Mameche
Abdelhadi Abedou
Taqwa Mezaache
Mohamed Tadjine
71
0
0
09 Mar 2025
Vairiational Stochastic Games
Zhiyu Zhao
Haifeng Zhang
62
0
0
08 Mar 2025
THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks
Chaoran Xiong
Litao Wei
Kehui Ma
Zhen Sun
Yan Xiang
Zihan Nan
Trieu-Kien Truong
Ling Pei
50
5
0
07 Mar 2025
Previous
1
2
3
4
5
...
67
68
69
Next