ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXivPDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 3,416 papers shown
Title
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Xiangxin Zhou
Yi Xiao
Haowei Lin
Xinheng He
Jiaqi Guan
Yang Wang
Qiang Liu
F. I. S. Kevin Zhou
Liang Wang
Jianzhu Ma
AI4CE
68
1
0
06 Mar 2025
Reinforcement Learning-based Threat Assessment
Reinforcement Learning-based Threat Assessment
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
93
0
0
04 Mar 2025
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
B. Mabsout
Abdelrahman AbdelGawad
R. Mancuso
51
1
0
04 Mar 2025
Is Bellman Equation Enough for Learning Control?
Haoxiang You
Lekan Molu
Ian Abraham
75
0
0
04 Mar 2025
Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations
Chonlam Ho
Jianshu Hu
Haoran Wang
Qi Dou
Yutong Ban
MedIm
84
1
0
03 Mar 2025
Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization
Abdullah Akgul
Gulcin Baykal
Manuel Haußmann
M. Kandemir
63
0
0
03 Mar 2025
Runtime Learning of Quadruped Robots in Wild Environments
Yihao Cai
Y. Mao
L. Sha
H. Cao
Marco Caccamo
58
0
0
02 Mar 2025
BodyGen: Advancing Towards Efficient Embodiment Co-Design
Haofei Lu
Zhe Wu
Junliang Xing
Jianshu Li
Ruoyu Li
Zhe Li
Yuanchun Shi
51
0
0
01 Mar 2025
Scalable Reinforcement Learning for Virtual Machine Scheduling
Junjie Sheng
JieHao Wu
Haochuan Cui
Yiqiu Hu
Wenli Zhou
Lei Zhu
Qian Peng
Wenhao Li
Xiangfeng Wang
OffRL
38
0
0
01 Mar 2025
Actor-Critic Cooperative Compensation to Model Predictive Control for Off-Road Autonomous Vehicles Under Unknown Dynamics
Prakhar Gupta
J. Smereka
Yunyi Jia
49
0
0
01 Mar 2025
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
54
0
0
28 Feb 2025
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Maria Krinner
Elie Aljalbout
Angel Romero
Davide Scaramuzza
OffRL
83
1
0
27 Feb 2025
On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+($λ$,$λ$))-GA
On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+(λλλ,λλλ))-GA
Tai Nguyen
Phong Le
André Biendenkapp
Carola Doerr
Nguyen Dang
47
0
0
27 Feb 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
93
0
0
27 Feb 2025
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Thomas Hickling
Maxwell Hogan
Abdulla Tammam
Nabil Aouf
81
0
0
27 Feb 2025
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting
Yifan Hu
Yuante Li
Peiyuan Liu
Yuxia Zhu
Naiqi Li
Tao Dai
Shu-Tao Xia
Dawei Cheng
Changjun Jiang
AI4TS
99
1
0
26 Feb 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
60
0
0
26 Feb 2025
XSS Adversarial Attacks Based on Deep Reinforcement Learning: A Replication and Extension Study
XSS Adversarial Attacks Based on Deep Reinforcement Learning: A Replication and Extension Study
Samuele Pasini
Gianluca Maragliano
Jinhan Kim
Paolo Tonella
AAML
47
0
0
26 Feb 2025
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
Meng Feng
Viraj Parimi
B. Williams
82
1
0
25 Feb 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
86
0
0
24 Feb 2025
A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding
Hamidreza Raei
Elena De Momi
Arash Ajoudani
88
0
0
24 Feb 2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Policy Learning with a Natural Language Action Space: A Causal Approach
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
CML
53
0
0
24 Feb 2025
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Fangqi Liu
Rishav Sen
J. P. Talusan
Ava Pettet
Aaron Kandel
Yoshinori Suzue
Ayan Mukhopadhyay
A. Dubey
OffRL
61
0
0
24 Feb 2025
Exploring Sentiment Manipulation by LLM-Enabled Intelligent Trading Agents
Exploring Sentiment Manipulation by LLM-Enabled Intelligent Trading Agents
David Byrd
LLMAG
LM&Ro
AIFin
63
0
0
22 Feb 2025
Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
Chuanguang Yang
Xinqiang Yu
Han Yang
Zhulin An
Chengqing Yu
Libo Huang
Yongjun Xu
63
1
0
22 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
83
1
0
21 Feb 2025
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
63
0
0
21 Feb 2025
Estimating Control Barriers from Offline Data
Hongzhan Yu
Seth Farrell
Ryo Yoshimitsu
Zhizhen Qin
Henrik I. Christensen
Sicun Gao
OffRL
63
3
0
21 Feb 2025
PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization
PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization
Xinpeng Shou
86
0
0
21 Feb 2025
Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems
Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems
Ehsan Sabouni
Hijaz Ahmad
Vittorio Giammarino
Christos G. Cassandras
I. Paschalidis
Wenchao Li
124
2
0
21 Feb 2025
Enhancing PPO with Trajectory-Aware Hybrid Policies
Qisai Liu
Zhanhong Jiang
Hsin-Jung Yang
Mahsa Khosravi
Joshua R. Waite
Soumik Sarkar
62
0
0
21 Feb 2025
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch
Zhengrong Xue
H. Zhang
Jin Cheng
Zhengmao He
Yuanchen Ju
Chan-Yu Lin
Gu Zhang
Huazhe Xu
OffRL
111
9
0
20 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
96
0
0
18 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
100
26
0
17 Feb 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Huiwen Zheng
Yunxin Liu
Feng Zhao
AI4CE
86
0
0
17 Feb 2025
Deep Reinforcement Learning-Based Bidding Strategies for Prosumers Trading in Double Auction-Based Transactive Energy Market
Jun Jiang
Yuanliang Li
Luyang Hou
Mohsen Ghafouri
Peng Zhang
Jun Yan
Yuhong Liu
56
0
0
16 Feb 2025
Intelligent Legal Assistant: An Interactive Clarification System for Legal Question Answering
Intelligent Legal Assistant: An Interactive Clarification System for Legal Question Answering
Rujing Yao
Yiquan Wu
Tong Zhang
Xuhui Zhang
Yuting Huang
Yang Wu
Jiayin Yang
Changlong Sun
Fang Wang
Xiaozhong Liu
AILaw
ELM
38
0
0
11 Feb 2025
Infinite-Horizon Value Function Approximation for Model Predictive Control
Armand Jordana
Sébastien Kleff
Arthur Haffemayer
Joaquim Ortiz de Haro
Justin Carpentier
Nicolas Mansard
Ludovic Righetti
48
0
0
10 Feb 2025
Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
Aurélien Renault
A. Bondu
Antoine Cornuéjols
Vincent Lemaire
60
0
0
10 Feb 2025
Task Offloading in Vehicular Edge Computing using Deep Reinforcement Learning: A Survey
Task Offloading in Vehicular Edge Computing using Deep Reinforcement Learning: A Survey
Ashab Uddin
Ahmed Hamdi Sakr
Ning Zhang
OffRL
67
0
0
10 Feb 2025
Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning
Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning
J. Brahmanage
Jiajing Ling
Akshat Kumar
67
0
0
08 Feb 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Rudolf Reiter
Jasper Hoffmann
D. Reinhardt
Florian Messerer
Katrin Baumgärtner
Shamburaj Sawant
Joschka Boedecker
Moritz Diehl
S. Gros
97
5
0
04 Feb 2025
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
Siyu Wang
Xiaocong Chen
Lina Yao
CML
OffRL
97
0
0
04 Feb 2025
Circular Microalgae-Based Carbon Control for Net Zero
Circular Microalgae-Based Carbon Control for Net Zero
Federico Zocco
Joan García
W. Haddad
134
0
0
04 Feb 2025
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu
Chao Yu
Chao Yu
Huining Yuan
Xiangmin Yi
...
Wenhao Tang
Yu Wang
Wenbo Ding
Xiusi Chen
Yu Wang
167
0
0
04 Feb 2025
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Federico Malato
Ville Hautamaki
54
0
0
03 Feb 2025
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
73
1
0
02 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu Wang
OffRL
81
0
0
01 Feb 2025
On-Line Learning for Planning and Control of Underactuated Robots with Uncertain Dynamics
On-Line Learning for Planning and Control of Underactuated Robots with Uncertain Dynamics
Giulio Turrisi
Marco Capotondi
C. Gaz
Valerio Modugno
Giuseppe Oriolo
Alessandro De Luca
73
8
0
30 Jan 2025
Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information
Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information
Jinghai He
Cheng Hua
Chunyang Zhou
Zeyu Zheng
AIFin
53
2
0
29 Jan 2025
Previous
123456...676869
Next