Learning Continuous Control Policies by Stochastic Value Gradients

30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez

Papers citing "Learning Continuous Control Policies by Stochastic Value Gradients"

50 / 329 papers shown
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
21
17
0
11 Aug 2020
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
36
47
0
30 Jun 2020
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
36
319
0
26 Jun 2020
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
36
9
0
26 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
33
55
0
23 Jun 2020
Aligning Time Series on Incomparable Spaces
Samuel N. Cohen
Giulia Luise
Alexander Terenin
Brandon Amos
M. Deisenroth
OT
AI4TS
30
16
0
22 Jun 2020
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
42
397
0
22 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
18
0
0
16 Jun 2020
GO Hessian for Expectation-Based Objectives
Yulai Cong
Miaoyun Zhao
Jianqiao Li
Junya Chen
Lawrence Carin
32
0
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
33
82
0
15 Jun 2020
Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach
Guannan Qu
Chenkai Yu
S. Low
Adam Wierman
22
19
0
12 Jun 2020
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
T. Matsushima
Hiroki Furuta
Y. Matsuo
Ofir Nachum
S. Gu
OffRL
25
147
0
05 Jun 2020
Kernel Taylor-Based Value Function Approximation for Continuous-State Markov Decision Processes
Junhong Xu
Kai-Li Yin
Lantao Liu
OffRL
6
3
0
03 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
44
87
0
16 May 2020
Simple Sensor Intentions for Exploration
Tim Hertweck
Martin Riedmiller
Michael Bloesch
Jost Tobias Springenberg
Noah Y. Siegel
Markus Wulfmeier
Roland Hafner
N. Heess
27
10
0
15 May 2020
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
Continuous Multiagent Control using Collective Behavior Entropy for Large-Scale Home Energy Management
Jianwen Sun
Yan Zheng
Jianye Hao
Zhaopeng Meng
Yang Liu
21
14
0
14 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
343
1,968
0
04 May 2020
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Li Xia
Zhengyuan Zhou
Jun Yang
Qianchuan Zhao
37
17
0
30 Apr 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. D'Oro
Wojciech Jaśkowski
OffRL
30
27
0
29 Apr 2020
Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation
Xiaocong Chen
Chaoran Huang
Lina Yao
Xianzhi Wang
Wei Liu
Wenjie Zhang
20
35
0
17 Apr 2020
Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics
Amir H. Mosavi
Pedram Ghamisi
Yaser Faghan
Puhong Duan
OffRL
27
152
0
21 Mar 2020
Learning to Fly via Deep Model-Based Reinforcement Learning
Philip Becker-Ehmck
Maximilian Karl
Jan Peters
Patrick van der Smagt
SSL
41
37
0
19 Mar 2020
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts
Michael Gimelfarb
Scott Sanner
Chi-Guhn Lee
20
1
0
29 Feb 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
22
282
0
19 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
26
32
0
07 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids
Lei Lei
Yue Tan
Glenn Dahlenburg
W. Xiang
K. Zheng
24
68
0
07 Feb 2020
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Michael Neunert
A. Abdolmaleki
Markus Wulfmeier
Thomas Lampe
Jost Tobias Springenberg
Roland Hafner
Francesco Romano
J. Buchli
N. Heess
Martin Riedmiller
21
91
0
02 Jan 2020
The Gambler's Problem and Beyond
Baoxiang Wang
Shuai Li
Jiajin Li
S. Chan
16
0
0
31 Dec 2019
Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework
Wanxin Jin
Zhaoran Wang
Zhuoran Yang
Shaoshuai Mou
30
77
0
30 Dec 2019
Quasi-Newton Trust Region Policy Optimization
Devesh K. Jha
A. Raghunathan
Diego Romeres
25
8
0
26 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
38
34
0
23 Dec 2019
Centralized Cooperation for Connected and Automated Vehicles at Intersections by Proximal Policy Optimization
Yang Guan
Yangang Ren
Shengbo Eben Li
Qi Sun
Laiquan Luo
Keqiang Li
6
6
0
18 Dec 2019
Hindsight Credit Assignment
Anna Harutyunyan
Will Dabney
Thomas Mesnard
M. G. Azar
Bilal Piot
...
H. V. Hasselt
Greg Wayne
Satinder Singh
Doina Precup
Rémi Munos
27
72
0
05 Dec 2019
Visual Reaction: Learning to Play Catch with Your Drone
Kuo-Hao Zeng
Roozbeh Mottaghi
Luca Weihs
Ali Farhadi
29
14
0
04 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
63
1,313
0
03 Dec 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
23
64
0
26 Nov 2019
From Persistent Homology to Reinforcement Learning with Applications for Retail Banking
Jérémy Charlier
16
1
0
23 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
31
2
0
20 Nov 2019
Planning with Goal-Conditioned Policies
Soroush Nasiriany
Vitchyr H. Pong
Steven Lin
Sergey Levine
OffRL
81
216
0
19 Nov 2019
Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient
K. Luck
Mel Vecerík
Simon Stepputtis
H. B. Amor
Jonathan Scholz
22
9
0
15 Nov 2019
Real-Time Reinforcement Learning
Simon Ramstedt
C. Pal
AI4CE
19
62
0
11 Nov 2019
Better Exploration with Optimistic Actor-Critic
K. Ciosek
Q. Vuong
R. Loftin
Katja Hofmann
29
149
0
28 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
19
27
0
28 Oct 2019
OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research
Ashish Kumar
Toby Buckley
John B. Lanier
Qiaozhi Wang
A. Kavelaars
Ilya Kuzovkin
OffRL
19
14
0
18 Oct 2019
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
17
41
0
09 Oct 2019
If MaxEnt RL is the Answer, What is the Question?
Benjamin Eysenbach
Sergey Levine
33
58
0
04 Oct 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
37
121
0
26 Sep 2019
Constrained Attractor Selection Using Deep Reinforcement Learning
Xue-She Wang
J. Turner
B. Mann
13
35
0
23 Sep 2019
Gradient-Aware Model-based Policy Search
P. D'Oro
Alberto Maria Metelli
Andrea Tirinzoni
Matteo Papini
Marcello Restelli
29
34
0
09 Sep 2019