ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXivPDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 3,098 papers shown
Title
A Deep Reinforcement Learning Approach for Dynamically Stable Inverse
  Kinematics of Humanoid Robots
A Deep Reinforcement Learning Approach for Dynamically Stable Inverse Kinematics of Humanoid Robots
S. Teja
Parijat Dewangan
P. Guhan
Abhishek Sarkar
K Madhava Krishna
25
55
0
31 Jan 2018
Constraint Estimation and Derivative-Free Recovery for Robot Learning
  from Demonstrations
Constraint Estimation and Derivative-Free Recovery for Robot Learning from Demonstrations
Jonathan Lee
Michael Laskey
Roy Fox
Ken Goldberg
13
4
0
31 Jan 2018
Understanding Human Behaviors in Crowds by Imitating the Decision-Making
  Process
Understanding Human Behaviors in Crowds by Imitating the Decision-Making Process
Haosheng Zou
Hang Su
Shihong Song
Jun Zhu
27
48
0
25 Jan 2018
Learning Symmetric and Low-energy Locomotion
Learning Symmetric and Low-energy Locomotion
Wenhao Yu
Greg Turk
Chenxi Liu
40
183
0
24 Jan 2018
An Empirical Analysis of Proximal Policy Optimization with
  Kronecker-factored Natural Gradients
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients
Jiaming Song
Yuhuai Wu
35
2
0
17 Jan 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic
  Regulator
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
35
597
0
15 Jan 2018
Autonomous Driving in Reality with Reinforcement Learning and Image
  Translation
Autonomous Driving in Reality with Reinforcement Learning and Image Translation
N. Xu
Bowen Tan
Bingyu Kong
16
36
0
13 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
38
0
0
11 Jan 2018
Expected Policy Gradients for Reinforcement Learning
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
50
51
0
10 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games
  in 21 minutes
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
41
37
0
09 Jan 2018
Jointly Learning to Construct and Control Agents using Deep
  Reinforcement Learning
Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
Charles B. Schaff
David Yunis
Ayan Chakrabarti
Matthew R. Walter
18
100
0
04 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
28
8,170
0
04 Jan 2018
DeepMind Control Suite
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
56
1,105
0
02 Jan 2018
f-Divergence constrained policy improvement
f-Divergence constrained policy improvement
Boris Belousov
Jan Peters
13
19
0
29 Dec 2017
SBEED: Convergent Reinforcement Learning with Nonlinear Function
  Approximation
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
34
25
0
29 Dec 2017
Boosting the Actor with Dual Critic
Boosting the Actor with Dual Critic
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
35
46
0
29 Dec 2017
RLlib: Abstractions for Distributed Reinforcement Learning
RLlib: Abstractions for Distributed Reinforcement Learning
Eric Liang
Richard Liaw
Philipp Moritz
Robert Nishihara
Roy Fox
Ken Goldberg
Joseph E. Gonzalez
Michael I. Jordan
Ion Stoica
OffRL
AI4CE
31
173
0
26 Dec 2017
A short variational proof of equivalence between policy gradients and
  soft Q learning
A short variational proof of equivalence between policy gradients and soft Q learning
Pierre Harvey Richemond
B. Maginnis
16
5
0
22 Dec 2017
Least-Squares Temporal Difference Learning for the Linear Quadratic
  Regulator
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
37
130
0
22 Dec 2017
On Wasserstein Reinforcement Learning and the Fokker-Planck equation
On Wasserstein Reinforcement Learning and the Fokker-Planck equation
Pierre Harvey Richemond
B. Maginnis
22
23
0
19 Dec 2017
Safe Policy Improvement with Baseline Bootstrapping
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche
P. Trichelair
Rémi Tachet des Combes
OffRL
30
198
0
19 Dec 2017
ES Is More Than Just a Traditional Finite-Difference Approximator
ES Is More Than Just a Traditional Finite-Difference Approximator
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
25
89
0
18 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative
  for Training Deep Neural Networks for Reinforcement Learning
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
686
0
18 Dec 2017
Safe Mutations for Deep and Recurrent Neural Networks through Output
  Gradients
Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
25
93
0
18 Dec 2017
Improving Exploration in Evolution Strategies for Deep Reinforcement
  Learning via a Population of Novelty-Seeking Agents
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
27
342
0
18 Dec 2017
A Berkeley View of Systems Challenges for AI
A Berkeley View of Systems Challenges for AI
Ion Stoica
D. Song
Raluca A. Popa
D. Patterson
Michael W. Mahoney
...
Joseph E. Gonzalez
Ken Goldberg
A. Ghodsi
David Culler
Pieter Abbeel
32
199
0
15 Dec 2017
Safe Policy Search with Gaussian Process Models
Safe Policy Search with Gaussian Process Models
Kyriakos Polymenakos
Alessandro Abate
Stephen J. Roberts
27
4
0
15 Dec 2017
Robust Deep Reinforcement Learning with Adversarial Attacks
Robust Deep Reinforcement Learning with Adversarial Attacks
Anay Pattanaik
Zhenyi Tang
Shuijing Liu
Gautham Bommannan
Girish Chowdhary
OOD
18
303
0
11 Dec 2017
Noisy Natural Gradient as Variational Inference
Noisy Natural Gradient as Variational Inference
Guodong Zhang
Shengyang Sun
David Duvenaud
Roger C. Grosse
ODL
30
210
0
06 Dec 2017
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Peter Henderson
T. Doan
Riashat Islam
David Meger
BDL
20
13
0
06 Dec 2017
A Deeper Look at Experience Replay
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRL
VLM
47
269
0
04 Dec 2017
Variational Deep Q Network
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
38
10
0
30 Nov 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in
  Continuous Control
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
42
10
0
30 Nov 2017
Learnings Options End-to-End for Continuous Action Tasks
Learnings Options End-to-End for Continuous Action Tasks
Martin Klissarov
Pierre-Luc Bacon
J. Harb
Doina Precup
13
54
0
30 Nov 2017
Hierarchical Policy Search via Return-Weighted Density Estimation
Hierarchical Policy Search via Return-Weighted Density Estimation
Takayuki Osa
Masashi Sugiyama
21
15
0
28 Nov 2017
Variational Inference for Gaussian Process Models with Linear Complexity
Variational Inference for Gaussian Process Models with Linear Complexity
Ching-An Cheng
Byron Boots
BDL
24
75
0
28 Nov 2017
Divide-and-Conquer Reinforcement Learning
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
45
125
0
27 Nov 2017
Cascade Attribute Learning Network
Cascade Attribute Learning Network
Zhuo Xu
Haonan Chang
Masayoshi Tomizuka
33
4
0
24 Nov 2017
Deterministic Policy Optimization by Combining Pathwise and Score
  Function Estimators for Discrete Action Spaces
Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces
Daniel Levy
Stefano Ermon
29
4
0
21 Nov 2017
Run, skeleton, run: skeletal model in a physics-based simulation
Run, skeleton, run: skeletal model in a physics-based simulation
Mikhail Pavlov
Sergey Kolesnikov
Sergey Plis
AI4CE
26
14
0
18 Nov 2017
How Generative Adversarial Networks and Their Variants Work: An Overview
How Generative Adversarial Networks and Their Variants Work: An Overview
Yongjun Hong
Uiwon Hwang
Jaeyoon Yoo
Sungroh Yoon
GAN
41
153
0
16 Nov 2017
Advances in Variational Inference
Advances in Variational Inference
Cheng Zhang
Judith Butepage
Hedvig Kjellström
Stephan Mandt
BDL
40
684
0
15 Nov 2017
Learning Image-Conditioned Dynamics Models for Control of Under-actuated
  Legged Millirobots
Learning Image-Conditioned Dynamics Models for Control of Under-actuated Legged Millirobots
Anusha Nagabandi
Guangzhao Yang
T. Asmar
Ravi Pandya
G. Kahn
Sergey Levine
R. Fearing
AI4CE
8
22
0
14 Nov 2017
Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep
  Reinforcement Learning
Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning
Richard Liaw
S. Krishnan
Animesh Garg
D. Crankshaw
Joseph E. Gonzalez
Ken Goldberg
BDL
32
23
0
04 Nov 2017
Policy Optimization by Genetic Distillation
Policy Optimization by Genetic Distillation
Tanmay Gangwani
Jian-wei Peng
19
18
0
03 Nov 2017
Regret Minimization for Partially Observable Deep Reinforcement Learning
Regret Minimization for Partially Observable Deep Reinforcement Learning
Peter H. Jin
Kurt Keutzer
Sergey Levine
29
51
0
31 Oct 2017
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
53
740
0
30 Oct 2017
Action-depedent Control Variates for Policy Optimization via Stein's
  Identity
Action-depedent Control Variates for Policy Optimization via Stein's Identity
Hao Liu
Yihao Feng
Yi Mao
Dengyong Zhou
Jian-wei Peng
Qiang Liu
35
4
0
30 Oct 2017
Meta Learning Shared Hierarchies
Meta Learning Shared Hierarchies
Kevin Frans
Jonathan Ho
Xi Chen
Pieter Abbeel
John Schulman
28
349
0
26 Oct 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy
  Search
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
31
14
0
24 Oct 2017
Previous
123...575859606162
Next