ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization
v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 2,008 papers shown
Title
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
Siyi Li
Tianbo Liu
Fangqiu Yi
Dit-Yan Yeung
Shaojie Shen
53
38
0
24 Sep 2017
OptLayer - Practical Constrained Optimization for Deep Reinforcement
  Learning in the Real World
OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World
Tu-Hoa Pham
Giovanni De Magistris
Ryuki Tachibana
OffRL
78
143
0
22 Sep 2017
Neural Optimizer Search with Reinforcement Learning
Neural Optimizer Search with Reinforcement Learning
Irwan Bello
Barret Zoph
Vijay Vasudevan
Quoc V. Le
ODL
92
387
0
21 Sep 2017
Deep Reinforcement Learning for Dexterous Manipulation with Concept
  Networks
Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks
Aditya Gudimella
Ross Story
M. Shaker
Ruofan Kong
Matthew A. Brown
Victor Shnayder
Marcos Campos
53
24
0
20 Sep 2017
Bayesian Optimization with Automatic Prior Selection for Data-Efficient
  Direct Policy Search
Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search
Rémi Pautrat
Konstantinos Chatzilygeroudis
Jean-Baptiste Mouret
98
44
0
20 Sep 2017
Transfer learning from synthetic to real images using variational
  autoencoders for robotic applications
Transfer learning from synthetic to real images using variational autoencoders for robotic applications
Tadanobu Inoue
Subhajit Chaudhury
Giovanni De Magistris
Sakyasingha Dasgupta
54
19
0
20 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative
  Adversarial Inverse Reinforcement Learning
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson
Wei-Di Chang
Pierre-Luc Bacon
David Meger
Joelle Pineau
Doina Precup
GAN
86
73
0
20 Sep 2017
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision
  Processes
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes
Kunal Menda
Yi-Chun Chen
J. Grana
J. Bono
Brendan D. Tracey
Mykel J. Kochenderfer
David Wolpert
91
48
0
19 Sep 2017
Deep Reinforcement Learning that Matters
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
164
1,973
0
19 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
115
28
0
18 Sep 2017
Transforming Cooling Optimization for Green Data Center via Deep
  Reinforcement Learning
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
Yuanlong Li
Yonggang Wen
K. Guan
Dacheng Tao
AI4CE
88
181
0
15 Sep 2017
Automated Cloud Provisioning on AWS using Deep Reinforcement Learning
Automated Cloud Provisioning on AWS using Deep Reinforcement Learning
Zhiguang Wang
C. Gwon
Tim Oates
A. Iezzi
49
23
0
13 Sep 2017
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement
  Learning
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning
Simyung Chang
Y. Yoo
Jaeseok Choi
Nojun Kwak
OffRL
15
1
0
05 Sep 2017
Uncertainty-Aware Learning from Demonstration using Mixture Density
  Networks with Sampling-Free Variance Modeling
Uncertainty-Aware Learning from Demonstration using Mixture Density Networks with Sampling-Free Variance Modeling
Sungjoon Choi
Kyungjae Lee
Sungbin Lim
Songhwai Oh
95
98
0
03 Sep 2017
Mean Actor Critic
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
92
45
0
01 Sep 2017
Deep Learning for Video Game Playing
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
101
208
0
25 Aug 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
185
2,830
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using
  Kronecker-factored approximation
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
155
631
0
17 Aug 2017
Benchmark Environments for Multitask Learning in Continuous Domains
Benchmark Environments for Multitask Learning in Continuous Domains
Peter Henderson
Wei-Di Chang
Florian Shkurti
Johanna Hansen
David Meger
Gregory Dudek
72
40
0
14 Aug 2017
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for
  Continuous Control
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
BDLOffRL
106
253
0
10 Aug 2017
A Machine Learning Approach to Routing
A Machine Learning Approach to Routing
Asaf Valadarsky
Michael Schapira
Dafna Shahaf
Aviv Tamar
71
38
0
10 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with
  Model-Free Fine-Tuning
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
167
977
0
08 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled
  Images
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images
Avi Singh
Larry Yang
Sergey Levine
61
23
0
07 Aug 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement
  Learning
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
116
24
0
06 Aug 2017
CASSL: Curriculum Accelerated Self-Supervised Learning
CASSL: Curriculum Accelerated Self-Supervised Learning
Adithyavairavan Murali
Lerrel Pinto
Dhiraj Gandhi
Abhinav Gupta
SSL
73
35
0
04 Aug 2017
Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
Zhenguo Li
Fengwei Zhou
Fei Chen
Hang Li
139
1,127
0
31 Jul 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics
  Problems with Sparse Rewards
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Matej Vecerík
Todd Hester
Jonathan Scholz
Fumin Wang
Olivier Pietquin
Bilal Piot
N. Heess
Thomas Rothörl
Thomas Lampe
Martin Riedmiller
OffRL
130
671
0
27 Jul 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
176
417
0
26 Jul 2017
Mutual Alignment Transfer Learning
Mutual Alignment Transfer Learning
Markus Wulfmeier
Ingmar Posner
Pieter Abbeel
156
61
0
25 Jul 2017
Learning Transferable Architectures for Scalable Image Recognition
Learning Transferable Architectures for Scalable Image Recognition
Barret Zoph
Vijay Vasudevan
Jonathon Shlens
Quoc V. Le
298
5,623
0
21 Jul 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
709
19,377
0
20 Jul 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
124
557
0
19 Jul 2017
Reverse Curriculum Generation for Reinforcement Learning
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
126
444
0
17 Jul 2017
Control of a Quadrotor with Reinforcement Learning
Control of a Quadrotor with Reinforcement Learning
Jemin Hwangbo
Inkyu Sa
Roland Siegwart
Marco Hutter
96
482
0
17 Jul 2017
Efficient Architecture Search by Network Transformation
Efficient Architecture Search by Network Transformation
Han Cai
Tianyao Chen
Weinan Zhang
Yong Yu
Jun Wang
OOD3DV
99
67
0
16 Jul 2017
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical
  Systems
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems
James Harrison
Animesh Garg
Boris Ivanovic
Yuke Zhu
Silvio Savarese
Li Fei-Fei
Marco Pavone
83
25
0
15 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
228
554
0
13 Jul 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video
  via Context Translation
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
YuXuan Liu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
144
380
0
11 Jul 2017
A Simple Neural Attentive Meta-Learner
A Simple Neural Attentive Meta-Learner
Nikhil Mishra
Mostafa Rohaninejad
Xi Chen
Pieter Abbeel
OOD
109
200
0
11 Jul 2017
Learning Heuristic Search via Imitation
Learning Heuristic Search via Imitation
M. Bhardwaj
Sanjiban Choudhury
Sebastian Scherer
71
83
0
10 Jul 2017
Robust Imitation of Diverse Behaviors
Robust Imitation of Diverse Behaviors
Ziyun Wang
J. Merel
Scott E. Reed
Greg Wayne
Nando de Freitas
N. Heess
121
198
0
10 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
239
940
0
07 Jul 2017
Learning human behaviors from motion capture by adversarial imitation
Learning human behaviors from motion capture by adversarial imitation
J. Merel
Yuval Tassa
TB Dhruva
S. Srinivasan
Jay Lemmon
Ziyun Wang
Greg Wayne
N. Heess
GAN
88
202
0
07 Jul 2017
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
95
107
0
06 Jul 2017
ELF: An Extensive, Lightweight and Flexible Research Platform for
  Real-time Strategy Games
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games
Yuandong Tian
Qucheng Gong
Wenling Shang
Yuxin Wu
C. L. Zitnick
OffRL
76
126
0
04 Jul 2017
Teacher-Student Curriculum Learning
Teacher-Student Curriculum Learning
Tambet Matiisen
Avital Oliver
Taco S. Cohen
John Schulman
ODL
133
383
0
01 Jul 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised
  Data for Dialogue Management
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
143
130
0
01 Jul 2017
Noisy Networks for Exploration
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
121
898
0
30 Jun 2017
Path Integral Networks: End-to-End Differentiable Optimal Control
Path Integral Networks: End-to-End Differentiable Optimal Control
Masashi Okada
Luca Rigazio
T. Aoshima
PINN
73
56
0
29 Jun 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
76
124
0
25 Jun 2017
Previous
123...3738394041
Next