Trust Region Policy Optimization


19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel

Papers citing "Trust Region Policy Optimization"

50 / 3,098 papers shown
Nonsmooth optimal value and policy functions in mechanical systems subject to unilateral constraints
Bora S. Banjanin
Samuel A. Burden
11
0
0
18 Oct 2017
Asymmetric Actor Critic for Image-Based Robot Learning
Lerrel Pinto
Marcin Andrychowicz
Peter Welinder
Wojciech Zaremba
Pieter Abbeel
OffRL
15
364
0
18 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning
A. Kume
Eiichi Matsumoto
K. Takahashi
W. Ko
Jethro Tan
35
11
0
17 Oct 2017
Stochastic Variance Reduction for Policy Gradient Estimation
Tianbing Xu
Qiang Liu
Jian-wei Peng
16
19
0
17 Oct 2017
Flow: A Modular Learning Framework for Mixed Autonomy Traffic
Cathy Wu
Abdul Rahman Kreidieh
Kanaad Parvate
Eugene Vinitsky
Alexandre M. Bayen
15
153
0
16 Oct 2017
Burn-In Demonstrations for Multi-Modal Imitation Learning
Alex Kuefler
Mykel J. Kochenderfer
29
24
0
13 Oct 2017
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation
Tianhao Zhang
Zoe McCarthy
Owen Jow
Dennis Lee
Xi Chen
Ken Goldberg
Pieter Abbeel
SSL
27
646
0
12 Oct 2017
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action Control
Seungyul Han
Y. Sung
OffRL
16
8
0
12 Oct 2017
Emergent Complexity via Multi-Agent Competition
Trapit Bansal
J. Pachocki
Szymon Sidor
Ilya Sutskever
Igor Mordatch
28
384
0
10 Oct 2017
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
Maruan Al-Shedivat
Trapit Bansal
Yuri Burda
Ilya Sutskever
Igor Mordatch
Pieter Abbeel
CLL
17
353
0
10 Oct 2017
On- and Off-Policy Monotonic Policy Improvement
R. Iwaki
Minoru Asada
OffRL
22
0
0
10 Oct 2017
Socially Compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning
L. Tai
Jingwei Zhang
Ming-Yu Liu
Wolfram Burgard
GAN
25
176
0
06 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
63
2,237
0
06 Oct 2017
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight
Yen-Chen Lin
Ming-Yu Liu
Min Sun
Jia-Bin Huang
AAML
29
48
0
02 Oct 2017
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning
Xiangxiang Chu
Hangjun Ye
38
56
0
01 Oct 2017
Learning a Structured Neural Network Policy for a Hopping Task
Julian Viereck
Jules Kozolinsky
Alexander Herzog
Ludovic Righetti
35
12
0
29 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
56
771
0
28 Sep 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
74
1,071
0
28 Sep 2017
Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
Pinxin Long
Tingxiang Fan
X. Liao
Wenxi Liu
Huatian Zhang
Jia Pan
OOD
19
452
0
28 Sep 2017
Predictive-State Decoders: Encoding the Future into Recurrent Networks
Arun Venkatraman
Nicholas Rhinehart
Wen Sun
Lerrel Pinto
M. Hebert
Byron Boots
Kris Kitani
J. Andrew Bagnell
AI4CE
21
42
0
25 Sep 2017
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
Siyi Li
Tianbo Liu
Chi Zhang
Dit-Yan Yeung
Shaojie Shen
6
37
0
24 Sep 2017
Multi-task Learning with Gradient Guided Policy Specialization
Wenhao Yu
Chenxi Liu
Greg Turk
26
2
0
23 Sep 2017
OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World
Tu-Hoa Pham
Giovanni De Magistris
Ryuki Tachibana
OffRL
12
140
0
22 Sep 2017
Neural Optimizer Search with Reinforcement Learning
Irwan Bello
Barret Zoph
Vijay Vasudevan
Quoc V. Le
ODL
29
383
0
21 Sep 2017
Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
11
3
0
21 Sep 2017
Learning Human Behaviors for Robot-Assisted Dressing
Alexander Clegg
Wenhao Yu
Jie Tan
Charles C. Kemp
Greg Turk
Chenxi Liu
21
3
0
20 Sep 2017
Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks
Aditya Gudimella
Ross Story
M. Shaker
Ruofan Kong
Matthew A. Brown
Victor Shnayder
Marcos Campos
34
24
0
20 Sep 2017
Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search
Rémi Pautrat
Konstantinos Chatzilygeroudis
Jean-Baptiste Mouret
23
44
0
20 Sep 2017
Using Parameterized Black-Box Priors to Scale Up Model-Based Policy Search for Robotics
Konstantinos Chatzilygeroudis
Jean-Baptiste Mouret
15
45
0
20 Sep 2017
Transfer learning from synthetic to real images using variational autoencoders for robotic applications
Tadanobu Inoue
Subhajit Chaudhury
Giovanni De Magistris
Sakyasingha Dasgupta
26
19
0
20 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson
Wei-Di Chang
Pierre-Luc Bacon
David Meger
Joelle Pineau
Doina Precup
GAN
14
72
0
20 Sep 2017
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes
Kunal Menda
Yi-Chun Chen
J. Grana
J. Bono
Brendan D. Tracey
Mykel J. Kochenderfer
David Wolpert
19
48
0
19 Sep 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
74
1,932
0
19 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
32
28
0
18 Sep 2017
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
Yuanlong Li
Yonggang Wen
K. Guan
Dacheng Tao
AI4CE
21
174
0
15 Sep 2017
Shapechanger: Environments for Transfer Learning
Sébastien M. R. Arnold
Tsam Kiu Pun
Théo-Tim J. Denisart
Francisco J. Valero Cuevas
3DPC
LM&Ro
19
0
0
15 Sep 2017
Automated Cloud Provisioning on AWS using Deep Reinforcement Learning
Zhiguang Wang
C. Gwon
Tim Oates
A. Iezzi
24
23
0
13 Sep 2017
Mirror Descent Search and its Acceleration
Megumi Miyashita
S. Yano
T. Kondo
16
7
0
08 Sep 2017
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning
Simyung Chang
Y. Yoo
Jaeseok Choi
Nojun Kwak
OffRL
11
1
0
05 Sep 2017
Uncertainty-Aware Learning from Demonstration using Mixture Density Networks with Sampling-Free Variance Modeling
Sungjoon Choi
Kyungjae Lee
Sungbin Lim
Songhwai Oh
29
97
0
03 Sep 2017
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
25
44
0
01 Sep 2017
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
24
206
0
25 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
22
622
0
17 Aug 2017
Benchmark Environments for Multitask Learning in Continuous Domains
Peter Henderson
Wei-Di Chang
Florian Shkurti
Johanna Hansen
David Meger
Gregory Dudek
14
40
0
14 Aug 2017
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
BDL
OffRL
13
251
0
10 Aug 2017
A Machine Learning Approach to Routing
Asaf Valadarsky
Michael Schapira
Dafna Shahaf
Aviv Tamar
20
38
0
10 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
28
965
0
08 Aug 2017
GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images
Avi Singh
Larry Yang
Sergey Levine
22
23
0
07 Aug 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
38
24
0
06 Aug 2017