ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXivPDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 3,098 papers shown
Title
Hindsight Generative Adversarial Imitation Learning
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
27
6
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
19
45
0
18 Mar 2019
Adaptive Variance for Changing Sparse-Reward Environments
Adaptive Variance for Changing Sparse-Reward Environments
Xingyu Lin
Pengsheng Guo
Carlos Florensa
David Held
33
6
0
15 Mar 2019
ROS2Learn: a reinforcement learning framework for ROS 2
ROS2Learn: a reinforcement learning framework for ROS 2
Y. Nuin
N. G. Lopez
Elias Barba Moral
Lander Usategui San Juan
A. Rueda
Víctor Mayoral-Vilches
R. Kojcev
OffRL
8
9
0
14 Mar 2019
Simulating Emergent Properties of Human Driving Behavior Using
  Multi-Agent Reward Augmented Imitation Learning
Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning
Raunak P. Bhattacharyya
Derek J. Phillips
Changliu Liu
Jayesh K. Gupta
Katherine Driggs-Campbell
Mykel J. Kochenderfer
AI4CE
16
54
0
14 Mar 2019
Trajectory Optimization for Unknown Constrained Systems using
  Reinforcement Learning
Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning
Keita Ota
Devesh K. Jha
Tomoaki Oiki
Mamoru Miura
Takashi Nammoto
D. Nikovski
T. Mariyama
35
26
0
13 Mar 2019
Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy
Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
14
0
0
13 Mar 2019
Task-oriented Design through Deep Reinforcement Learning
Task-oriented Design through Deep Reinforcement Learning
Junyoung Choi
Minsung Hyun
Nojun Kwak
3DV
4
2
0
13 Mar 2019
On the Pitfalls of Measuring Emergent Communication
On the Pitfalls of Measuring Emergent Communication
Ryan J. Lowe
Jakob N. Foerster
Y-Lan Boureau
Joelle Pineau
Yann N. Dauphin
28
131
0
12 Mar 2019
Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially
  Observable Markov Decision Process with Minecraft
Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft
Clément Romac
Vincent Béraud
21
5
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy
  Critics
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
26
17
0
11 Mar 2019
Orthogonal Estimation of Wasserstein Distances
Orthogonal Estimation of Wasserstein Distances
Mark Rowland
Jiri Hron
Yunhao Tang
K. Choromanski
Tamás Sarlós
Adrian Weller
36
43
0
09 Mar 2019
Improved Robustness and Safety for Autonomous Vehicle Control with
  Adversarial Reinforcement Learning
Improved Robustness and Safety for Autonomous Vehicle Control with Adversarial Reinforcement Learning
Xiaobai Ma
Katherine Driggs-Campbell
Mykel J. Kochenderfer
AAML
29
48
0
08 Mar 2019
Dyna-AIL : Adversarial Imitation Learning by Planning
Dyna-AIL : Adversarial Imitation Learning by Planning
Vaibhav Saxena
Srinivasan Sivanandan
Pulkit Mathur
27
1
0
08 Mar 2019
From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox
  Optimization
From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
20
1
0
07 Mar 2019
Provably Robust Blackbox Optimization for Reinforcement Learning
Provably Robust Blackbox Optimization for Reinforcement Learning
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
Deepali Jain
Yuxiang Yang
Atil Iscen
Jasmine Hsu
Vikas Sindhwani
18
5
0
07 Mar 2019
Training in Task Space to Speed Up and Guide Reinforcement Learning
Training in Task Space to Speed Up and Guide Reinforcement Learning
Guillaume Bellegarda
Katie Byl
18
19
0
06 Mar 2019
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Florian Richter
Ryan K. Orosco
Michael C. Yip
OffRL
27
79
0
05 Mar 2019
Learning Dynamics Model in Reinforcement Learning by Incorporating the
  Long Term Future
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future
Nan Rosemary Ke
Amanpreet Singh
Ahmed Touati
Anirudh Goyal
Yoshua Bengio
Devi Parikh
Dhruv Batra
35
48
0
05 Mar 2019
Sim-to-Real Transfer for Biped Locomotion
Sim-to-Real Transfer for Biped Locomotion
Wenhao Yu
Visak C. V. Kumar
Greg Turk
Chenxi Liu
12
114
0
04 Mar 2019
Microscopic Traffic Simulation by Cooperative Multi-agent Deep
  Reinforcement Learning
Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning
Giulio Bacchiani
Daniele Molinari
Marco Patander
24
22
0
04 Mar 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Zhou Fan
Ruilong Su
Weinan Zhang
Yong Yu
19
133
0
04 Mar 2019
Reinforcement Learning on Variable Impedance Controller for
  High-Precision Robotic Assembly
Reinforcement Learning on Variable Impedance Controller for High-Precision Robotic Assembly
Jianlan Luo
Eugen Solowjow
Chengtao Wen
J. A. Ojea
A. Agogino
Aviv Tamar
Pieter Abbeel
29
169
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards
  Continuous Control in Computationally Complex Environments
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
30
60
0
03 Mar 2019
A Regularized Approach to Sparse Optimal Policy in Reinforcement
  Learning
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Xiang Li
Wenhao Yang
Zhihua Zhang
6
2
0
02 Mar 2019
Design of intentional backdoors in sequential models
Design of intentional backdoors in sequential models
Zhaoyuan Yang
N. Iyer
Johan Reimann
Nurali Virani
SILM
AAML
25
38
0
26 Feb 2019
Distributionally Robust Reinforcement Learning
Distributionally Robust Reinforcement Learning
E. Smirnova
Elvis Dohmatob
Jérémie Mary
OffRL
29
58
0
23 Feb 2019
World Discovery Models
World Discovery Models
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Jean-Bastien Grill
Florent Altché
Rémi Munos
23
26
0
20 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
22
48
0
19 Feb 2019
Sufficiently Accurate Model Learning
Sufficiently Accurate Model Learning
Clark Zhang
Arbaaz Khan
Santiago Paternain
Alejandro Ribeiro
31
3
0
19 Feb 2019
DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching
DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching
Fereshteh Sadeghi
13
28
0
18 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
22
32
0
18 Feb 2019
Verifiably Safe Off-Model Reinforcement Learning
Verifiably Safe Off-Model Reinforcement Learning
Nathan Fulton
André Platzer
OffRL
14
66
0
14 Feb 2019
Learn a Prior for RHEA for Better Online Planning
Learn a Prior for RHEA for Better Online Planning
Xinyao Tong
W. Liu
Bin Li
OffRL
43
0
0
14 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
19
9
0
14 Feb 2019
Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General
  Entropy and Effective Environment Exploration in Deep Reinforcement Learning
Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning
Gang Chen
Yiming Peng
19
8
0
14 Feb 2019
Manipulating Soft Tissues by Deep Reinforcement Learning for Autonomous
  Robotic Surgery
Manipulating Soft Tissues by Deep Reinforcement Learning for Autonomous Robotic Surgery
Ngoc Duy Nguyen
Thanh Nguyen
S. Nahavandi
Asim Bhatti
Glenn Guest
21
41
0
14 Feb 2019
Value constrained model-free continuous control
Value constrained model-free continuous control
Steven Bohez
A. Abdolmaleki
Michael Neunert
J. Buchli
N. Heess
R. Hadsell
24
62
0
12 Feb 2019
A Bandit Framework for Optimal Selection of Reinforcement Learning
  Agents
A Bandit Framework for Optimal Selection of Reinforcement Learning Agents
A. Merentitis
Kashif Rasul
Roland Vollgraf
Abdul-Saboor Sheikh
Urs M. Bergmann
27
2
0
10 Feb 2019
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Andrew Cohen
Xingye Qiao
Lei Yu
E. Way
Xiangrong Tong
16
9
0
10 Feb 2019
Meta-Curvature
Meta-Curvature
Eunbyung Park
Junier B. Oliva
BDL
21
123
0
09 Feb 2019
Compatible Natural Gradient Policy Search
Compatible Natural Gradient Policy Search
Joni Pajarinen
Hong Linh Thai
R. Akrour
Jan Peters
Gerhard Neumann
16
21
0
07 Feb 2019
Virtual Training for a Real Application: Accurate Object-Robot Relative
  Localization without Calibration
Virtual Training for a Real Application: Accurate Object-Robot Relative Localization without Calibration
Vianney Loing
Renaud Marlet
Mathieu Aubry
29
23
0
07 Feb 2019
Cost-Effective Incentive Allocation via Structured Counterfactual
  Inference
Cost-Effective Incentive Allocation via Structured Counterfactual Inference
Romain Lopez
Chenchen Li
X. Yan
Junwu Xiong
Michael I. Jordan
Yuan Qi
Le Song
OffRL
11
15
0
07 Feb 2019
Artificial Intelligence for Prosthetics - challenge solutions
Artificial Intelligence for Prosthetics - challenge solutions
L. Kidzinski
Carmichael F. Ong
Sharada Mohanty
Jennifer Hicks
Sean F. Carroll
...
E. Tumer
J. Watson
M. Salathé
Sergey Levine
Scott L. Delp
15
40
0
07 Feb 2019
Distilling Policy Distillation
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
24
132
0
06 Feb 2019
Neural Fictitious Self-Play on ELF Mini-RTS
Neural Fictitious Self-Play on ELF Mini-RTS
Keigo Kawamura
Yoshimasa Tsuruoka
34
7
0
06 Feb 2019
Adaptive Stress Testing for Autonomous Vehicles
Adaptive Stress Testing for Autonomous Vehicles
Mark Koren
Saud Alsaif
Ritchie Lee
Mykel J. Kochenderfer
13
190
0
05 Feb 2019
Visual Rationalizations in Deep Reinforcement Learning for Atari Games
Visual Rationalizations in Deep Reinforcement Learning for Atari Games
L. Weitkamp
Elise van der Pol
Zeynep Akata
14
27
0
01 Feb 2019
Competitive Experience Replay
Competitive Experience Replay
Hao Liu
Alexander R. Trott
R. Socher
Caiming Xiong
OffRL
39
52
0
01 Feb 2019
Previous
123...505152...606162
Next