ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization
v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 2,023 papers shown
Title
Learning Whole-body Motor Skills for Humanoids
Learning Whole-body Motor Skills for Humanoids
Chuanyu Yang
Kai Yuan
W. Merkt
Taku Komura
S. Vijayakumar
Zhibin Li
105
38
0
07 Feb 2020
Accelerating Reinforcement Learning for Reaching using Continuous
  Curriculum Learning
Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning
Sha Luo
Hamidreza Kasaei
Lambert Schomaker
CLL
95
46
0
07 Feb 2020
Ready Policy One: World Building Through Active Learning
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
99
49
0
07 Feb 2020
Automated Lane Change Strategy using Proximal Policy Optimization-based
  Deep Reinforcement Learning
Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning
Fei Ye
Xuxin Cheng
Pin Wang
Ching-yao Chan
Jiucai Zhang
42
100
0
07 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in
  IoT-Driven Smart Isolated Microgrids
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids
Lei Lei
Yue Tan
Glenn Dahlenburg
W. Xiang
K. Zheng
76
71
0
07 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
146
165
0
03 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
369
1,710
0
02 Feb 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement
  Learning
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
53
4
0
01 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV
  based Random Access IoT Networks with NOMA
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA
Sami Khairy
Prasanna Balaprakash
L. Cai
Y. Cheng
31
73
0
31 Jan 2020
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
Georgios Papoudakis
Stefano V. Albrecht
BDLDRL
64
29
0
29 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep
  Reinforcement Learning
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
152
137
0
27 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep
  Reinforcement Learning
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
158
246
0
23 Jan 2020
Expected Information Maximization: Using the I-Projection for Mixture
  Density Estimation
Expected Information Maximization: Using the I-Projection for Mixture Density Estimation
P. Becker
Oleg Arenz
Gerhard Neumann
56
16
0
23 Jan 2020
Q-Learning in enormous action spaces via amortized approximate
  maximization
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
78
60
0
22 Jan 2020
Lyceum: An efficient and scalable ecosystem for robot learning
Lyceum: An efficient and scalable ecosystem for robot learning
Colin Summers
Kendall Lowrey
Aravind Rajeswaran
S. Srinivasa
E. Todorov
88
19
0
21 Jan 2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
Changyou Chen
Zhe Gan
Zheng Wen
Wenlin Wang
Lawrence Carin
81
7
0
20 Jan 2020
Reinforcement Learning with Probabilistically Complete Exploration
Reinforcement Learning with Probabilistically Complete Exploration
Philippe Morere
Gilad Francis
Tom Blau
Fabio Ramos
OffRL
43
7
0
20 Jan 2020
Memristor Hardware-Friendly Reinforcement Learning
Memristor Hardware-Friendly Reinforcement Learning
Nan Wu
A. Vincent
D. Strukov
Yuan Xie
34
1
0
20 Jan 2020
A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions
A. Mondal
OffRL
53
8
0
19 Jan 2020
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using
  Human Feedback
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback
Baicen Xiao
Qifan Lu
Bhaskar Ramasubramanian
Andrew Clark
L. Bushnell
Radha Poovendran
78
25
0
19 Jan 2020
Algorithms in Multi-Agent Systems: A Holistic Perspective from
  Reinforcement Learning and Game Theory
Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory
Yunlong Lu
Kai Yan
AI4CE
172
13
0
17 Jan 2020
SEERL: Sample Efficient Ensemble Reinforcement Learning
SEERL: Sample Efficient Ensemble Reinforcement Learning
Rohan Saphal
Balaraman Ravindran
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
65
19
0
15 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
72
38
0
09 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
104
186
0
09 Jan 2020
On Computation and Generalization of Generative Adversarial Imitation
  Learning
On Computation and Generalization of Generative Adversarial Imitation Learning
Minshuo Chen
Yizhou Wang
Tianyi Liu
Zhuoran Yang
Xingguo Li
Zhaoran Wang
T. Zhao
136
40
0
09 Jan 2020
Reward-Conditioned Policies
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
77
96
0
31 Dec 2019
Deep Innovation Protection: Confronting the Credit Assignment Problem in
  Training Heterogeneous Neural Architectures
Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures
S. Risi
Kenneth O. Stanley
86
4
0
29 Dec 2019
Deep reinforcement learning for complex evaluation of one-loop diagrams
  in quantum field theory
Deep reinforcement learning for complex evaluation of one-loop diagrams in quantum field theory
Andreas Windisch
Thomas Gallien
Christopher Schwarzlmueller
29
4
0
27 Dec 2019
Quasi-Newton Trust Region Policy Optimization
Quasi-Newton Trust Region Policy Optimization
Devesh K. Jha
A. Raghunathan
Diego Romeres
64
9
0
26 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRLAI4TS
138
193
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
114
6
0
23 Dec 2019
Direct and indirect reinforcement learning
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
77
34
0
23 Dec 2019
Optimizing Collision Avoidance in Dense Airspace using Deep
  Reinforcement Learning
Optimizing Collision Avoidance in Dense Airspace using Deep Reinforcement Learning
Sheng Li
M. Egorov
Mykel Kochenderfer
66
32
0
20 Dec 2019
Soft Q Network
Soft Q Network
Jingbin Liu
Shuai Liu
Xinyang Gu
OffRL
51
2
0
20 Dec 2019
Taming an autonomous surface vehicle for path following and collision
  avoidance using deep reinforcement learning
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
65
66
0
18 Dec 2019
Distributional Reinforcement Learning for Energy-Based Sequential Models
Distributional Reinforcement Learning for Energy-Based Sequential Models
Tetiana Parshakova
J. Andreoli
Marc Dymetman
83
21
0
18 Dec 2019
Centralized Cooperation for Connected and Automated Vehicles at
  Intersections by Proximal Policy Optimization
Centralized Cooperation for Connected and Automated Vehicles at Intersections by Proximal Policy Optimization
Yang Guan
Yangang Ren
Shengbo Eben Li
Qi Sun
Laiquan Luo
Keqiang Li
38
6
0
18 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
72
26
0
13 Dec 2019
Learning to Reach Goals via Iterated Supervised Learning
Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh
Abhishek Gupta
Ashwin Reddy
Justin Fu
Coline Devin
Benjamin Eysenbach
Sergey Levine
119
35
0
12 Dec 2019
Provably Efficient Exploration in Policy Optimization
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
116
283
0
12 Dec 2019
Online Deep Reinforcement Learning for Autonomous UAV Navigation and
  Exploration of Outdoor Environments
Online Deep Reinforcement Learning for Autonomous UAV Navigation and Exploration of Outdoor Environments
Bruna G. Maciel-Pearson
Letizia Marchegiani
S. Akçay
Amir Atapour-Abarghouei
James Garforth
T. Breckon
68
30
0
11 Dec 2019
SMiRL: Surprise Minimizing Reinforcement Learning in Unstable
  Environments
SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments
Glen Berseth
Daniel Geng
Coline Devin
Nicholas Rhinehart
Chelsea Finn
Dinesh Jayaraman
Sergey Levine
103
22
0
11 Dec 2019
Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control
  Optimization
Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization
Paolo Pagliuca
Nicola Milano
S. Nolfi
80
31
0
11 Dec 2019
Energy-aware Scheduling of Jobs in Heterogeneous Cluster Systems Using
  Deep Reinforcement Learning
Energy-aware Scheduling of Jobs in Heterogeneous Cluster Systems Using Deep Reinforcement Learning
Amirhossein Esmaili
Massoud Pedram
13
3
0
11 Dec 2019
Risk-Averse Trust Region Optimization for Reward-Volatility Reduction
Risk-Averse Trust Region Optimization for Reward-Volatility Reduction
Qianggang Ding
Sifan Wu
Hao Sun
Jiadong Guo
Jian Guo
51
126
0
06 Dec 2019
Training Agents using Upside-Down Reinforcement Learning
Training Agents using Upside-Down Reinforcement Learning
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
95
126
0
05 Dec 2019
Hindsight Credit Assignment
Hindsight Credit Assignment
Anna Harutyunyan
Will Dabney
Thomas Mesnard
M. G. Azar
Bilal Piot
...
H. V. Hasselt
Greg Wayne
Satinder Singh
Doina Precup
Rémi Munos
100
75
0
05 Dec 2019
Inter-Level Cooperation in Hierarchical Reinforcement Learning
Inter-Level Cooperation in Hierarchical Reinforcement Learning
Abdul Rahman Kreidieh
Yiling You
Nathan Lichtlé
Samyak Parajuli
Rayyan Nasr
Alexandre M. Bayen
116
14
0
05 Dec 2019
Optimization for Reinforcement Learning: From Single Agent to
  Cooperative Agents
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
65
89
0
01 Dec 2019
Adversary A3C for Robust Reinforcement Learning
Adversary A3C for Robust Reinforcement Learning
Zhaoyuan Gu
Zhenzhong Jia
Howie Choset
AAML
66
24
0
01 Dec 2019
Previous
123...242526...394041
Next