ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization
v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 2,020 papers shown
Title
Switching Isotropic and Directional Exploration with Parameter Space
  Noise in Deep Reinforcement Learning
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
37
3
0
18 Sep 2018
Adversarial Imitation via Variational Inverse Reinforcement Learning
Adversarial Imitation via Variational Inverse Reinforcement Learning
A. H. Qureshi
Byron Boots
Michael C. Yip
77
61
0
17 Sep 2018
Curriculum goal masking for continuous deep reinforcement learning
Curriculum goal masking for continuous deep reinforcement learning
Manfred Eppe
S. Magg
S. Wermter
56
19
0
17 Sep 2018
Policy Optimization via Importance Sampling
Policy Optimization via Importance Sampling
Alberto Maria Metelli
Matteo Papini
Francesco Faccio
Marcello Restelli
OffRL
116
90
0
17 Sep 2018
Model-Based Reinforcement Learning via Meta-Policy Optimization
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
85
228
0
14 Sep 2018
Deep Reinforcement Learning for Event-Triggered Control
Deep Reinforcement Learning for Event-Triggered Control
Dominik Baumann
Jia Jie Zhu
Georg Martius
Sebastian Trimpe
BDL
83
61
0
13 Sep 2018
Multi-task Deep Reinforcement Learning with PopArt
Multi-task Deep Reinforcement Learning with PopArt
Matteo Hessel
Hubert Soyer
L. Espeholt
Wojciech M. Czarnecki
Simon Schmitt
H. V. Hasselt
147
320
0
12 Sep 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
78
8
0
10 Sep 2018
Learning Adaptive Display Exposure for Real-Time Advertising
Learning Adaptive Display Exposure for Real-Time Advertising
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
...
Xiaotian Hao
Yixi Wang
Han Li
Jian Xu
Kun Gai
43
6
0
10 Sep 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward
  Bias in Adversarial Imitation Learning
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
111
259
0
09 Sep 2018
Learning Invariances for Policy Generalization
Learning Invariances for Policy Generalization
Rémi Tachet des Combes
Philip Bachman
H. V. Seijen
83
12
0
07 Sep 2018
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization
Bo Liu
Tengyang Xie
Yangyang Xu
Mohammad Ghavamzadeh
Yinlam Chow
Daoming Lyu
Daesub Yoon
90
30
0
07 Sep 2018
Emergence of Human-comparable Balancing Behaviors by Deep Reinforcement
  Learning
Emergence of Human-comparable Balancing Behaviors by Deep Reinforcement Learning
Chuanyu Yang
Taku Komura
Zhibin Li
60
20
0
06 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
79
47
0
06 Sep 2018
A Robotic Auto-Focus System based on Deep Reinforcement Learning
A Robotic Auto-Focus System based on Deep Reinforcement Learning
Xiaofan Yu
Runze Yu
Jingsong Yang
Xiaohui Duan
28
9
0
05 Sep 2018
Gibson Env: Real-World Perception for Embodied Agents
Gibson Env: Real-World Perception for Embodied Agents
F. Xia
Amir Zamir
Zhi-Yang He
Alexander Sax
Jitendra Malik
Silvio Savarese
AI4CELM&Ro
108
830
0
31 Aug 2018
Application of Self-Play Reinforcement Learning to a Four-Player Game of
  Imperfect Information
Application of Self-Play Reinforcement Learning to a Four-Player Game of Imperfect Information
Henry Charlesworth
SSL
23
13
0
30 Aug 2018
SOLAR: Deep Structured Representations for Model-Based Reinforcement
  Learning
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
Marvin Zhang
Sharad Vikram
Laura M. Smith
Pieter Abbeel
Matthew J. Johnson
Sergey Levine
OffRL
110
42
0
28 Aug 2018
LIFT: Reinforcement Learning in Computer Systems by Learning From
  Demonstrations
LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations
Michael Schaarschmidt
A. Kuhnle
Ben Ellis
Kai Fricke
Felix Gessert
Eiko Yoneki
OffRL
65
41
0
23 Aug 2018
Intelligent Middle-Level Game Control
Intelligent Middle-Level Game Control
Amin Babadi
Kourosh Naderi
Perttu Hämäläinen
17
5
0
19 Aug 2018
Importance mixing: Improving sample reuse in evolutionary policy search
  methods
Importance mixing: Improving sample reuse in evolutionary policy search methods
Aloïs Pourchot
Nicolas Perrin
Olivier Sigaud
68
14
0
17 Aug 2018
Risk-Sensitive Generative Adversarial Imitation Learning
Risk-Sensitive Generative Adversarial Imitation Learning
Jonathan Lacotte
Mohammad Ghavamzadeh
Yinlam Chow
Marco Pavone
GAN
90
24
0
13 Aug 2018
Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement
  Learning for Safe and Efficient Navigation in Complex Scenarios
Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios
Tingxiang Fan
Pinxin Long
Wenxi Liu
Jia Pan
65
69
0
11 Aug 2018
Policy Optimization as Wasserstein Gradient Flows
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
88
68
0
09 Aug 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
62
2
0
26 Jul 2018
Variational Bayesian Reinforcement Learning with Regret Bounds
Variational Bayesian Reinforcement Learning with Regret Bounds
Brendan O'Donoghue
117
41
0
25 Jul 2018
CrowdMove: Autonomous Mapless Navigation in Crowded Scenarios
CrowdMove: Autonomous Mapless Navigation in Crowded Scenarios
Tingxiang Fan
Xinjing Cheng
Jia Pan
Tianyi Zhou
Ruigang Yang
91
53
0
19 Jul 2018
Deep Reinforcement Learning for Swarm Systems
Deep Reinforcement Learning for Swarm Systems
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
96
198
0
17 Jul 2018
Generative Adversarial Imitation from Observation
Generative Adversarial Imitation from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
GAN
106
245
0
17 Jul 2018
Discrete linear-complexity reinforcement learning in continuous action
  spaces for Q-learning algorithms
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms
P. Tavallali
G. Doran
L. Mandrake
28
0
0
16 Jul 2018
Online Robust Policy Learning in the Presence of Unknown Adversaries
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron J. Havens
Zhanhong Jiang
Soumik Sarkar
AAML
120
44
0
16 Jul 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
Hierarchical Reinforcement Learning Framework towards Multi-agent
  Navigation
Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
Wenhao Ding
Shuaijun Li
Huihuan Qian
121
32
0
14 Jul 2018
Algorithmic Framework for Model-based Deep Reinforcement Learning with
  Theoretical Guarantees
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Yuping Luo
Huazhe Xu
Yuanzhi Li
Yuandong Tian
Trevor Darrell
Tengyu Ma
OffRL
136
227
0
10 Jul 2018
Is Q-learning Provably Efficient?
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
145
813
0
10 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic
  Parsing
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Chen Liang
Mohammad Norouzi
Jonathan Berant
Quoc V. Le
Ni Lao
136
134
0
06 Jul 2018
A survey on policy search algorithms for learning robot controllers in a
  handful of trials
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
103
155
0
06 Jul 2018
Variance Reduction for Reinforcement Learning in Input-Driven
  Environments
Variance Reduction for Reinforcement Learning in Input-Driven Environments
Hongzi Mao
S. Venkatakrishnan
Malte Schwarzkopf
Mohammad Alizadeh
OffRL
104
95
0
06 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for
  Intelligent Traffic Signal Control
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
74
104
0
04 Jul 2018
Learning to Drive in a Day
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
141
659
0
01 Jul 2018
Towards Mixed Optimization for Reinforcement Learning with Program
  Synthesis
Towards Mixed Optimization for Reinforcement Learning with Program Synthesis
Surya Bhupatiraju
Kumar Krishna Agrawal
Rishabh Singh
42
8
0
01 Jul 2018
Bayesian Counterfactual Risk Minimization
Bayesian Counterfactual Risk Minimization
Ben London
Ted Sandler
OffRL
56
33
0
29 Jun 2018
Deep Generative Models with Learnable Knowledge Constraints
Deep Generative Models with Learnable Knowledge Constraints
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Xiaodan Liang
Lianhui Qin
Haoye Dong
Eric Xing
BDLAI4CE
111
76
0
26 Jun 2018
A Tour of Reinforcement Learning: The View from Continuous Control
A Tour of Reinforcement Learning: The View from Continuous Control
Benjamin Recht
137
635
0
25 Jun 2018
Multi-objective Model-based Policy Search for Data-efficient Learning
  with Sparse Rewards
Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards
Rituraj Kaushik
Konstantinos Chatzilygeroudis
Jean-Baptiste Mouret
72
19
0
25 Jun 2018
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement
  Learning Experiments
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
63
93
0
21 Jun 2018
Deep Reinforcement Learning for Surgical Gesture Segmentation and
  Classification
Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification
Daochang Liu
Tingting Jiang
91
63
0
21 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
134
222
0
20 Jun 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
102
480
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep
  Q-Network
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
45
21
0
14 Jun 2018
Previous
123...333435...394041
Next