ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXivPDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 3,098 papers shown
Title
Asymptotic optimality of adaptive importance sampling
Asymptotic optimality of adaptive importance sampling
B. Delyon
François Portier
18
29
0
04 Jun 2018
Internal Model from Observations for Reward Shaping
Internal Model from Observations for Reward Shaping
Daiki Kimura
Subhajit Chaudhury
Ryuki Tachibana
Sakyasingha Dasgupta
22
22
0
02 Jun 2018
Neural Control Variates for Variance Reduction
Neural Control Variates for Variance Reduction
Ruosi Wan
Mingjun Zhong
Haoyi Xiong
Zhanxing Zhu
BDL
DRL
27
18
0
01 Jun 2018
Supervised Policy Update for Deep Reinforcement Learning
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
Keith Ross
19
20
0
29 May 2018
Variational Inverse Control with Events: A General Framework for
  Data-Driven Reward Definition
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
14
125
0
29 May 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning &
  Imitation Learning
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
28
93
0
29 May 2018
Reward Constrained Policy Optimization
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
25
536
0
28 May 2018
Dual Policy Iteration
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
23
56
0
28 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
32
18
0
27 May 2018
Reliability and Learnability of Human Bandit Feedback for
  Sequence-to-Sequence Reinforcement Learning
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
30
85
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
26
83
0
26 May 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
29
65
0
25 May 2018
Object-Oriented Dynamics Predictor
Object-Oriented Dynamics Predictor
Guangxiang Zhu
Zhiao Huang
Chongjie Zhang
AI4CE
24
36
0
25 May 2018
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for
  Reinforcement Learning
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
OffRL
36
185
0
25 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
24
0
0
24 May 2018
Maximum Causal Tsallis Entropy Imitation Learning
Maximum Causal Tsallis Entropy Imitation Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
OOD
29
20
0
22 May 2018
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy
  Gradients
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients
Arbaaz Khan
Clark Zhang
Daniel D. Lee
Vijay Kumar
Alejandro Ribeiro
22
30
0
22 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
68
797
0
21 May 2018
Multiple-Step Greedy Policies in Online and Approximate Reinforcement
  Learning
Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
31
14
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
19
224
0
21 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement
  Learning
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
32
11
0
20 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
14
66
0
20 May 2018
Task-Agnostic Meta-Learning for Few-shot Learning
Task-Agnostic Meta-Learning for Few-shot Learning
Muhammad Abdullah Jamal
Guo-Jun Qi
M. Shah
52
456
0
20 May 2018
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for
  Map-less Navigation by Leveraging Prior Demonstrations
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations
Mark Pfeiffer
Samarth Shukla
M. Turchetta
Cesar Cadena
Andreas Krause
Roland Siegwart
Juan I. Nieto
27
157
0
18 May 2018
GAN Q-learning
GAN Q-learning
T. Doan
Bogdan Mazoure
Clare Lyle
OOD
OffRL
11
19
0
13 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially
  Observable Markov Decision Processes
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
25
51
0
11 May 2018
Behavioral Cloning from Observation
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
58
710
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
33
662
0
02 May 2018
Reward Learning from Narrated Demonstrations
Reward Learning from Narrated Demonstrations
H. Tung
Adam W. Harley
Liang-Kang Huang
Katerina Fragkiadaki
LM&Ro
SSL
39
28
0
27 Apr 2018
Decoupling Dynamics and Reward for Transfer Learning
Decoupling Dynamics and Reward for Transfer Learning
Amy Zhang
Harsh Satija
Joelle Pineau
OOD
11
72
0
27 Apr 2018
Deep Reinforcement Learning to Acquire Navigation Skills for
  Wheel-Legged Robots in Complex Environments
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
22
44
0
27 Apr 2018
Scalable Bilinear $π$ Learning Using State and Action Features
Scalable Bilinear πππ Learning Using State and Action Features
Yichen Chen
Lihong Li
Mengdi Wang
12
46
0
27 Apr 2018
Distributed Distributional Deterministic Policy Gradients
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
63
477
0
23 Apr 2018
Taskonomy: Disentangling Task Transfer Learning
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas J. Guibas
Jitendra Malik
Silvio Savarese
36
1,206
0
23 Apr 2018
A Reinforcement Learning Based Approach for Automated Lane Change
  Maneuvers
A Reinforcement Learning Based Approach for Automated Lane Change Maneuvers
Pin Wang
Ching-yao Chan
A. de La Fortelle
17
252
0
21 Apr 2018
The Limits and Potentials of Deep Learning for Robotics
The Limits and Potentials of Deep Learning for Robotics
Niko Sünderhauf
Oliver Brock
Walter J. Scheirer
R. Hadsell
Dieter Fox
...
B. Upcroft
Pieter Abbeel
Wolfram Burgard
Michael Milford
Peter Corke
17
522
0
18 Apr 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
22
22
0
17 Apr 2018
An information-theoretic on-line update principle for perception-action
  coupling
An information-theoretic on-line update principle for perception-action coupling
Zhen Peng
Tim Genewein
Felix Leibfried
Daniel A. Braun
14
13
0
16 Apr 2018
Intrinsically motivated reinforcement learning for human-robot
  interaction in the real-world
Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
A. H. Qureshi
Yutaka Nakamura
Y. Yoshikawa
H. Ishiguro
14
58
0
14 Apr 2018
Reinforcement Learning for UAV Attitude Control
Reinforcement Learning for UAV Attitude Control
W. Koch
R. Mancuso
R. West
Azer Bestavros
29
381
0
11 Apr 2018
Derivative free optimization via repeated classification
Derivative free optimization via repeated classification
Tatsunori B. Hashimoto
Steve Yadlowsky
John C. Duchi
11
18
0
11 Apr 2018
Policy Gradient With Value Function Approximation For Collective
  Multiagent Planning
Policy Gradient With Value Function Approximation For Collective Multiagent Planning
D. Nguyen
Akshat Kumar
H. Lau
25
43
0
09 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
27
189
0
09 Apr 2018
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based
  Character Skills
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Xue Bin Peng
Pieter Abbeel
Sergey Levine
M. van de Panne
AI4CE
177
495
0
08 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy
  Optimization
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
27
147
0
06 Apr 2018
Renewal Monte Carlo: Renewal theory based reinforcement learning
Renewal Monte Carlo: Renewal theory based reinforcement learning
Jayakumar Subramanian
Aditya Mahajan
10
11
0
03 Apr 2018
StarCraft Micromanagement with Reinforcement Learning and Curriculum
  Transfer Learning
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning
Kun Shao
Yuanheng Zhu
Dongbin Zhao
107
170
0
03 Apr 2018
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
23
68
0
02 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion
  using deep reinforcement learning
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
34
60
0
31 Mar 2018
Previous
123...555657...606162
Next