Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel

Papers citing "Trust Region Policy Optimization"

50 / 3,102 papers shown
ISL: A novel approach for deep exploration
Lucas Cassano
Ali H. Sayed
25
1
0
13 Sep 2019
Safe Policy Improvement with an Estimated Baseline Policy
T. D. Simão
Romain Laroche
Rémi Tachet des Combes
OffRL
11
22
0
11 Sep 2019
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
Felix Leibfried
Jordi Grau-Moya
21
22
0
11 Sep 2019
Meta-Learning with Implicit Gradients
Aravind Rajeswaran
Chelsea Finn
Sham Kakade
Sergey Levine
51
844
0
10 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
24
86
0
10 Sep 2019
Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning
Arjun Manoharan
Rahul Ramesh
Balaraman Ravindran
20
3
0
09 Sep 2019
Deterministic Value-Policy Gradients
Qingpeng Cai
L. Pan
Pingzhong Tang
31
1
0
09 Sep 2019
Imitation Learning from Pixel-Level Demonstrations by HashReward
Xin-Qiang Cai
Yao-Xiang Ding
Yuan Jiang
Zhi Zhou
22
10
0
09 Sep 2019
A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots
Nicolai A. Lynnerup
Laura Nolling
Rasmus Hasle
J. Hallam
27
16
0
09 Sep 2019
Learning How to Dynamically Route Autonomous Vehicles on Shared Roads
Daniel A. Lazar
Erdem Biyik
Dorsa Sadigh
Ramtin Pedarsani
21
45
0
09 Sep 2019
Imitation Learning for Human Pose Prediction
Borui Wang
Ehsan Adeli
Hsu-kuang Chiu
De-An Huang
Juan Carlos Niebles
24
99
0
08 Sep 2019
Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation
Wonsup Shin
Hyolim Kang
Sunghoon Hong
13
0
0
07 Sep 2019
Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning
Wenjie Shi
Shiji Song
Cheng Wu
33
36
0
07 Sep 2019
Reinforcement Learning for Joint Optimization of Multiple Rewards
Mridul Agarwal
Vaneet Aggarwal
10
16
0
06 Sep 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs
Lior Shani
Yonathan Efroni
Shie Mannor
15
172
0
06 Sep 2019
ACES -- Automatic Configuration of Energy Harvesting Sensors with Reinforcement Learning
Francesco Fraternali
Bharathan Balaji
Yuvraj Agarwal
Rajesh K. Gupta
14
42
0
04 Sep 2019
Quasi-Newton Optimization Methods For Deep Learning Applications
J. Rafati
Roummel F. Marcia
ODL
6
13
0
04 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
24
96
0
03 Sep 2019
Generalization in Transfer Learning
S. E. Ada
Emre Ugur
H. L. Akin
27
17
0
03 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
25
236
0
29 Aug 2019
Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning
Xudong Sun
B. Bischl
BDL
19
9
0
25 Aug 2019
A Comparison of Action Spaces for Learning Manipulation Tasks
Patrick Varin
Lev Grossman
S. Kuindersma
33
34
0
23 Aug 2019
Model-based Lookahead Reinforcement Learning
Zhang-Wei Hong
Joni Pajarinen
Jan Peters
16
10
0
15 Aug 2019
Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real
Ofir Nachum
Michael Ahn
Hugo Ponte
S. Gu
Vikash Kumar
35
89
0
13 Aug 2019
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
32
19
0
13 Aug 2019
A review on Deep Reinforcement Learning for Fluid Mechanics
Paul Garnier
J. Viquerat
Jean Rabault
A. Larcher
A. Kuhnle
E. Hachem
AI4CE
29
254
0
12 Aug 2019
Fast Adaptation with Meta-Reinforcement Learning for Trust Modelling in Human-Robot Interaction
Yuan Gao
E. Sibirtseva
Ginevra Castellano
Danica Kragic
23
20
0
12 Aug 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
70
415
0
11 Aug 2019
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
32
22
0
08 Aug 2019
Learning to Grasp from 2.5D images: a Deep Reinforcement Learning Approach
Alessia Bertugli
P. Galeone
16
1
0
08 Aug 2019
Attention Control with Metric Learning Alignment for Image Set-based Recognition
Xiaofeng Liu
A. Marques
J. You
G. Giannakis
CVBM
21
10
0
05 Aug 2019
Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation
Anthony Corso
Peter Du
Katherine Driggs-Campbell
Mykel J. Kochenderfer
9
97
0
02 Aug 2019
Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning
R. Allen
Jayesh K. Gupta
Jaime Pena
Yutai Zhou
Javona White Bear
Mykel J. Kochenderfer
17
7
0
02 Aug 2019
Neural Simplex Architecture
Dung Phan
Radu Grosu
N. Jansen
Nicola Paoletti
S. Smolka
Scott D. Stoller
30
61
0
01 Aug 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
13
316
0
01 Aug 2019
Optimal Attacks on Reinforcement Learning Policies
Alessio Russo
Alexandre Proutiere
AAML
27
41
0
31 Jul 2019
Wasserstein Robust Reinforcement Learning
Mohammed Abdullah
Hang Ren
Haitham Bou-Ammar
Vladimir Milenkovic
Rui Luo
Mingtian Zhang
Jun Wang
32
75
0
30 Jul 2019
Hindsight Trust Region Policy Optimization
Hanbo Zhang
Site Bai
Xuguang Lan
David Hsu
Nanning Zheng
38
8
0
29 Jul 2019
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
Peter Zachares
Matthew Tan
K. Srinivasan
Silvio Savarese
Fei-Fei Li
Animesh Garg
Jeannette Bohg
SSL
28
208
0
28 Jul 2019
Self-Imitation Learning of Locomotion Movements through Termination Curriculum
Amin Babadi
Kourosh Naderi
Perttu Hämäläinen
20
7
0
27 Jul 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation
Jason Zhang
Junming Yin
Dongwon Lee
Linhong Zhu
33
2
0
26 Jul 2019
Environment Probing Interaction Policies
Wenxuan Zhou
Lerrel Pinto
Abhinav Gupta
33
68
0
26 Jul 2019
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment
Felix Leibfried
Sergio Pascual-Diaz
Jordi Grau-Moya
25
27
0
26 Jul 2019
An Information-theoretic On-line Learning Principle for Specialization in Hierarchical Decision-Making Systems
Heinke Hihn
Sebastian Gottwald
Daniel A. Braun
30
16
0
26 Jul 2019
Differentiable Gaussian Process Motion Planning
M. Bhardwaj
Byron Boots
Mustafa Mukadam
15
63
0
22 Jul 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
OffRL
40
202
0
22 Jul 2019
Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration
Qisheng Wang
Qichao Wang
23
1
0
18 Jul 2019
Efficient Autonomy Validation in Simulation with Adaptive Stress Testing
Mark Koren
Mykel Kochenderfer
12
47
0
16 Jul 2019
Proximal Policy Optimization with Mixed Distributed Training
Zhenyu Zhang
Xiangfeng Luo
Tong Liu
Shaorong Xie
Jianshu Wang
Wei Wang
Yongbin Li
Yan Peng
OffRL
30
21
0
15 Jul 2019
A Convergence Result for Regularized Actor-Critic Methods
Wesley A Suttle
Zhuoran Yang
Kai Zhang
Ji Liu
14
0
0
13 Jul 2019