Trust Region Policy Optimization
Versions: v1, v2, v3, v4, v5 (latest)


19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
arXiv: 1502.05477

Papers citing "Trust Region Policy Optimization"

50 / 2,023 papers shown
Title
Reinforcement Learning for Joint Optimization of Multiple Rewards
Mridul Agarwal
Vaneet Aggarwal
87
16
0
06 Sep 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs
Lior Shani
Yonathan Efroni
Shie Mannor
94
176
0
06 Sep 2019
ACES -- Automatic Configuration of Energy Harvesting Sensors with Reinforcement Learning
Francesco Fraternali
Bharathan Balaji
Yuvraj Agarwal
Rajesh K. Gupta
21
44
0
04 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
94
98
0
03 Sep 2019
Generalization in Transfer Learning
S. E. Ada
Emre Ugur
H. L. Akin
76
18
0
03 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
115
242
0
29 Aug 2019
Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning
Xudong Sun
B. Bischl
BDL
75
9
0
25 Aug 2019
A Comparison of Action Spaces for Learning Manipulation Tasks
Patrick Varin
Lev Grossman
S. Kuindersma
67
34
0
23 Aug 2019
Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real
Ofir Nachum
Michael Ahn
Hugo Ponte
S. Gu
Vikash Kumar
68
91
0
13 Aug 2019
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
63
20
0
13 Aug 2019
Fast Adaptation with Meta-Reinforcement Learning for Trust Modelling in Human-Robot Interaction
Yuan Gao
E. Sibirtseva
Ginevra Castellano
Danica Kragic
74
21
0
12 Aug 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
126
439
0
11 Aug 2019
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
73
22
0
08 Aug 2019
Attention Control with Metric Learning Alignment for Image Set-based Recognition
Xiaofeng Liu
A. Marques
J. You
G. Giannakis
CVBM
83
10
0
05 Aug 2019
Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation
Anthony Corso
Peter Du
Katherine Driggs-Campbell
Mykel J. Kochenderfer
59
99
0
02 Aug 2019
Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning
R. Allen
Jayesh K. Gupta
Jaime Pena
Yutai Zhou
Javona White Bear
Mykel J. Kochenderfer
58
7
0
02 Aug 2019
Neural Simplex Architecture
Dung Phan
Radu Grosu
N. Jansen
Nicola Paoletti
S. Smolka
Scott D. Stoller
87
62
0
01 Aug 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
142
321
0
01 Aug 2019
Optimal Attacks on Reinforcement Learning Policies
Alessio Russo
Alexandre Proutiere
AAML
65
42
0
31 Jul 2019
Wasserstein Robust Reinforcement Learning
Mohammed Abdullah
Hang Ren
Haitham Bou-Ammar
Vladimir Milenkovic
Rui Luo
Mingtian Zhang
Jun Wang
164
76
0
30 Jul 2019
Hindsight Trust Region Policy Optimization
Hanbo Zhang
Site Bai
Xuguang Lan
David Hsu
Nanning Zheng
68
8
0
29 Jul 2019
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
Peter Zachares
Matthew Tan
K. Srinivasan
Silvio Savarese
Fei-Fei Li
Animesh Garg
Jeannette Bohg
SSL
97
213
0
28 Jul 2019
Self-Imitation Learning of Locomotion Movements through Termination Curriculum
Amin Babadi
Kourosh Naderi
Perttu Hämäläinen
60
7
0
27 Jul 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation
Jason Zhang
Junming Yin
Dongwon Lee
Linhong Zhu
55
2
0
26 Jul 2019
Environment Probing Interaction Policies
Wenxuan Zhou
Lerrel Pinto
Abhinav Gupta
61
67
0
26 Jul 2019
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment
Felix Leibfried
Sergio Pascual-Diaz
Jordi Grau-Moya
125
29
0
26 Jul 2019
An Information-theoretic On-line Learning Principle for Specialization in Hierarchical Decision-Making Systems
Heinke Hihn
Sebastian Gottwald
Daniel A. Braun
99
16
0
26 Jul 2019
Differentiable Gaussian Process Motion Planning
M. Bhardwaj
Byron Boots
Mustafa Mukadam
77
63
0
22 Jul 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
OffRL
89
205
0
22 Jul 2019
Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration
Qisheng Wang
Qichao Wang
52
1
0
18 Jul 2019
Efficient Autonomy Validation in Simulation with Adaptive Stress Testing
Mark Koren
Mykel Kochenderfer
41
47
0
16 Jul 2019
Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations
B. Gaudet
R. Linares
R. Furfaro
41
33
0
13 Jul 2019
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation
Wenjie Shang
Yang Yu
Qingyang Li
Zhiwei Qin
Yiping Meng
Jieping Ye
CML
67
51
0
12 Jul 2019
Imitation-Projected Programmatic Reinforcement Learning
A. Verma
Hoang Minh Le
Yisong Yue
Swarat Chaudhuri
44
2
0
11 Jul 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
147
561
0
11 Jul 2019
Safe Policy Improvement with Soft Baseline Bootstrapping
Kimia Nadjahi
Romain Laroche
Rémi Tachet des Combes
OffRL
70
36
0
11 Jul 2019
A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning
Nicholas C. Landolfi
G. Thomas
Tengyu Ma
OffRL
64
19
0
11 Jul 2019
Trust-Region Variational Inference with Gaussian Mixture Models
Oleg Arenz
Mingjun Zhong
Gerhard Neumann
87
20
0
10 Jul 2019
An Optimistic Perspective on Offline Reinforcement Learning
Rishabh Agarwal
Dale Schuurmans
Mohammad Norouzi
OffRL, OnRL
126
70
0
10 Jul 2019
Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems
M. Lutter
Kim D. Listmann
Jan Peters
PINN
84
75
0
10 Jul 2019
On-Policy Robot Imitation Learning from a Converging Supervisor
Ashwin Balakrishna
Brijen Thananjeyan
Jonathan Lee
Felix Li
Arsh Zahed
Joseph E. Gonzalez
Ken Goldberg
141
17
0
08 Jul 2019
Deep Learning based Wireless Resource Allocation with Application to Vehicular Networks
Le Liang
Hao Ye
Guanding Yu
Geoffrey Ye Li
78
200
0
07 Jul 2019
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Oliver Kroemer
S. Niekum
George Konidaris
153
369
0
06 Jul 2019
Entropic Regularization of Markov Decision Processes
Boris Belousov
Jan Peters
73
24
0
06 Jul 2019
Intrinsic Motivation Driven Intuitive Physics Learning using Deep Reinforcement Learning with Intrinsic Reward Normalization
Jae-Woo Choi
Sung-eui Yoon
AI4CE, PINN
67
3
0
06 Jul 2019
Dependency-aware Attention Control for Unconstrained Face Recognition with Image Sets
Xiaofeng Liu
B. Kumar
Chao Yang
Qingming Tang
J. You
CVBM
95
42
0
05 Jul 2019
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam
Eric Crawford
T. Doan
Doina Precup
OffRL, SSL
74
24
0
05 Jul 2019
Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model
Akira Kinose
T. Taniguchi
107
20
0
03 Jul 2019
Benchmarking Model-Based Reinforcement Learning
Tingwu Wang
Xuchan Bao
I. Clavera
Jerrick Hoang
Yeming Wen
Eric D. Langlois
Matthew Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
OffRL
122
365
0
03 Jul 2019
Co-training for Policy Learning
Jialin Song
Ravi Lanka
Yisong Yue
M. Ono
OffRL
66
20
0
03 Jul 2019