ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization
v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 2,009 papers shown
Title
Reinforcement Learning with Augmented Data
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
168
661
0
30 Apr 2020
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Junyao Chen
Li Xia
Jun Yang
Qianchuan Zhao
Zhengyuan Zhou
94
17
0
30 Apr 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator
  Policy Optimization
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. DÓro
Wojciech Ja'skowski
OffRL
102
27
0
29 Apr 2020
Transferable Active Grasping and Real Embodied Dataset
Transferable Active Grasping and Real Embodied Dataset
Xiangyu Chen
Zelin Ye
Jiankai Sun
Yuda Fan
Fangwei Hu
Chenxi Wang
Cewu Lu
54
19
0
28 Apr 2020
The Ingredients of Real-World Robotic Reinforcement Learning
The Ingredients of Real-World Robotic Reinforcement Learning
Henry Zhu
Justin Yu
Abhishek Gupta
Dhruv Shah
Kristian Hartikainen
Avi Singh
Vikash Kumar
Sergey Levine
OffRL
161
181
0
27 Apr 2020
CFR-RL: Traffic Engineering with Reinforcement Learning in SDN
CFR-RL: Traffic Engineering with Reinforcement Learning in SDN
Member Ieee Junjie Zhang
Minghao Ye
Senior Member Ieee Zehua Guo
Chen-Yu Yen
F. I. H. Jonathan Chao
37
140
0
24 Apr 2020
Self-Paced Deep Reinforcement Learning
Self-Paced Deep Reinforcement Learning
Pascal Klink
Carlo DÉramo
Jan Peters
Joni Pajarinen
ODL
100
54
0
24 Apr 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
110
38
0
22 Apr 2020
Policy Gradient from Demonstration and Curiosity
Policy Gradient from Demonstration and Curiosity
Jie Chen
Wenjun Xu
129
12
0
22 Apr 2020
Sequential Anomaly Detection using Inverse Reinforcement Learning
Sequential Anomaly Detection using Inverse Reinforcement Learning
Min Hwan Oh
G. Iyengar
OffRLAI4TS
70
81
0
22 Apr 2020
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage
  Decomposition
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition
Zihan Zhang
Yuanshuo Zhou
Xiangyang Ji
OffRL
84
158
0
21 Apr 2020
SIBRE: Self Improvement Based REwards for Adaptive Feedback in
  Reinforcement Learning
SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning
Somjit Nath
Richa Verma
Abhik Ray
H. Khadilkar
23
0
0
21 Apr 2020
Energy-Based Imitation Learning
Energy-Based Imitation Learning
Minghuan Liu
Tairan He
Minkai Xu
Weinan Zhang
118
48
0
20 Apr 2020
Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike
  Common Sense
Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense
Yixin Zhu
Tao Gao
Lifeng Fan
Siyuan Huang
Mark Edmonds
...
Fangqiu Yi
Siyuan Qi
Ying Nian Wu
J. Tenenbaum
Song-Chun Zhu
115
130
0
20 Apr 2020
Modeling Survival in model-based Reinforcement Learning
Modeling Survival in model-based Reinforcement Learning
Saeed Moazami
P. Doerschuk
OffRL
29
1
0
18 Apr 2020
F2A2: Flexible Fully-decentralized Approximate Actor-critic for
  Cooperative Multi-agent Reinforcement Learning
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
Wenhao Li
Bo Jin
Xiangfeng Wang
Junchi Yan
H. Zha
113
21
0
17 Apr 2020
Knowledge-guided Deep Reinforcement Learning for Interactive
  Recommendation
Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation
Xiaocong Chen
Chaoran Huang
Lina Yao
Xianzhi Wang
Wei Liu
Wenjie Zhang
131
36
0
17 Apr 2020
A Game Theoretic Framework for Model Based Reinforcement Learning
A Game Theoretic Framework for Model Based Reinforcement Learning
Aravind Rajeswaran
Igor Mordatch
Vikash Kumar
OffRL
67
128
0
16 Apr 2020
TextGAIL: Generative Adversarial Imitation Learning for Text Generation
TextGAIL: Generative Adversarial Imitation Learning for Text Generation
Qingyang Wu
Lei Li
Zhou Yu
GAN
96
50
0
07 Apr 2020
Intrinsic Exploration as Multi-Objective RL
Intrinsic Exploration as Multi-Objective RL
Philippe Morere
F. Ramos
22
1
0
06 Apr 2020
Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations
Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
54
14
0
01 Apr 2020
Leverage the Average: an Analysis of KL Regularization in RL
Leverage the Average: an Analysis of KL Regularization in RL
Nino Vieillard
Tadashi Kozuno
B. Scherrer
Olivier Pietquin
Rémi Munos
Matthieu Geist
122
43
0
31 Mar 2020
When Autonomous Systems Meet Accuracy and Transferability through AI: A
  Survey
When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey
Chongzhen Zhang
Jianrui Wang
Gary G. Yen
Chaoqiang Zhao
Qiyu Sun
Yang Tang
Feng Qian
Jürgen Kurths
AAML
95
20
0
29 Mar 2020
Policy Teaching via Environment Poisoning: Training-time Adversarial
  Attacks against Reinforcement Learning
Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning
Amin Rakhsha
Goran Radanović
R. Devidze
Xiaojin Zhu
Adish Singla
AAMLOffRL
99
125
0
28 Mar 2020
Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with
  Reward Shaping
Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping
Dan-xu Zhang
Colleen P. Bailey
92
12
0
28 Mar 2020
PADS: Policy-Adapted Sampling for Visual Similarity Learning
PADS: Policy-Adapted Sampling for Visual Similarity Learning
Karsten Roth
Timo Milbich
Bjorn Ommer
132
49
0
24 Mar 2020
Safe Crossover of Neural Networks Through Neuron Alignment
Safe Crossover of Neural Networks Through Neuron Alignment
Thomas Uriot
Dario Izzo
77
14
0
23 Mar 2020
Importance of using appropriate baselines for evaluation of
  data-efficiency in deep reinforcement learning for Atari
Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari
Kacper Kielak
OffRL
51
8
0
23 Mar 2020
Safe Reinforcement Learning of Control-Affine Systems with Vertex
  Networks
Safe Reinforcement Learning of Control-Affine Systems with Vertex Networks
Liyuan Zheng
Yuanyuan Shi
Lillian J. Ratliff
Baosen Zhang
52
20
0
20 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on
  State Observations
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Yue Liu
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
190
277
0
19 Mar 2020
Placement Optimization with Deep Reinforcement Learning
Placement Optimization with Deep Reinforcement Learning
Anna Goldie
Azalia Mirhoseini
OffRL
24
36
0
18 Mar 2020
Sparse Graphical Memory for Robust Planning
Sparse Graphical Memory for Robust Planning
Scott Emmons
Ajay Jain
Michael Laskin
Thanard Kurutach
Pieter Abbeel
Deepak Pathak
88
50
0
13 Mar 2020
A General Framework for Learning Mean-Field Games
A General Framework for Learning Mean-Field Games
Xin Guo
Anran Hu
Renyuan Xu
Junzi Zhang
OffRLAI4CE
141
51
0
13 Mar 2020
Learning Predictive Representations for Deformable Objects Using
  Contrastive Estimation
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation
Wilson Yan
Ashwin Vangipuram
Pieter Abbeel
Lerrel Pinto
108
191
0
11 Mar 2020
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
OffRL
76
48
0
11 Mar 2020
Machine Learning for Intelligent Optical Networks: A Comprehensive
  Survey
Machine Learning for Intelligent Optical Networks: A Comprehensive Survey
Rentao Gu
Zeyuan Yang
Yuefeng Ji
67
115
0
11 Mar 2020
Quality Diversity for Multi-task Optimization
Quality Diversity for Multi-task Optimization
Jean-Baptiste Mouret
Glenn Maguire
76
57
0
09 Mar 2020
Stochastic Recursive Momentum for Policy Gradient Methods
Stochastic Recursive Momentum for Policy Gradient Methods
Huizhuo Yuan
Xiangru Lian
Ji Liu
Yuren Zhou
92
32
0
09 Mar 2020
Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning
Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning
Yifan Zhang
P. Zhao
Qingyao Wu
Bin Li
Junzhou Huang
Mingkui Tan
OOD
151
97
0
06 Mar 2020
Dynamic Experience Replay
Dynamic Experience Replay
Jieliang Luo
Hui Li
223
24
0
04 Mar 2020
Neural-Network Heuristics for Adaptive Bayesian Quantum Estimation
Neural-Network Heuristics for Adaptive Bayesian Quantum Estimation
Lukas J. Fiderer
Jonas Schuff
D. Braun
53
49
0
04 Mar 2020
Hierarchically Decoupled Imitation for Morphological Transfer
Hierarchically Decoupled Imitation for Morphological Transfer
D. Hejna
Pieter Abbeel
Lerrel Pinto
LM&Ro
77
43
0
03 Mar 2020
Efficient Exploration in Constrained Environments with Goal-Oriented
  Reference Path
Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path
Keita Ota
Y. Sasaki
Devesh K. Jha
Yusuke Yoshiyasu
Asako Kanezaki
110
18
0
03 Mar 2020
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?
Keita Ota
Tomoaki Oiki
Devesh K. Jha
T. Mariyama
D. Nikovski
OffRL
107
42
0
03 Mar 2020
Embodied Synaptic Plasticity with Online Reinforcement learning
Embodied Synaptic Plasticity with Online Reinforcement learning
Jacques Kaiser
M. Hoff
Andreas Konle
J. C. V. Tieck
David Kappel
...
Anand Subramoney
Robert Legenstein
A. Rönnau
Wolfgang Maass
Rüdiger Dillmann
OffRL
38
16
0
03 Mar 2020
Safe Reinforcement Learning for Autonomous Vehicles through Parallel
  Constrained Policy Optimization
Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization
Lu Wen
Jingliang Duan
Shengbo Eben Li
Shaobing Xu
H. Peng
66
68
0
03 Mar 2020
Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning
Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning
Xingyou Song
Yuxiang Yang
K. Choromanski
Ken Caluwaerts
Wenbo Gao
Chelsea Finn
Jie Tan
187
80
0
02 Mar 2020
Gaussian Process Policy Optimization
Gaussian Process Policy Optimization
Anand Srinivasa Rao
Bidipta Sarkar
Tejas Narayanan
GP
33
0
0
02 Mar 2020
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with
  Adversarial Loss
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
Shuang Qiu
Xiaohan Wei
Zhuoran Yang
Jieping Ye
Zhaoran Wang
183
50
0
02 Mar 2020
How Do We Move: Modeling Human Movement with System Dynamics
How Do We Move: Modeling Human Movement with System Dynamics
Hua Wei
Dongkuan Xu
Z. Li
Zhenhui Li
24
1
0
01 Mar 2020
Previous
123...222324...394041
Next