arXiv:1502.05477 (v5, latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
Showing 50 of 2,009 papers
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
119
21
0
18 Oct 2021
On-Policy Model Errors in Reinforcement Learning
Lukas P. Frohlich
Maksym Lefarov
Melanie Zeilinger
Felix Berkenkamp
OnRL
81
6
0
15 Oct 2021
Wasserstein Unsupervised Reinforcement Learning
Shuncheng He
Yuhang Jiang
Hongchang Zhang
Jianzhun Shao
Xiangyang Ji
OffRL
93
24
0
15 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
81
31
0
14 Oct 2021
Improving the sample-efficiency of neural architecture search with reinforcement learning
A. Nagy
Ábel Boros
118
3
0
13 Oct 2021
Twice regularized MDPs and the equivalence between robustness and regularization
E. Derman
Matthieu Geist
Shie Mannor
127
58
0
12 Oct 2021
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent
Weiming Liu
Huacong Jiang
Bin Li
Houqiang Li
60
10
0
11 Oct 2021
Satisficing Paths and Independent Multi-Agent Reinforcement Learning in Stochastic Games
Bora Yongacoglu
Gürdal Arslan
S. Yüksel
63
16
0
09 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation
Junhong Shen
Lin F. Yang
OffRL
51
18
0
09 Oct 2021
Distributed Proximal Policy Optimization for Contention-Based Spectrum Access
Akash S. Doshi
J. Andrews
65
3
0
07 Oct 2021
On the Privacy Risks of Deploying Recurrent Neural Networks in Machine Learning Models
Yunhao Yang
Parham Gohari
Ufuk Topcu
AAML
90
3
0
06 Oct 2021
Multi-Agent Constrained Policy Optimisation
Shangding Gu
J. Kuba
Munning Wen
Ruiqing Chen
Ziyan Wang
Zheng Tian
Jun Wang
Alois Knoll
Yaodong Yang
165
49
0
06 Oct 2021
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
87
2
0
06 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML
FAtt
OffRL
68
2
0
06 Oct 2021
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
155
27
0
01 Oct 2021
Powerpropagation: A sparsity inducing weight reparameterisation
Jonathan Richard Schwarz
Siddhant M. Jayakumar
Razvan Pascanu
P. Latham
Yee Whye Teh
201
55
0
01 Oct 2021
Solving the Real Robot Challenge using Deep Reinforcement Learning
Robert McCarthy
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
S. Redmond
105
11
0
30 Sep 2021
Bitcoin Transaction Strategy Construction Based on Deep Reinforcement Learning
Fengrui Liu
Yang Li
Baitong Li
Jiaxin Li
Huiyang Xie
53
45
0
30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
94
8
0
29 Sep 2021
Lyapunov-stable neural-network control
Hongkai Dai
Benoit Landry
Lujie Yang
Marco Pavone
Russ Tedrake
102
125
0
29 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
138
60
0
28 Sep 2021
Exploring More When It Needs in Deep Reinforcement Learning
Youtian Guo
Qitong Gao
33
0
0
28 Sep 2021
Semi-Autonomous Teleoperation via Learning Non-Prehensile Manipulation Skills
Sangbeom Park
Yoonbyung Chai
Sunghyun Park
Jeongeun Park
Kyungjae Lee
Sungjoon Choi
SSL
73
5
0
27 Sep 2021
PM-FSM: Policies Modulating Finite State Machine for Robust Quadrupedal Locomotion
Ren Liu
Nitish Sontakke
Sehoon Ha
92
2
0
26 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
99
42
0
25 Sep 2021
Regularization Guarantees Generalization in Bayesian Reinforcement Learning through Algorithmic Stability
Aviv Tamar
Daniel Soudry
E. Zisselman
OOD
OffRL
59
7
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
63
34
0
24 Sep 2021
Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving
Gunmin Lee
Wooseok Oh
Seungyoung Shin
Dohyeong Kim
Jeongwoo Oh
Jaeyeon Jeong
Sungjoon Choi
Songhwai Oh
SSL
78
2
0
23 Sep 2021
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
129
250
0
23 Sep 2021
Generalization in Mean Field Games by Learning Master Policies
Sarah Perrin
Mathieu Laurière
Julien Pérolat
Romuald Élie
Matthieu Geist
Olivier Pietquin
AI4CE
150
37
0
20 Sep 2021
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach
Minghao Li
Yingrui Jie
Yang Kong
Hui Cheng
62
9
0
17 Sep 2021
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
69
15
0
17 Sep 2021
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
30
4
0
16 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
91
77
0
16 Sep 2021
ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations
Ruohan Gao
Yen-Yu Chang
Shivani Mall
Li Fei-Fei
Jiajun Wu
114
84
0
16 Sep 2021
DROMO: Distributionally Robust Offline Model-based Policy Optimization
Ruizhen Liu
Dazhi Zhong
Zhi-Cong Chen
OffRL
68
3
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
86
6
0
13 Sep 2021
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
94
43
0
12 Sep 2021
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
95
58
0
09 Sep 2021
Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning
B. Gaudet
R. Furfaro
78
8
0
08 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
A Deep Reinforcement Learning Approach for Online Parcel Assignment
Hao Zeng
Qiong Wu
Kunpeng Han
Jun He
Haoyuan Hu
59
2
0
08 Sep 2021
Optimal Stroke Learning with Policy Gradient Approach for Robotic Table Tennis
Yapeng Gao
Jonas Tebbe
A. Zell
OffRL
96
14
0
07 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
109
84
0
01 Sep 2021
Adaptive perturbation adversarial training: based on reinforcement learning
Zhi-pin Nie
Ying Lin
Sp Ren
Lan Zhang
AAML
37
1
0
30 Aug 2021
Photonic Quantum Policy Learning in OpenAI Gym
D. Nagy
Zsolt I. Tabi
Péter Hága
Zsófia Kallus
Z. Zimborás
96
8
0
29 Aug 2021
Accelerating Serverless Computing by Harvesting Idle Resources
Hanfei Yu
Hao Wang
Jian Li
Xuemei Yuan
Seung-Jong Park
42
32
0
28 Aug 2021
Influence-Based Reinforcement Learning for Intrinsically-Motivated Agents
Ammar Fayad
M. Ibrahim
62
5
0
28 Aug 2021