Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,009 papers shown
Title
Trust-Region Method with Deep Reinforcement Learning in Analog Design Space Exploration
Kai-En Yang
Chia-Yu Tsai
Hung-Hao Shen
Chen-Feng Chiang
Feng-Ming Tsai
Chunguang Wang
Yiju Ting
Chia-Shun Yeh
C. Lai
53
14
0
29 Sep 2020
Scalable Deep Reinforcement Learning for Ride-Hailing
Jiekun Feng
Mark O. Gluzman
J. Dai
BDL
122
17
0
27 Sep 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
265
743
0
24 Sep 2020
Revisiting Design Choices in Proximal Policy Optimization
Chloe Ching-Yun Hsu
Celestine Mendler-Dünner
Moritz Hardt
161
57
0
23 Sep 2020
Is Q-Learning Provably Efficient? An Extended Analysis
Kushagra Rastogi
Jonathan Lee
Fabrice Harel-Canada
Aditya Sunil Joglekar
OffRL
28
1
0
22 Sep 2020
Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems
Yashesh D. Dhebar
Kalyanmoy Deb
S. Nageshrao
Ling Zhu
Dimitar Filev
68
16
0
20 Sep 2020
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control
Qingrui Zhang
Hao Dong
Wei Pan
37
7
0
20 Sep 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic
Lin Shao
Yifan You
Mengyuan Yan
Qingyun Sun
Jeannette Bohg
89
24
0
18 Sep 2020
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulations
Szymon Brych
Antoine Cully
67
4
0
17 Sep 2020
Elastica: A compliant mechanics environment for soft robotic control
Noel M. Naughton
Jiarui Sun
Arman Tekinalp
Girish Chowdhary
M. Gazzola
46
90
0
17 Sep 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL
LRM
153
606
0
16 Sep 2020
Time your hedge with Deep Reinforcement Learning
Eric Benhamou
David Saltiel
Sandrine Ungari
Abhishek Mukhopadhyay
AIFin
45
15
0
16 Sep 2020
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Mingyang Wan
X. Hu
97
55
0
16 Sep 2020
Soft policy optimization using dual-track advantage estimator
Yubo Huang
Xuechun Wang
Luobao Zou
Zhiwei Zhuang
Weidong Zhang
28
3
0
15 Sep 2020
The Importance of Pessimism in Fixed-Dataset Policy Optimization
Jacob Buckman
Carles Gelada
Marc G. Bellemare
OffRL
125
139
0
15 Sep 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
102
24
0
14 Sep 2020
Multi-Agent Reinforcement Learning in Cournot Games
Yuanyuan Shi
Baosen Zhang
65
7
0
14 Sep 2020
Phasic Policy Gradient
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
100
160
0
09 Sep 2020
Deep Learning and Reinforcement Learning for Autonomous Unmanned Aerial Systems: Roadmap for Theory to Deployment
Jithin Jagannath
Anu Jagannath
Sean Furman
Tyler Gwin
84
25
0
07 Sep 2020
A reinforcement learning approach to hybrid control design
Meet Gandhi
A. Kundu
S. Bhatnagar
20
0
0
02 Sep 2020
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun
Da Huo
Furong Huang
AAML
OffRL
OnRL
114
52
0
02 Sep 2020
Deep Reinforcement Learning for Contact-Rich Skills Using Compliant Movement Primitives
Oren Spector
M. Zacksenhouse
64
12
0
30 Aug 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
117
11
0
30 Aug 2020
Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement Learning with Continuous Action Horizon
Teng Liu
Hong Wang
Jun Li
113
7
0
26 Aug 2020
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
107
52
0
26 Aug 2020
Synthetic Sample Selection via Reinforcement Learning
Jiarong Ye
Yuan Xue
L. R. Long
Sameer Kiran Antani
Z. Xue
K. Cheng
Xiaolei Huang
MedIm
70
24
0
26 Aug 2020
Inverse Policy Evaluation for Value-based Sequential Decision-making
Alan Chan
Kristopher De Asis
R. Sutton
OffRL
87
1
0
26 Aug 2020
Ensuring Monotonic Policy Improvement in Entropy-regularized Value-based Reinforcement Learning
Lingwei Zhu
Takamitsu Matsubara
54
4
0
25 Aug 2020
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
149
50
0
23 Aug 2020
Towards Designing a Self-Managed Machine Learning Inference Serving System inPublic Cloud
Jashwant Raj Gunasekaran
P. Thinakaran
Cyan Subhra Mishra
M. Kandemir
Chita R. Das
18
2
0
21 Aug 2020
Adversarial Imitation Learning via Random Search
Myungjae Shin
Joongheon Kim
43
11
0
21 Aug 2020
Imitation Learning with Sinkhorn Distances
Georgios Papagiannis
Yunpeng Li
OT
75
27
0
20 Aug 2020
A Survey of Knowledge-based Sequential Decision Making under Uncertainty
Shiqi Zhang
Mohan Sridharan
74
16
0
19 Aug 2020
A Framework for Studying Reinforcement Learning and Sim-to-Real in Robot Soccer
H. Bassani
R. A. Delgado
J. N. D. O. L. Junior
H. R. Medeiros
Pedro H. M. Braga
Mateus G. Machado
L. H. C. Santos
Alain Tapp
25
11
0
18 Aug 2020
On the Sample Complexity of Reinforcement Learning with Policy Space Generalization
Wenlong Mou
Zheng Wen
Xi Chen
74
11
0
17 Aug 2020
Generative Design by Reinforcement Learning: Enhancing the Diversity of Topology Optimization Designs
Seowoo Jang
Soyoung Yoo
Namwoo Kang
AI4CE
121
74
0
17 Aug 2020
Inverse Reinforcement Learning with Natural Language Goals
Li Zhou
Kevin Small
89
38
0
16 Aug 2020
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
95
84
0
12 Aug 2020
Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning
Tianjian Chen
Zhanpeng He
M. Ciocarlie
50
47
0
11 Aug 2020
Robot Action Selection Learning via Layered Dimension Informed Program Synthesis
Jarrett Holtz
Arjun Guha
Joydeep Biswas
68
8
0
10 Aug 2020
Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance
L. Xia
60
31
0
09 Aug 2020
Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard
Alex Church
John Lloyd
R. Hadsell
Nathan Lepora
80
31
0
06 Aug 2020
Better Fine-Tuning by Reducing Representational Collapse
Armen Aghajanyan
Akshat Shrivastava
Anchit Gupta
Naman Goyal
Luke Zettlemoyer
S. Gupta
AAML
100
210
0
06 Aug 2020
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals
Ozsel Kilinc
Giovanni Montana
66
5
0
05 Aug 2020
A Relearning Approach to Reinforcement Learning for Control of Smart Buildings
Avisek Naug
Marcos Quiñones-Grueiro
G. Biswas
CLL
52
11
0
04 Aug 2020
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
Siddarth Desai
Ishan Durugkar
Haresh Karnan
Garrett A. Warnell
Josiah P. Hanna
Peter Stone
48
5
0
04 Aug 2020
Proximal Deterministic Policy Gradient
Marco Maggipinto
Gian Antonio Susto
Pratik Chaudhari
OffRL
41
5
0
03 Aug 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
91
43
0
02 Aug 2020
Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network
A. Nguyen
Ngoc Son Nguyen
K. Tran
Erman Tjiputra
Quang-Dieu Tran
65
39
0
31 Jul 2020
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
125
51
0
31 Jul 2020
Previous
1
2
3
...
19
20
21
...
39
40
41
Next