Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 1,114 papers shown
Title
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean Networks
S. Moschoyiannis
Evangelos Chatzaroulas
Vytenis Sliogeris
Yuhu Wu
BDL
OffRL
AI4CE
16
7
0
21 Oct 2022
Global Convergence of Direct Policy Search for State-Feedback
H
∞
\mathcal{H}_\infty
H
∞
Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xing-ming Guo
Bin Hu
41
12
0
20 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
29
5
0
20 Oct 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
36
2
0
20 Oct 2022
Proximal Learning With Opponent-Learning Awareness
S. Zhao
Chris Xiaoxuan Lu
Roger C. Grosse
Jakob N. Foerster
34
21
0
18 Oct 2022
Deep Black-Box Reinforcement Learning with Movement Primitives
Fabian Otto
Onur Celik
Hongyi Zhou
Hanna Ziesche
Ngo Anh Vien
Gerhard Neumann
OffRL
24
19
0
18 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Gang Hua
Mingxuan Jing
Fengxiang He
Wen-bing Huang
24
18
0
15 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
26
62
0
15 Oct 2022
A Concise Introduction to Reinforcement Learning in Robotics
Akash Nagaraj
Mukund Sood
B. Patil
23
22
0
13 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
18
16
0
13 Oct 2022
Observed Adversaries in Deep Reinforcement Learning
Eugene Lim
Harold Soh
AAML
19
0
0
13 Oct 2022
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Furong Huang
OOD
AAML
OffRL
28
47
0
12 Oct 2022
Traffic-Aware Autonomous Driving with Differentiable Traffic Simulation
L. Zheng
Sanghyun Son
Ming-Chyuan Lin
35
3
0
07 Oct 2022
Self-Adaptive Driving in Nonstationary Environments through Conjectural Online Lookahead Adaptation
Tao Li
Haozhe Lei
Quanyan Zhu
26
11
0
06 Oct 2022
Time-Varying Propensity Score to Bridge the Gap between the Past and Present
Rasool Fakoor
Jonas W. Mueller
Zachary Chase Lipton
Pratik Chaudhari
Alexander J. Smola
OOD
AI4TS
32
3
0
04 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
31
240
0
03 Oct 2022
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
59
35
0
03 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
16
10
0
02 Oct 2022
Policy Gradients for Probabilistic Constrained Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
26
6
0
02 Oct 2022
Midas: A Multi-Joint Robotics Simulator with Intersection-Free Frictional Contact
Yunuo Chen
Minchen Li
Wenlong Lu
Chuyuan Fu
Chenfanfu Jiang
18
4
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
21
40
0
29 Sep 2022
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation
Vittorio Giammarino
James Queeney
Lucas C. Carstensen
Michael Hasselmo
I. Paschalidis
OffRL
50
4
0
25 Sep 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
32
3
0
24 Sep 2022
A Unified Perspective on Natural Gradient Variational Inference with Gaussian Mixture Models
Oleg Arenz
Philipp Dahlinger
Zihan Ye
Michael Volpp
Gerhard Neumann
39
15
0
23 Sep 2022
Model-Free Reinforcement Learning for Asset Allocation
Adebayo Oshingbesan
Eniola Ajiboye
Peruth Kamashazi
Timothy Mbaka
OffRL
19
1
0
21 Sep 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
46
12
0
21 Sep 2022
Deep Generalized Schrödinger Bridge
Guan-Horng Liu
T. Chen
Oswin So
Evangelos A. Theodorou
OT
AI4CE
16
35
0
20 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo-wen Li
Ding Zhao
74
45
0
16 Sep 2022
Reinforcement Learning-Based Cooperative P2P Power Trading between DC Nanogrid Clusters with Wind and PV Energy Resources
Sangkeum Lee
S. Nengroo
Hojun Jin
Taewook Heo
Y. Doh
Chun-leung Lee
Dongsoo Har
14
2
0
16 Sep 2022
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
Hao Fei
Hongyao Tang
Jianye Hao
Yan Zheng
OffRL
28
0
0
16 Sep 2022
Scalable Task-Driven Robotic Swarm Control via Collision Avoidance and Learning Mean-Field Control
Kai Cui
Mengguang Li
Christian Fabian
Heinz Koeppl
AI4CE
37
5
0
15 Sep 2022
Multi-Objective Policy Gradients with Topological Constraints
K. H. Wray
Stas Tiomkin
Mykel J. Kochenderfer
Pieter Abbeel
37
2
0
15 Sep 2022
Constrained Update Projection Approach to Safe Policy Optimization
Long Yang
Jiaming Ji
Juntao Dai
Linrui Zhang
Binbin Zhou
Pengfei Li
Yaodong Yang
Gang Pan
38
43
0
15 Sep 2022
Towards self-attention based visual navigation in the real world
Jaime Ruiz-Serra
Jack White
Stephen M. Petrie
T. Kameneva
C. McCarthy
28
0
0
15 Sep 2022
Model-based Reinforcement Learning with Multi-step Plan Value Estimation
Hao-Chu Lin
Yihao Sun
Jiajin Zhang
Yang Yu
OffRL
34
7
0
12 Sep 2022
Gradient Descent Temporal Difference-difference Learning
Rong Zhu
James M. Murray
OffRL
19
1
0
10 Sep 2022
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
15
0
0
04 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
29
17
0
26 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Entropy Augmented Reinforcement Learning
Jianfei Ma
36
0
0
19 Aug 2022
A Risk-Sensitive Approach to Policy Optimization
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
29
6
0
19 Aug 2022
Performance Optimization for Semantic Communications: An Attention-based Reinforcement Learning Approach
Yining Wang
Mingzhe Chen
Tao Luo
Walid Saad
Dusit Niyato
H. Vincent Poor
Shuguang Cui
21
127
0
17 Aug 2022
Path Planning of Cleaning Robot with Reinforcement Learning
Woohyeon Moon
Bumgeun Park
S. Nengroo
Taeyoung Kim
Dongsoo Har
22
17
0
17 Aug 2022
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
Chang Yang
Ruiyu Wang
Xinrun Wang
Zhen Wang
OffRL
27
3
0
07 Aug 2022
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts
Yuxin Pan
Fangzhen Lin
OffRL
22
3
0
04 Aug 2022
Bayesian regularization of empirical MDPs
Samarth Gupta
Daniel N. Hill
Lexing Ying
Inderjit Dhillon
OffRL
24
0
0
03 Aug 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
39
13
0
03 Aug 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
26
16
0
02 Aug 2022
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
31
3
0
02 Aug 2022
Previous
1
2
3
...
6
7
8
...
21
22
23
Next