Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,012 papers shown
Title
Solve routing problems with a residual edge-graph attention neural network
Kun Lei
Peng Guo
Yi Wang
Xiao Wu
Wenchao Zhao
79
59
0
06 May 2021
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance
Wei Guo
Marc Brittain
Peng Wei
124
19
0
05 May 2021
On the Linear convergence of Natural Policy Gradient Algorithm
S. Khodadadian
P. Jhunjhunwala
Sushil Mahavir Varma
S. T. Maguluri
88
57
0
04 May 2021
Hierarchical Reinforcement Learning for Air-to-Air Combat
Adrian P. Pope
J. Ide
Daria Mićović
Henry Diaz
D. Rosenbluth
Lee Ritholtz
Jason C. Twedt
Thayne T. Walker
K. Alcedo
D. Javorsek
64
74
0
03 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
56
8
0
03 May 2021
Adaptive Adversarial Training for Meta Reinforcement Learning
Shiqi Chen
Zhengyu Chen
Donglin Wang
79
7
0
27 Apr 2021
Efficient Hyperparameter Optimization for Physics-based Character Animation
Zeshi Yang
Zhiqi Yin
AI4CE
91
9
0
26 Apr 2021
Optimize Neural Fictitious Self-Play in Regret Minimization Thinking
Yuxuan Chen
Li Zhang
Shijian Li
Gang Pan
45
2
0
22 Apr 2021
Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning
Songan Zhang
Lu Wen
H. Peng
H. E. Tseng
41
10
0
18 Apr 2021
Multitasking Inhibits Semantic Drift
Athul Paul Jacob
M. Lewis
Jacob Andreas
88
13
0
15 Apr 2021
Safe Continuous Control with Constrained Model-Based Policy Optimization
Moritz A. Zanger
Karam Daaboul
J. Marius Zöllner
80
19
0
14 Apr 2021
GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback
Jie Huang
Rongshun Juan
R. Gomez
Keisuke Nakamura
Q. Sha
Bo He
Guangliang Li
77
10
0
14 Apr 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
56
21
0
13 Apr 2021
Learning and Planning in Complex Action Spaces
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
103
79
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
111
66
0
13 Apr 2021
Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack
Xinyi Zhang
Chengfang Fang
Jie Shi
MIACV
MLAU
SILM
98
16
0
13 Apr 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
97
111
0
12 Apr 2021
Probabilistic Programming Bots in Intuitive Physics Game Play
Fahad Alhasoun
Sarah Alnegheimish
J. Tenenbaum
37
1
0
05 Apr 2021
No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE
HaoChih Lin
Baopu Li
Xin Zhou
Jiankun Wang
Max Meng
43
6
0
03 Apr 2021
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning
Hiroki Furuta
Tadashi Kozuno
T. Matsushima
Y. Matsuo
S. Gu
129
14
0
31 Mar 2021
Deep Reinforcement Learning for Constrained Field Development Optimization in Subsurface Two-phase Flow
Y. Nasir
Jincong He
Chaoshun Hu
Shusei Tanaka
Kainan Wang
X. Wen
AI4CE
49
19
0
31 Mar 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
97
104
0
30 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
63
2
0
26 Mar 2021
Adversarial Imitation Learning with Trajectorial Augmentation and Correction
Dafni Antotsiou
C. Ciliberto
Tae-Kyun Kim
63
10
0
25 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
96
44
0
25 Mar 2021
The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning with Efficient Communication
Xing Xu
Rongpeng Li
Zhifeng Zhao
Honggang Zhang
86
12
0
24 Mar 2021
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation
Andrea Zanette
Ching-An Cheng
Alekh Agarwal
112
53
0
24 Mar 2021
Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method
Duo Xu
Faramarz Fekri
67
8
0
22 Mar 2021
Learning to Simulate on Sparse Trajectory Data
Hua Wei
Chacha Chen
Chang-rui Liu
Guanjie Zheng
Z. Li
45
13
0
22 Mar 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
Sebastian Curi
Ilija Bogunovic
Andreas Krause
86
17
0
18 Mar 2021
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
90
15
0
16 Mar 2021
Hierarchical Reinforcement Learning Framework for Stochastic Spaceflight Campaign Design
Yuji Takubo
Hao Chen
K. Ho
37
13
0
16 Mar 2021
Robust MAML: Prioritization task buffer with adaptive learning process for model-agnostic meta-learning
Thanh Nguyen
Tung M. Luu
T. Pham
Sanzhar Rakhimkul
Chang D. Yoo
58
11
0
15 Mar 2021
Modelling Human Kinetics and Kinematics during Walking using Reinforcement Learning
Visak C. V. Kumar
24
0
0
15 Mar 2021
A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training
Adnan Haider
Chao Zhang
Florian Kreyssig
P. Woodland
120
7
0
12 Mar 2021
Domain Curiosity: Learning Efficient Data Collection Strategies for Domain Adaptation
Karol Arndt
Oliver Struckmeier
Ville Kyrki
46
1
0
12 Mar 2021
Policy Search with Rare Significant Events: Choosing the Right Partner to Cooperate with
Paul Ecoffet
Nicolas Fontbonne
Jean-Baptiste André
Nicolas Bredèche
37
3
0
11 Mar 2021
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning
Guillaume Bellegarda
Yiyu Chen
Zhuochen Liu
Quan Nguyen
91
47
0
11 Mar 2021
Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning
Bernie Wang
Si-ting Xu
Kurt Keutzer
Yang Gao
Bichen Wu
SSL
OffRL
61
7
0
10 Mar 2021
Causal-aware Safe Policy Improvement for Task-oriented dialogue
Govardana Sachithanandam Ramachandran
Kazuma Hashimoto
Caiming Xiong
OffRL
54
11
0
10 Mar 2021
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
146
207
0
08 Mar 2021
Adaptive Agent Architecture for Real-time Human-Agent Teaming
Tianwei Ni
Huao Li
Siddharth Agrawal
S. Raja
Fan Jia
Yikang Gui
Dana Hughes
M. Lewis
Katia Sycara
41
0
0
07 Mar 2021
Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning
Hidenori Itaya
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
K. Sugiura
76
19
0
06 Mar 2021
Deep reinforcement learning in medical imaging: A literature review
S. Kevin Zhou
Hoang Ngan Le
Khoa Luu
Hien V Nguyen
N. Ayache
LM&MA
OffRL
MedIm
84
149
0
05 Mar 2021
Addressing Action Oscillations through Learning Policy Inertia
Chong Chen
Hongyao Tang
Jianye Hao
Wulong Liu
Zhaopeng Meng
56
18
0
03 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
138
4
0
03 Mar 2021
Design of an Affordable Prosthetic Arm Equipped with Deep Learning Vision-Based Manipulation
A. Imran
William Escobar
F. Barez
70
7
0
03 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
Matthieu Geist
OffRL
112
41
0
02 Mar 2021
Deep Reinforcement Learning for URLLC data management on top of scheduled eMBB traffic
Fabio Saggese
Luca Pasqualini
M. Moretti
A. Abrardo
28
16
0
02 Mar 2021
Model-based Constrained Reinforcement Learning using Generalized Control Barrier Function
Haitong Ma
Jianyu Chen
Shengbo Eben Li
Ziyu Lin
Yang Guan
Yangang Ren
Sifa Zheng
69
67
0
02 Mar 2021
Previous
1
2
3
...
15
16
17
...
39
40
41
Next