1502.05477
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
50 / 3,098 papers shown
Title
SFV: Reinforcement Learning of Physical Skills from Videos
Xue Bin Peng
Angjoo Kanazawa
Jitendra Malik
Pieter Abbeel
Sergey Levine
24
65
0
08 Oct 2018
Safe-To-Explore State Spaces: Ensuring Safe Exploration in Policy Search with Hierarchical Task Optimization
Jens Lundell
R. Krug
Erik Schaffernicht
Todor Stoyanov
Ville Kyrki
16
3
0
08 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
6
738
0
05 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
Learning Scheduling Algorithms for Data Processing Clusters
Hongzi Mao
Malte Schwarzkopf
S. Venkatakrishnan
Zili Meng
Mohammad Alizadeh
OffRL
20
638
0
03 Oct 2018
Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
33
206
0
02 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
32
160
0
02 Oct 2018
EMI: Exploration with Mutual Information
Hyoungseok Kim
Jaekyeom Kim
Yeonwoo Jeong
Sergey Levine
Hyun Oh Song
21
5
0
02 Oct 2018
The Dreaming Variational Autoencoder for Reinforcement Learning Environments
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
DRL
22
17
0
02 Oct 2018
Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning
Subhajit Chaudhury
Daiki Kimura
Asim Munawar
Ryuki Tachibana
GAN
VGen
22
3
0
02 Oct 2018
Bayesian Policy Optimization for Model Uncertainty
Gilwoo Lee
Brian Hou
Aditya Mandalika
Jeongseok Lee
Sanjiban Choudhury
S. Srinivasa
14
41
0
01 Oct 2018
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Xue Bin Peng
Angjoo Kanazawa
Sam Toyer
Pieter Abbeel
Sergey Levine
30
214
0
01 Oct 2018
Bayesian Transfer Reinforcement Learning with Prior Knowledge Rules
Michalis K. Titsias
Sotirios Nikoloutsopoulos
BDL
OffRL
18
3
0
30 Sep 2018
Refining Manually-Designed Symbol Grounding and High-Level Planning by Policy Gradients
Takuya Hiraoka
Takashi Onishi
Takahisa Imagawa
Yoshimasa Tsuruoka
11
2
0
29 Sep 2018
Using Deep Reinforcement Learning to Learn High-Level Policies on the ATRIAS Biped
Tianyu Li
Akshara Rai
H. Geyer
C. Atkeson
30
51
0
28 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
39
29
0
27 Sep 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
49
670
0
21 Sep 2018
Constrained Exploration and Recovery from Experience Shaping
Tu-Hoa Pham
Giovanni De Magistris
Don Joven Agravante
Subhajit Chaudhury
Asim Munawar
Ryuki Tachibana
13
2
0
21 Sep 2018
IntelligentCrowd: Mobile Crowdsensing via Multi-Agent Reinforcement Learning
Yize Chen
Hao Wang
HAI
14
25
0
20 Sep 2018
Benchmarking Reinforcement Learning Algorithms on Real-World Robots
A. R. Mahmood
D. Korenkevych
Gautham Vasan
W. Ma
James Bergstra
OffRL
22
155
0
20 Sep 2018
Sim-to-Real Transfer of Robot Learning with Variable Length Inputs
Vibhavari Dasagi
Robert Lee
Serena Mou
Jake Bruce
Niko Sünderhauf
Jurgen Leitner
OffRL
25
3
0
20 Sep 2018
TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Peng Sun
Xinghai Sun
Lei Han
Jiechao Xiong
Qing Wang
...
Yang Zheng
Ji Liu
Yongsheng Liu
Han Liu
Tong Zhang
37
75
0
19 Sep 2018
Leveraging Contact Forces for Learning to Grasp
Hamza Merzic
Miroslav Bogdanovic
Daniel Kappler
Ludovic Righetti
Jeannette Bohg
13
44
0
19 Sep 2018
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
19
3
0
18 Sep 2018
Adversarial Imitation via Variational Inverse Reinforcement Learning
A. H. Qureshi
Byron Boots
Michael C. Yip
22
61
0
17 Sep 2018
Curriculum goal masking for continuous deep reinforcement learning
Manfred Eppe
S. Magg
S. Wermter
27
19
0
17 Sep 2018
Policy Optimization via Importance Sampling
Alberto Maria Metelli
Matteo Papini
Francesco Faccio
Marcello Restelli
OffRL
21
89
0
17 Sep 2018
Improvements on Hindsight Learning
Ameet Deshpande
Srikanth Sarma
Ashutosh Jha
Balaraman Ravindran
OffRL
21
3
0
16 Sep 2018
Inspiration Learning through Preferences
Nir Baram
Shie Mannor
23
1
0
16 Sep 2018
Adversarial Reinforcement Learning for Observer Design in Autonomous Systems under Cyber Attacks
Abhishek Gupta
Zhaoyuan Yang
AAML
19
7
0
15 Sep 2018
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
30
225
0
14 Sep 2018
Deep Reinforcement Learning for Event-Triggered Control
Dominik Baumann
Jia Jie Zhu
Georg Martius
Sebastian Trimpe
BDL
30
60
0
13 Sep 2018
Multi-task Deep Reinforcement Learning with PopArt
Matteo Hessel
Hubert Soyer
L. Espeholt
Wojciech M. Czarnecki
Simon Schmitt
H. V. Hasselt
22
315
0
12 Sep 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
16
8
0
10 Sep 2018
Learning Adaptive Display Exposure for Real-Time Advertising
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
...
Xiaotian Hao
Yixi Wang
Han Li
Jian Xu
Kun Gai
22
6
0
10 Sep 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
41
256
0
09 Sep 2018
Learning Invariances for Policy Generalization
Rémi Tachet des Combes
Philip Bachman
H. V. Seijen
20
12
0
07 Sep 2018
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization
Bo Liu
Tengyang Xie
Yangyang Xu
Mohammad Ghavamzadeh
Yinlam Chow
Daoming Lyu
Daesub Yoon
14
30
0
07 Sep 2018
Emergence of Human-comparable Balancing Behaviors by Deep Reinforcement Learning
Chuanyu Yang
Taku Komura
Zhibin Li
32
20
0
06 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
16
47
0
06 Sep 2018
A Robotic Auto-Focus System based on Deep Reinforcement Learning
Xiaofan Yu
Runze Yu
Jingsong Yang
Xiaohui Duan
26
9
0
05 Sep 2018
Effective Exploration for Deep Reinforcement Learning via Bootstrapped Q-Ensembles under Tsallis Entropy Regularization
Gang Chen
Yiming Peng
Mengjie Zhang
22
14
0
02 Sep 2018
Gibson Env: Real-World Perception for Embodied Agents
F. Xia
Amir Zamir
Zhi-Yang He
Alexander Sax
Jitendra Malik
Silvio Savarese
AI4CE
LM&Ro
34
817
0
31 Aug 2018
A Coordinate-Free Construction of Scalable Natural Gradient
Kevin Luk
Roger C. Grosse
11
11
0
30 Aug 2018
Application of Self-Play Reinforcement Learning to a Four-Player Game of Imperfect Information
Henry Charlesworth
SSL
8
12
0
30 Aug 2018
Adversarial Deep Reinforcement Learning in Portfolio Management
Zhipeng Liang
Hao Chen
Junhao Zhu
Kangkang Jiang
Yanran Li
24
33
0
29 Aug 2018
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
Marvin Zhang
Sharad Vikram
Laura M. Smith
Pieter Abbeel
Matthew J. Johnson
Sergey Levine
OffRL
23
41
0
28 Aug 2018
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset
He Huang
Yujing Shen
Jiankai Sun
Cewu Lu
3DV
23
2
0
25 Aug 2018
Proximal Policy Optimization and its Dynamic Version for Sequence Generation
Yi-Lin Tuan
Jinzhi Zhang
Yujia Li
Hung-yi Lee
23
10
0
24 Aug 2018
LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations
Michael Schaarschmidt
A. Kuhnle
Ben Ellis
Kai Fricke
Felix Gessert
Eiko Yoneki
OffRL
32
41
0
23 Aug 2018