Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06347
Cited By
v1
v2 (latest)
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 8,601 papers shown
Title
Deep Neuroevolution of Recurrent and Discrete World Models
S. Risi
Kenneth O. Stanley
OCL
133
53
0
28 Apr 2019
Arbitrage of Energy Storage in Electricity Markets with Deep Reinforcement Learning
Hanchen Xu
Xiao Li
Xiangyu Zhang
Junbo Zhang
45
27
0
28 Apr 2019
How You Act Tells a Lot: Privacy-Leakage Attack on Deep Reinforcement Learning
Xinlei Pan
Weiyao Wang
Xiaoshuai Zhang
Yue Liu
Jinfeng Yi
Basel Alomair
MIACV
146
26
0
24 Apr 2019
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
93
75
0
24 Apr 2019
Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning
Yann Labbé
Sergey Zagoruyko
Igor Kalevatykh
Ivan Laptev
Justin Carpentier
Mathieu Aubry
Josef Sivic
OCL
134
70
0
23 Apr 2019
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
39
6
0
21 Apr 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
75
268
0
20 Apr 2019
Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition
Chao Gao
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
53
17
0
20 Apr 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
Sungjin Lee
Qi Zhu
Ryuichi Takanobu
Xiang Li
Yaoqin Zhang
...
Jinchao Li
Baolin Peng
Xiujun Li
Minlie Huang
Jianfeng Gao
VLM
96
111
0
18 Apr 2019
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
43
12
0
17 Apr 2019
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa
Tomoyuki Kaneko
73
14
0
17 Apr 2019
Energy-Efficient Slithering Gait Exploration for a Snake-like Robot based on Reinforcement Learning
Zhenshan Bing
Christian Lemke
Zhuangyi Jiang
Kai-Qi Huang
Alois Knoll
30
17
0
16 Apr 2019
Reinforcement Learning for Nested Polar Code Construction
Lingchen Huang
Huazi Zhang
Rong Li
Yiqun Ge
Jun Wang
27
14
0
16 Apr 2019
NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
Golnaz Ghiasi
Nayeon Lee
Ruoming Pang
Quoc V. Le
ObjD
75
1,401
0
16 Apr 2019
Multi-Objective Autonomous Braking System using Naturalistic Dataset
Rafael Vasquez
Bilal Farooq
50
10
0
15 Apr 2019
Learning to Navigate in Indoor Environments: from Memorizing to Reasoning
Liulong Ma
Yanjie Liu
Jiao Chen
Dong Jin
58
10
0
15 Apr 2019
Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control
Chen Liang
Weihong Wang
Zhenghua Liu
Chao Lai
Benchun Zhou
49
29
0
15 Apr 2019
Disentangling Options with Hellinger Distance Regularizer
Minsung Hyun
Junyoung Choi
Nojun Kwak
18
2
0
15 Apr 2019
A Short Survey On Memory Based Reinforcement Learning
Dhruv Ramani
OffRL
71
17
0
14 Apr 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
121
358
0
12 Apr 2019
Knowledge Flow: Improve Upon Your Teachers
Iou-Jen Liu
Jian-wei Peng
Alex Schwing
110
62
0
11 Apr 2019
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey
Yoshiharu Sato
OffRL
45
32
0
10 Apr 2019
Creating Pro-Level AI for a Real-Time Fighting Game Using Deep Reinforcement Learning
In-Suk Oh
Seungeun Rho
Sangbin Moon
Seongho Son
Hyoil Lee
Jinyun Chung
98
53
0
08 Apr 2019
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Thomas W. Anthony
Robert Nishihara
Philipp Moritz
Tim Salimans
John Schulman
81
30
0
07 Apr 2019
Reinforcement Learning with Attention that Works: A Self-Supervised Approach
Anthony Manchin
Ehsan Abbasnejad
Anton Van Den Hengel
74
60
0
06 Apr 2019
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
39
1
0
05 Apr 2019
A Validated Physical Model For Real-Time Simulation of Soft Robotic Snakes
Renato Gasoto
Miles Macklin
Xuan Liu
Yinan Sun
Kenny Erleben
C. Onal
Jie Fu
AI4CE
30
16
0
05 Apr 2019
Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization
Michael Volpp
Lukas P. Frohlich
Kirsten Fischer
Andreas Doerr
Stefan Falkner
Frank Hutter
Christian Daniel
105
85
0
04 Apr 2019
Architecture Search of Dynamic Cells for Semantic Video Segmentation
Vladimir Nekrasov
Hao Chen
Chunhua Shen
Ian Reid
95
21
0
04 Apr 2019
Template-Based Automatic Search of Compact Semantic Segmentation Architectures
Vladimir Nekrasov
Chunhua Shen
Ian Reid
SSeg
VLM
51
10
0
04 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
76
25
0
03 Apr 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
143
1,424
0
02 Apr 2019
Multitask Soft Option Learning
Maximilian Igl
Andrew Gambardella
Jinke He
Nantas Nardelli
N. Siddharth
Wendelin Bohmer
Shimon Whiteson
178
26
0
01 Apr 2019
How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?
Q. Vuong
Sharad Vikram
H. Su
Sicun Gao
Henrik I. Christensen
OOD
79
49
0
28 Mar 2019
AutoSlim: Towards One-Shot Architecture Search for Channel Numbers
Jiahui Yu
Thomas Huang
79
56
0
27 Mar 2019
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
79
28
0
27 Mar 2019
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
132
43
0
27 Mar 2019
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Linnan Wang
Yiyang Zhao
Yuu Jinnai
Yuandong Tian
Rodrigo Fonseca
BDL
89
96
0
26 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
97
29
0
25 Mar 2019
Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors
Fang-I Hsiao
Jui-Hsuan Kuo
Min Sun
OffRL
43
14
0
25 Mar 2019
Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions
Xiao Li
C. Belta
46
41
0
23 Mar 2019
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based Algorithms on Mobile Robots
Tingguang Li
Danny Ho
Chenming Li
Delong Zhu
Chaoqun Wang
Max Meng
3DV
56
57
0
23 Mar 2019
TTR-Based Reward for Reinforcement Learning with Implicit Model Priors
Xubo Lyu
Mo Chen
OffRL
38
3
0
23 Mar 2019
Iterative Reinforcement Learning Based Design of Dynamic Locomotion Skills for Cassie
Zhaoming Xie
Patrick Clary
Jeremy Dao
Pedro Morais
J. Hurst
M. van de Panne
73
67
0
22 Mar 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
90
663
0
19 Mar 2019
Truly Proximal Policy Optimization
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
78
126
0
19 Mar 2019
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
78
6
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
65
44
0
18 Mar 2019
Scheduled Intrinsic Drive: A Hierarchical Take on Intrinsically Motivated Exploration
Jingwei Zhang
Niklas Wetzel
Nicolai Dorka
Joschka Boedecker
Wolfram Burgard
65
26
0
18 Mar 2019
Adaptive Variance for Changing Sparse-Reward Environments
Xingyu Lin
Pengsheng Guo
Carlos Florensa
David Held
59
6
0
15 Mar 2019
Previous
1
2
3
...
164
165
166
...
171
172
173
Next