Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Anirudh Goyal
Shagun Sodhani
Jonathan Binas
Xue Bin Peng
Sergey Levine
Yoshua Bengio
91
49
0
25 Jun 2019
Sequential Neural Processes
Gautam Singh
Jaesik Yoon
Youngsung Son
Sungjin Ahn
BDL
AI4TS
115
85
0
24 Jun 2019
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. Dýakonov
OffRL
61
39
0
24 Jun 2019
Proximal Distilled Evolutionary Reinforcement Learning
Cristian Bodnar
Ben Day
Pietro Lio
101
76
0
24 Jun 2019
Boosting for Control of Dynamical Systems
Naman Agarwal
Nataly Brukhim
Elad Hazan
Zhou Lu
56
13
0
20 Jun 2019
Experience Replay Optimization
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Helen Zhou
OffRL
59
105
0
19 Jun 2019
Calibrated Model-Based Deep Reinforcement Learning
Ali Malik
Volodymyr Kuleshov
Jiaming Song
Danny Nemer
Harlan Seymour
Stefano Ermon
159
55
0
19 Jun 2019
Wasserstein Adversarial Imitation Learning
Huang Xiao
Michael Herman
Joerg Wagner
Sebastian Ziesche
Jalal Etesami
T. H. Linh
43
72
0
19 Jun 2019
RadGrad: Active learning with loss gradients
Paul Budnarain
Renato Ferreira Pinto Junior
Ilan Kogan
107
3
0
18 Jun 2019
Directed Exploration for Reinforcement Learning
Z. Guo
Emma Brunskill
71
12
0
18 Jun 2019
Hill Climbing on Value Estimates for Search-control in Dyna
Yangchen Pan
Hengshuai Yao
Amir-massoud Farahmand
Martha White
87
18
0
18 Jun 2019
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning
Tadashi Kozuno
Dongqi Han
Kenji Doya
OffRL
43
2
0
18 Jun 2019
Weight Agnostic Neural Networks
Adam Gaier
David R Ha
OOD
67
242
0
11 Jun 2019
Learning to Score Behaviors for Guided Policy Optimization
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
A. Choromańska
K. Choromanski
Michael I. Jordan
91
39
0
11 Jun 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
85
90
0
10 Jun 2019
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Che Wang
George Andriopoulos
51
45
0
10 Jun 2019
Data-Efficient and Safe Learning for Humanoid Locomotion Aided by a Dynamic Balancing Model
Junhyeok Ahn
Jaemin Lee
Luis Sentis
55
15
0
10 Jun 2019
Neural Heterogeneous Scheduler
Tegg Taekyong Sung
Valliappa Chockalingam
Alex Yahja
Bo Ryu
18
3
0
09 Jun 2019
An Extensible Interactive Interface for Agent Design
Matthew Rahtz
James Fang
Anca Dragan
Dylan Hadfield-Menell
60
1
0
06 Jun 2019
A Survey of Behavior Learning Applications in Robotics -- State of the Art and Perspectives
Alexander Fabisch
Christoph Petzoldt
M. Otto
Frank Kirchner
AI4CE
54
13
0
05 Jun 2019
Neuropathic Pain Diagnosis Simulator for Causal Discovery Algorithm Evaluation
Ruibo Tu
Kun Zhang
Bo Christer Bertilson
Hedvig Kjellström
Cheng Zhang
OOD
CML
95
43
0
04 Jun 2019
Reinforcement Learning with Low-Complexity Liquid State Machines
Wachirawit Ponghiran
G. Srinivasan
Kaushik Roy
47
14
0
04 Jun 2019
Lattice Map Spiking Neural Networks (LM-SNNs) for Clustering and Classifying Image Data
Hananel Hazan
D. J. Saunders
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
61
17
0
04 Jun 2019
Off-Policy Evaluation via Off-Policy Classification
A. Irpan
Kanishka Rao
Konstantinos Bousmalis
Chris Harris
Julian Ibarz
Sergey Levine
OffRL
83
50
0
04 Jun 2019
Hamiltonian Neural Networks
S. Greydanus
Misko Dzamba
J. Yosinski
PINN
AI4CE
143
900
0
04 Jun 2019
Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)
Vahid Behzadan
W. Hsu
AAML
64
8
0
03 Jun 2019
RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies
Vahid Behzadan
W. Hsu
AAML
OffRL
66
9
0
03 Jun 2019
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation
Srivatsan Krishnan
Behzad Boroujerdian
William Fu
Aleksandra Faust
Vijay Janapa Reddi
75
37
0
02 Jun 2019
Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning
Matthew A. Wright
R. Horowitz
52
3
0
31 May 2019
Defining Admissible Rewards for High Confidence Policy Evaluation
Niranjani Prasad
Barbara E. Engelhardt
Finale Doshi-Velez
OffRL
53
6
0
30 May 2019
Towards Finding Longer Proofs
Zsolt Zombori
Adrián Csiszárik
Henryk Michalewski
C. Kaliszyk
Josef Urban
OffRL
LRM
83
16
0
30 May 2019
Structured Monte Carlo Sampling for Nonisotropic Distributions via Determinantal Point Processes
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
77
3
0
29 May 2019
Adversarial Imitation Learning from Incomplete Demonstrations
Mingfei Sun
Xiaojuan Ma
78
29
0
29 May 2019
Linear interpolation gives better gradients than Gaussian smoothing in derivative-free optimization
A. Berahas
Liyuan Cao
K. Choromanski
K. Scheinberg
136
19
0
29 May 2019
A General Markov Decision Process Framework for Directly Learning Optimal Control Policies
Yingdong Lu
M. Squillante
C. Wu
AI4CE
8
2
0
28 May 2019
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning
Caleb Chuck
Supawit Chockchowwat
S. Niekum
67
14
0
27 May 2019
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video
Miguel Jaques
Michael G. Burke
Timothy M. Hospedales
VGen
PINN
126
45
0
27 May 2019
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
S. Reddy
Anca Dragan
Sergey Levine
OffRL
69
52
0
27 May 2019
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
33
2
0
27 May 2019
Policy Search by Target Distribution Learning for Continuous Control
Wei Shen
Yuanqi Li
Jian Li
60
6
0
27 May 2019
Explainable Reinforcement Learning Through a Causal Lens
Prashan Madumal
Tim Miller
L. Sonenberg
F. Vetere
CML
116
362
0
27 May 2019
Provably Efficient Imitation Learning from Observation Alone
Wen Sun
Anirudh Vemula
Byron Boots
J. Andrew Bagnell
167
107
0
27 May 2019
Operation and Imitation under Safety-Aware Shared Control
Alexander Broad
Todd Murphey
B. Argall
67
10
0
26 May 2019
Prioritized Sequence Experience Replay
Marc Brittain
J. R. Bertram
Xuxi Yang
Peng Wei
73
45
0
25 May 2019
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies
Xue Bin Peng
Michael Chang
Grace Zhang
Pieter Abbeel
Sergey Levine
83
197
0
23 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
72
70
0
23 May 2019
Combine PPO with NES to Improve Exploration
Lianjiang Li
Yunrong Yang
Bingna Li
13
1
0
23 May 2019
Learning Robust Options by Conditional Value at Risk Optimization
Takuya Hiraoka
Takahisa Imagawa
Tatsuya Mori
Takashi Onishi
Yoshimasa Tsuruoka
80
27
0
22 May 2019
Deep Signature Transforms
Patrick Kidger
Patrick Kidger
Imanol Perez Arribas
C. Salvi
Terry Lyons
SLR
163
131
0
21 May 2019
Evolving Rewards to Automate Reinforcement Learning
Aleksandra Faust
Anthony G. Francis
Dar Mehta
77
51
0
18 May 2019
Previous
1
2
3
...
43
44
45
...
50
51
52
Next