Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning
Brian Yang
Jesse Zhang
Vitchyr H. Pong
Sergey Levine
Dinesh Jayaraman
77
37
0
17 May 2019
Mastering the Game of Sungka from Random Play
Darwin Bautista
Raimarc S. Dionido
19
0
0
17 May 2019
Bias-Reduced Hindsight Experience Replay with Virtual Goal Prioritization
Binyamin Manela
Armin Biess
44
23
0
14 May 2019
Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised Action Spaces
Craig J. Bester
Steven D. James
George Konidaris
56
57
0
10 May 2019
Autonomous Management of Energy-Harvesting IoT Nodes Using Deep Reinforcement Learning
Abdulmajid Murad
F. Kraemer
Kerstin Bach
Gavin Taylor
48
26
0
10 May 2019
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
80
31
0
08 May 2019
Longitudinal Dynamic versus Kinematic Models for Car-Following Control Using Deep Reinforcement Learning
Yuan Lin
J. McPhee
N. L. Azad
AI4CE
66
34
0
07 May 2019
A Complementary Learning Systems Approach to Temporal Difference Learning
Sam Blakeman
D. Mareschal
54
42
0
07 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
93
56
0
05 May 2019
Hierarchical Policy Learning is Sensitive to Goal Space Design
Zach Dwiel
Madhavun Candadai
Mariano Phielipp
Arjun K. Bansal
79
15
0
04 May 2019
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Mingzhang Yin
Yuguang Yue
Mingyuan Zhou
66
23
0
04 May 2019
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
64
100
0
02 May 2019
Coevo: a collaborative design platform with artificial agents
Gerard Serra
D. Miralles
8
2
0
30 Apr 2019
Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning
Kacper Kielak
GAN
23
0
0
30 Apr 2019
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
149
73
0
29 Apr 2019
Reinforcement Learning Based Orchestration for Elastic Services
Mauricio Fadel Argerich
Bin Cheng
Jonathan Fürst
32
7
0
26 Apr 2019
Self Training Autonomous Driving Agent
Shashank Kotyan
Danilo Vasconcellos Vargas
Venkanna Udutalapally
29
4
0
26 Apr 2019
Meta-Sim: Learning to Generate Synthetic Datasets
Amlan Kar
Aayush Prakash
Ming-Yuan Liu
Eric Cameracci
Justin Yuan
Matt Rusiniak
David Acuna
Antonio Torralba
Sanja Fidler
144
252
0
25 Apr 2019
Evolving Neural Networks in Reinforcement Learning by means of UMDAc
Mikel Malagón
Josu Ceberio
54
2
0
24 Apr 2019
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
39
6
0
21 Apr 2019
Learning Manipulation under Physics Constraints with Visual Perception
Wenbin Li
A. Leonardis
Jeannette Bohg
Mario Fritz
SSL
OCL
31
7
0
19 Apr 2019
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa
Tomoyuki Kaneko
73
14
0
17 Apr 2019
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
77
64
0
15 Apr 2019
Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation
Benjamin Beyret
A. Shafti
A. Faisal
142
74
0
14 Apr 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
121
358
0
12 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
47
2
0
08 Apr 2019
Reducing catastrophic forgetting when evolving neural networks
Joseph Early
23
2
0
05 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
76
25
0
03 Apr 2019
VRGym: A Virtual Testbed for Physical and Interactive AI
Xu Xie
Hangxin Liu
Zhenliang Zhang
Yuxing Qiu
Feng Gao
Siyuan Qi
Yixin Zhu
Song-Chun Zhu
58
27
0
02 Apr 2019
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
Christian Rupprecht
Cyril Ibrahim
C. Pal
96
32
0
02 Apr 2019
Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies
J. Tordesillas
Juncal Arbelaiz
OffRL
34
3
0
02 Apr 2019
Multitask Soft Option Learning
Maximilian Igl
Andrew Gambardella
Jinke He
Nantas Nardelli
N. Siddharth
Wendelin Bohmer
Shimon Whiteson
187
26
0
01 Apr 2019
Guided Meta-Policy Search
Russell Mendonca
Abhishek Gupta
Rosen Kralev
Pieter Abbeel
Sergey Levine
Chelsea Finn
68
57
0
01 Apr 2019
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
151
43
0
27 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
97
29
0
25 Mar 2019
Using RGB Image as Visual Input for Mapless Robot Navigation
Liulong Ma
Yanjie Liu
Jiao Chen
SSL
92
17
0
24 Mar 2019
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based Algorithms on Mobile Robots
Tingguang Li
Danny Ho
Chenming Li
Delong Zhu
Chaoqun Wang
Max Meng
3DV
66
57
0
23 Mar 2019
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention
Bharat Prakash
Mohit Khatwani
Nicholas R. Waytowich
T. Mohsenin
OffRL
58
19
0
22 Mar 2019
DQN with model-based exploration: efficient learning on environments with sparse rewards
Stephen Gou
Yuyang Liu
52
14
0
22 Mar 2019
Adaptive Genomic Evolution of Neural Network Topologies (AGENT) for State-to-Action Mapping in Autonomous Agents
A. Behjat
Sharat Chidambaran
Souma Chowdhury
48
14
0
17 Mar 2019
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and Gazebo
N. G. Lopez
Y. Nuin
Elias Barba Moral
Lander Usategui San Juan
A. Rueda
Víctor Mayoral-Vilches
R. Kojcev
OffRL
50
35
0
14 Mar 2019
Deep Reinforcement Learning with Feedback-based Exploration
Jan Scholten
Daan Wout
C. Celemin
Jens Kober
62
4
0
14 Mar 2019
Learning Gaussian Policies from Corrective Human Feedback
Daan Wout
Jan Scholten
C. Celemin
Jens Kober
93
2
0
12 Mar 2019
Universally Slimmable Networks and Improved Training Techniques
Jiahui Yu
Thomas Huang
92
389
0
12 Mar 2019
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
Artem Molchanov
Tao Chen
Wolfgang Hönig
James A. Preiss
Nora Ayanian
Gaurav Sukhatme
159
111
0
11 Mar 2019
Hybrid Reinforcement Learning with Expert State Sequences
Xiaoxiao Guo
Shiyu Chang
Mo Yu
Gerald Tesauro
Murray Campbell
OffRL
54
33
0
11 Mar 2019
Orthogonal Estimation of Wasserstein Distances
Mark Rowland
Jiri Hron
Yunhao Tang
K. Choromanski
Tamás Sarlós
Adrian Weller
91
43
0
09 Mar 2019
Adaptive Power System Emergency Control using Deep Reinforcement Learning
Qiuhua Huang
Renke Huang
Weituo Hao
Jie Tan
Rui Fan
Zhenyu Huang
111
279
0
09 Mar 2019
Dyna-AIL : Adversarial Imitation Learning by Planning
Vaibhav Saxena
Srinivasan Sivanandan
Pulkit Mathur
41
1
0
08 Mar 2019
Provably Robust Blackbox Optimization for Reinforcement Learning
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
Deepali Jain
Yuxiang Yang
Atil Iscen
Jasmine Hsu
Vikas Sindhwani
57
5
0
07 Mar 2019
Previous
1
2
3
...
44
45
46
...
50
51
52
Next