1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 2,578 papers shown
Title
Continual Learning Using World Models for Pseudo-Rehearsal
Nicholas A. Ketz
Soheil Kolouri
Praveen K. Pilly
KELM
CLL
51
7
0
06 Mar 2019
Safety-Guided Deep Reinforcement Learning via Online Gaussian Process Estimation
Jiameng Fan
Wenchao Li
OffRL
OnRL
GP
77
18
0
06 Mar 2019
The AI Driving Olympics at NeurIPS 2018
J. Zilly
J. Tani
Breandan Considine
Bhairav Mehta
Andrea F. Daniele
...
R. Hristov
S. Mallya
Emilio Frazzoli
A. Censi
Liam Paull
85
14
0
06 Mar 2019
Training in Task Space to Speed Up and Guide Reinforcement Learning
Guillaume Bellegarda
Katie Byl
51
19
0
06 Mar 2019
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Florian Richter
Ryan K. Orosco
Michael C. Yip
OffRL
68
82
0
05 Mar 2019
The StreetLearn Environment and Dataset
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
Denis Teplyashin
Karl Moritz Hermann
...
Matthew Koichi Grimes
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
3DV
75
66
0
04 Mar 2019
NoRML: No-Reward Meta Learning
Yuxiang Yang
Ken Caluwaerts
Atil Iscen
Jie Tan
Chelsea Finn
77
27
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
93
61
0
03 Mar 2019
Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents
Joseph Suárez
Yilun Du
Phillip Isola
Igor Mordatch
77
71
0
02 Mar 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
59
2
0
27 Feb 2019
Deep Variational Koopman Models: Inferring Koopman Observations for Uncertainty-Aware Dynamics Modeling and Control
Jeremy Morton
F. Witherden
Mykel J Kochenderfer
85
47
0
26 Feb 2019
Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and Animals
Fan Fei
Zhan Tu
Yilun Yang
Jian Zhang
Xinyan Deng
88
32
0
25 Feb 2019
Curiosity-Driven Experience Prioritization via Density Estimation
Rui Zhao
Volker Tresp
134
55
0
20 Feb 2019
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
123
151
0
19 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
72
32
0
18 Feb 2019
Realizing Continual Learning through Modeling a Learning System as a Fiber Bundle
Zhenfeng Cao
40
2
0
16 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
55
10
0
15 Feb 2019
Network Offloading Policies for Cloud Robotics: a Learning-based Approach
Sandeep P. Chinchali
Apoorva Sharma
James Harrison
Amine Elhafsi
Daniel Kang
Evgenya Pergament
Eyal Cidon
Sachin Katti
Marco Pavone
OffRL
66
107
0
15 Feb 2019
Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity
Deepak Pathak
Chris Xiaoxuan Lu
Trevor Darrell
Phillip Isola
Alexei A. Efros
53
135
0
14 Feb 2019
ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero
Yuandong Tian
Jerry Ma
Qucheng Gong
Shubho Sengupta
Zhuoyuan Chen
James Pinkerton
C. L. Zitnick
102
110
0
12 Feb 2019
VERIFAI: A Toolkit for the Design and Analysis of Artificial Intelligence-Based Systems
T. Dreossi
Daniel J. Fremont
Shromona Ghosh
Edward J. Kim
H. Ravanbakhsh
Marcell Vazquez-Chanlatte
Sanjit A. Seshia
81
29
0
12 Feb 2019
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Andrew Cohen
Xingye Qiao
Lei Yu
E. Way
Xiangrong Tong
60
9
0
10 Feb 2019
Metaoptimization on a Distributed System for Deep Reinforcement Learning
Greg Heinrich
I. Frosio
OffRL
28
2
0
07 Feb 2019
The Actor-Advisor: Policy Gradient With Off-Policy Advice
Hélène Plisnier
Denis Steckelmacher
D. Roijers
A. Nowé
CML
OffRL
27
6
0
07 Feb 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning
Arthur Juliani
Ahmed Khalifa
Vincent-Pierre Berges
Jonathan Harper
Ervin Teng
Hunter Henry
A. Crespi
Julian Togelius
Danny Lange
78
144
0
04 Feb 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco M. Garcia
Philip S. Thomas
102
41
0
03 Feb 2019
Certified Reinforcement Learning with Logic Guidance
Mohammadhosein Hasanbeig
Daniel Kroening
Alessandro Abate
127
57
0
02 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
100
355
0
01 Feb 2019
Policy Consolidation for Continual Reinforcement Learning
Christos Kaplanis
Murray Shanahan
Claudia Clopath
CLL
OffRL
78
51
0
01 Feb 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
73
40
0
31 Jan 2019
Improving Evolutionary Strategies with Generative Neural Networks
Louis Faury
Clément Calauzènes
Olivier Fercoq
Syrine Krichene
67
13
0
31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
113
370
0
30 Jan 2019
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
107
124
0
29 Jan 2019
Trust Region-Guided Proximal Policy Optimization
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
89
57
0
29 Jan 2019
Modularization of End-to-End Learning: Case Study in Arcade Games
Andrew Melnik
Sascha Fleer
M. Schilling
Helge J. Ritter
OffRL
71
12
0
27 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
61
8
0
23 Jan 2019
Neuroflight: Next Generation Flight Control Firmware
W. Koch
R. Mancuso
Azer Bestavros
72
30
0
19 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
Ameer Haj-Ali
Qijing Huang
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
49
36
0
15 Jan 2019
Motion Perception in Reinforcement Learning with Dynamic Objects
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
69
35
0
10 Jan 2019
Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic
Mikael Henaff
A. Canziani
Yann LeCun
OOD
111
123
0
08 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Rui Wang
Joel Lehman
Jeff Clune
Kenneth O. Stanley
123
250
0
07 Jan 2019
Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
OffRL
62
27
0
05 Jan 2019
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Chunyuan Li
Ke Bai
Jianqiao Li
Guoyin Wang
Changyou Chen
Lawrence Carin
150
10
0
03 Jan 2019
Complementary reinforcement learning towards explainable agents
J. H. Lee
53
12
0
01 Jan 2019
Deconfounding Reinforcement Learning in Observational Settings
Chaochao Lu
Bernhard Schölkopf
José Miguel Hernández-Lobato
CML
OOD
173
75
0
26 Dec 2018
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
137
442
0
26 Dec 2018
Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control
Fabian Ruffy
Michael Przystupa
Ivan Beschastnikh
48
31
0
24 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
65
5
0
24 Dec 2018
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wayne Zhang
Liang Lin
57
8
0
21 Dec 2018
Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
51
26
0
21 Dec 2018