Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,654 papers shown
Title
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
22
125
0
19 Mar 2020
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
34
111
0
18 Mar 2020
Pretraining Image Encoders without Reconstruction via Feature Prediction Loss
G. Pihlgren
Fredrik Sandin
Marcus Liwicki
18
3
0
16 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Zhuoyuan Wang
Tingyu Lin
Cheng Wu
SSL
28
18
0
16 Mar 2020
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based Agents
Pablo V. A. Barros
Anne C. Bloem
Inge M. Hootsmans
Lena M. Opheij
Romain H. A. Toebosch
E. Barakova
A. Sciutti
17
9
0
12 Mar 2020
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
OffRL
34
46
0
11 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models
Magdalena Larfors
Robin Schneider
41
38
0
10 Mar 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
36
17
0
09 Mar 2020
q-VAE for Disentangled Representation Learning and Latent Dynamical Systems
Taisuke Kobayashis
BDL
DRL
22
17
0
04 Mar 2020
Hierarchically Decoupled Imitation for Morphological Transfer
D. Hejna
Pieter Abbeel
Lerrel Pinto
LM&Ro
25
41
0
03 Mar 2020
Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control
Ke Lin
Liang Gong
Xudong Li
Te Sun
Binhao Chen
Chengliang Liu
Zhengfeng Zhang
Jian Pu
Junping Zhang
24
8
0
27 Feb 2020
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
Elise van der Pol
Thomas Kipf
F. Oliehoek
Max Welling
25
77
0
27 Feb 2020
Policy Evaluation Networks
J. Harb
Tom Schaul
Doina Precup
Pierre-Luc Bacon
OffRL
20
36
0
26 Feb 2020
Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization
Aritz D. Martinez
E. Osaba
Javier Del Ser
Francisco Herrera
22
10
0
25 Feb 2020
Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration
Hoang Trung-Dung
Yitao Liang
Guy Van den Broeck
OffRL
22
3
0
25 Feb 2020
Informative Path Planning for Mobile Sensing with Reinforcement Learning
Yongyong Wei
Rong Zheng
26
34
0
18 Feb 2020
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
36
12
0
15 Feb 2020
PDDLGym: Gym Environments from PDDL Problems
Tom Silver
Rohan Chitnis
AI4CE
25
56
0
15 Feb 2020
Applying Depth-Sensing to Automated Surgical Manipulation with a da Vinci Robot
M. Hwang
Daniel Seita
Brijen Thananjeyan
Jeffrey Ichnowski
Samuel Paradis
Danyal Fer
Thomas Low
Ken Goldberg
8
31
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
V. Cevher
31
60
0
14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
30
25
0
14 Feb 2020
XCS Classifier System with Experience Replay
Anthony Stein
Roland Maier
Lukas Rosenbauer
J. Hähner
BDL
28
21
0
13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription
Olivier Francon
Santiago Gonzalez
B. Hodjat
Elliot Meyerson
Risto Miikkulainen
Xin Qiu
H. Shahrzad
26
16
0
13 Feb 2020
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces
Bogdan Mazoure
T. Doan
Tianyu Li
V. Makarenkov
Joelle Pineau
Doina Precup
Guillaume Rabusseau
OffRL
21
1
0
07 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
27
15
0
07 Feb 2020
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
22
158
0
03 Feb 2020
Evolving Neural Networks through a Reverse Encoding Tree
Haoling Zhang
Chao-Han Huck Yang
Hector Zenil
N. Kiani
Yue-Hong Shen
Jesper N. Tegnér
19
5
0
03 Feb 2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions
Samuele Tosatto
R. Akrour
Jan Peters
15
4
0
29 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization
Chang Ye
Ahmed Khalifa
Philip Bontrager
Julian Togelius
32
38
0
27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning
Ahmed Khalifa
Philip Bontrager
Sam Earle
Julian Togelius
21
143
0
24 Jan 2020
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
Dor Livne
Kobi Cohen
34
50
0
14 Jan 2020
Improving Image Autoencoder Embeddings with Perceptual Loss
G. Pihlgren
Fredrik Sandin
Marcus Liwicki
25
33
0
10 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
25
174
0
09 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
30
38
0
02 Jan 2020
Model Inversion Networks for Model-Based Optimization
Aviral Kumar
Sergey Levine
OffRL
38
93
0
31 Dec 2019
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning
Andreas Sedlmeier
Thomas Gabor
Thomy Phan
Lenz Belzner
Claudia Linnhoff-Popien
21
25
0
31 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
20
6
0
23 Dec 2019
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
33
65
0
18 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
46
1,799
0
13 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
29
26
0
13 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit order book
Jacobo Roa-Vicens
Yuanbo Wang
Virgile Mison
Y. Gal
Ricardo M. A. Silva
23
3
0
09 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRL
OnRL
CLL
11
15
0
03 Dec 2019
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
36
122
0
17 Nov 2019
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
30
152
0
15 Nov 2019
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Eivind Bøhn
E. M. Coates
Signe Moe
T. Johansen
25
129
0
13 Nov 2019
Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
Praveen Palanisamy
45
142
0
11 Nov 2019
Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning
Matthew Benatan
Edward O. Pyzer-Knapp
BDL
24
6
0
08 Nov 2019
Experience Sharing Between Cooperative Reinforcement Learning Agents
Lucas O. Souza
G. Ramos
C. Ralha
27
9
0
06 Nov 2019
Previous
1
2
3
...
27
28
29
...
32
33
34
Next