Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,654 papers shown
Title
Curiosity-Driven Experience Prioritization via Density Estimation
Rui Zhao
Volker Tresp
27
54
0
20 Feb 2019
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
47
149
0
19 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
22
32
0
18 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
30
9
0
15 Feb 2019
Network Offloading Policies for Cloud Robotics: a Learning-based Approach
Sandeep P. Chinchali
Apoorva Sharma
James Harrison
Amine Elhafsi
Daniel Kang
Evgenya Pergament
Eyal Cidon
Sachin Katti
Marco Pavone
OffRL
11
105
0
15 Feb 2019
VERIFAI: A Toolkit for the Design and Analysis of Artificial Intelligence-Based Systems
T. Dreossi
Daniel J. Fremont
Shromona Ghosh
Edward J. Kim
H. Ravanbakhsh
Marcell Vazquez-Chanlatte
S. Seshia
18
29
0
12 Feb 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco M. Garcia
Philip S. Thomas
24
38
0
03 Feb 2019
Certified Reinforcement Learning with Logic Guidance
Mohammadhosein Hasanbeig
Daniel Kroening
Alessandro Abate
24
53
0
02 Feb 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
19
40
0
31 Jan 2019
Improving Evolutionary Strategies with Generative Neural Networks
Louis Faury
Clément Calauzènes
Olivier Fercoq
Syrine Krichene
27
12
0
31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
363
0
30 Jan 2019
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
118
0
29 Jan 2019
Modularization of End-to-End Learning: Case Study in Arcade Games
Andrew Melnik
Sascha Fleer
M. Schilling
Helge J. Ritter
OffRL
39
12
0
27 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
19
7
0
23 Jan 2019
Neuroflight: Next Generation Flight Control Firmware
W. Koch
R. Mancuso
Azer Bestavros
35
29
0
19 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
Ameer Haj-Ali
Qijing Huang
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
29
36
0
15 Jan 2019
Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic
Mikael Henaff
A. Canziani
Yann LeCun
OOD
28
122
0
08 Jan 2019
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Chunyuan Li
Ke Bai
Jianqiao Li
Guoyin Wang
Changyou Chen
Lawrence Carin
16
10
0
03 Jan 2019
Complementary reinforcement learning towards explainable agents
J. H. Lee
27
12
0
01 Jan 2019
Deconfounding Reinforcement Learning in Observational Settings
Chaochao Lu
Bernhard Schölkopf
José Miguel Hernández-Lobato
CML
OOD
39
73
0
26 Dec 2018
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
54
433
0
26 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
29
5
0
24 Dec 2018
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
30
32
0
19 Dec 2018
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning
Tianyi Chen
Kaipeng Zhang
G. Giannakis
Tamer Basar
OffRL
29
41
0
07 Dec 2018
Active Deep Q-learning with Demonstration
Si-An Chen
Voot Tangkaratt
Hsuan-Tien Lin
Masashi Sugiyama
18
32
0
06 Dec 2018
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
25
8
0
26 Nov 2018
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning
L. Nguyen
Zhou Yang
Jiazhen Zhu
Jia Ming Li
Fang Jin
24
21
0
12 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional
β
β
β
-VAE for Deep Transparent Reinforcement Learning
John Yang
Gyujeong Lee
Minsung Hyun
Simyung Chang
Nojun Kwak
29
3
0
11 Nov 2018
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Qiming Zou
Ling Wang
K. Lu
Yu Li
OffRL
25
0
0
09 Nov 2018
Deep Reinforcement Learning via L-BFGS Optimization
Chris Paxton
Roummel F. Marcia
OffRL
21
0
0
06 Nov 2018
Learning to Defend by Learning to Attack
Haoming Jiang
Zhehui Chen
Yuyang Shi
Bo Dai
T. Zhao
21
22
0
03 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning
Kavosh Asadi
Evan Cater
Dipendra Kumar Misra
Michael L. Littman
OffRL
29
13
0
31 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
D. Song
OffRL
18
233
0
29 Oct 2018
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions
Lerrel Pinto
Aditya Mandalika
Brian Hou
S. Srinivasa
33
13
0
24 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
23
5
0
21 Oct 2018
Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents
Yosuke Fukuchi
Masahiko Osawa
Hiroshi Yamakawa
M. Imai
19
31
0
20 Oct 2018
O2A: One-shot Observational learning with Action vectors
Leo Pauly
Wisdom C. Agboh
David C. Hogg
R. Fuentes
57
9
0
17 Oct 2018
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data for Imitation
Pratyusha Sharma
Lekha Mohan
Lerrel Pinto
Abhinav Gupta
23
119
0
16 Oct 2018
Batch Active Preference-Based Learning of Reward Functions
Erdem Biyik
Dorsa Sadigh
22
108
0
10 Oct 2018
Reinforcement Learning for Improving Agent Design
David R Ha
35
124
0
09 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
Energy-Based Hindsight Experience Prioritization
Rui Zhao
Volker Tresp
19
74
0
02 Oct 2018
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
Arjun Sharma
Mohit Sharma
Nicholas Rhinehart
Kris Kitani
27
68
0
29 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
39
29
0
27 Sep 2018
Omega-Regular Objectives in Model-Free Reinforcement Learning
E. M. Hahn
Mateo Perez
S. Schewe
Fabio Somenzi
Ashutosh Trivedi
D. Wojtczak
15
145
0
26 Sep 2018
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
19
3
0
18 Sep 2018
Deep Learning with Experience Ranking Convolutional Neural Network for Robot Manipulator
Hai V. Nguyen
Hung M. La
M. Deans
SSL
OffRL
12
8
0
16 Sep 2018
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P. Nagarajan
Garrett A. Warnell
Peter Stone
22
51
0
15 Sep 2018
Previous
1
2
3
...
30
31
32
33
34
Next