Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
The Temporal Singularity: time-accelerated simulated civilizations and their implications
G. Spigler
3DGS
AI4CE
19
1
0
22 Jun 2018
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
63
93
0
21 Jun 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
110
180
0
20 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
130
222
0
20 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
77
361
0
20 Jun 2018
VirtualHome: Simulating Household Activities via Programs
Xavier Puig
K. Ra
Marko Boben
Jiaman Li
Tingwu Wang
Sanja Fidler
Antonio Torralba
LM&Ro
118
503
0
19 Jun 2018
Laplacian Smoothing Gradient Descent
Stanley Osher
Bao Wang
Penghang Yin
Xiyang Luo
Farzin Barekat
Minh Pham
A. Lin
ODL
113
43
0
17 Jun 2018
Surprising Negative Results for Generative Adversarial Tree Search
Kamyar Azizzadenesheli
Brandon Yang
Weitang Liu
Zachary Chase Lipton
Anima Anandkumar
91
13
0
15 Jun 2018
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
85
251
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
45
21
0
14 Jun 2018
Accelerating Imitation Learning with Predictive Models
Ching-An Cheng
Xinyan Yan
Evangelos A. Theodorou
Byron Boots
90
21
0
12 Jun 2018
Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces
Motoya Ohnishi
M. Yukawa
M. Johansson
Masashi Sugiyama
45
3
0
08 Jun 2018
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
89
101
0
07 Jun 2018
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation
Jiaxuan You
Bowen Liu
Rex Ying
Vijay S. Pande
J. Leskovec
GNN
347
905
0
07 Jun 2018
Deep Reinforcement Learning for General Video Game AI
R. Torrado
Philip Bontrager
Julian Togelius
Jialin Liu
Diego Perez-Liebana
82
131
0
06 Jun 2018
Randomized Value Functions via Multiplicative Normalizing Flows
Ahmed Touati
Harsh Satija
Joshua Romoff
Joelle Pineau
Pascal Vincent
55
36
0
06 Jun 2018
Human-like generalization in a machine through predicate learning
L. Doumas
Guillermo Puebla
Andrea E. Martin
NAI
49
9
0
05 Jun 2018
Boredom-driven curious learning by Homeo-Heterostatic Value Gradients
Yen Yu
A. Chang
Ryota Kanai
30
9
0
05 Jun 2018
BindsNET: A machine learning-oriented spiking neural networks library in Python
Hananel Hazan
D. J. Saunders
Hassaan Khan
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
AI4CE
81
234
0
04 Jun 2018
Mitigation of Policy Manipulation Attacks on Deep Q-Networks with Parameter-Space Noise
Vahid Behzadan
Arslan Munir
AAML
74
21
0
04 Jun 2018
Playing Atari with Six Neurons
Giuseppe Cuccu
Julian Togelius
Philippe Cudré-Mauroux
165
43
0
04 Jun 2018
Challenges in High-dimensional Reinforcement Learning with Evolution Strategies
Nils Müller
Tobias Glasmachers
61
28
0
04 Jun 2018
DAQN: Deep Auto-encoder and Q-Network
Daiki Kimura
50
18
0
02 Jun 2018
Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method
Yang Lyu
Quan Pan
Jin-wen Hu
Chunhui Zhao
Shuai Liu
108
33
0
01 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals
E. Tretschk
Seong Joon Oh
Mario Fritz
OnRL
399
48
1
31 May 2018
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
George Andriopoulos
77
20
0
29 May 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
128
94
0
29 May 2018
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
97
546
0
28 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
102
18
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
77
84
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
95
49
0
24 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
43
0
0
24 May 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
70
75
0
23 May 2018
A General Family of Robust Stochastic Operators for Reinforcement Learning
Yingdong Lu
M. Squillante
C. Wu
44
3
0
21 May 2018
Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior
S. Reddy
Anca Dragan
Sergey Levine
89
105
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
132
232
0
21 May 2018
Two geometric input transformation methods for fast online reinforcement learning with neural nets
Sina Ghiassian
Huizhen Yu
Banafsheh Rafiee
R. Sutton
OffRL
62
10
0
18 May 2018
Leveraging human knowledge in tabular reinforcement learning: A study of human subjects
Ariel Rosenfeld
Moshe Cohen
Matthew E. Taylor
Sarit Kraus
OffRL
38
34
0
15 May 2018
GAN Q-learning
T. Doan
Bogdan Mazoure
Clare Lyle
OOD
OffRL
64
19
0
13 May 2018
Task Transfer by Preference-Based Cost Learning
Chi-Hung Hsu
Shu-Huan Chang
Da-Cheng Juan
Yu-Ting Chen
Shih-Chieh Chang
79
54
0
12 May 2018
Interactive Reinforcement Learning with Dynamic Reuse of Prior Knowledge from Human/Agent's Demonstration
Zhaodong Wang
Matthew E. Taylor
OnRL
14
5
0
11 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
53
52
0
11 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
96
42
0
09 May 2018
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Yu-Jhe Li
Hsin-Yu Chang
Yu-Jing Lin
Po-Wei Wu
Y. Wang
GAN
28
5
0
05 May 2018
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
143
732
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
86
31
0
04 May 2018
VINE: An Open Source Interactive Data Visualization Tool for Neuroevolution
Rui Wang
Jeff Clune
Kenneth O. Stanley
30
7
0
03 May 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
93
417
0
24 Apr 2018
Benchmarking projective simulation in navigation problems
A. Melnikov
A. Makmal
Hans J. Briegel
59
19
0
23 Apr 2018
State Distribution-aware Sampling for Deep Q-learning
Weichao Li
Fuxian Huang
Xi Li
G. Pan
Leilei Gan
TTA
46
4
0
23 Apr 2018
Previous
1
2
3
...
48
49
50
51
52
Next