1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 2,578 papers shown
Title
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
74
265
0
24 Nov 2017
Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces
Daniel Levy
Stefano Ermon
66
4
0
21 Nov 2017
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Gould
Anton Van Den Hengel
LM&Ro
174
1,325
0
20 Nov 2017
Variational Adaptive-Newton Method for Explorative Learning
Mohammad Emtiyaz Khan
Wu Lin
Voot Tangkaratt
Zuozhu Liu
Didrik Nielsen
ODL
81
20
0
15 Nov 2017
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
M. Raghu
A. Irpan
Jacob Andreas
Robert D. Kleinberg
Quoc V. Le
Jon M. Kleinberg
105
28
0
07 Nov 2017
Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning
Wenbin Li
Jeannette Bohg
Mario Fritz
51
8
0
01 Nov 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
138
300
0
31 Oct 2017
Action-dependent Control Variates for Policy Optimization via Stein's Identity
Hao Liu
Yihao Feng
Yi Mao
Dengyong Zhou
Jian-wei Peng
Qiang Liu
94
4
0
30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
112
29
0
28 Oct 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
135
14
0
24 Oct 2017
Nonsmooth optimal value and policy functions in mechanical systems subject to unilateral constraints
Bora S. Banjanin
Samuel A. Burden
87
0
0
18 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning
A. Kume
Eiichi Matsumoto
K. Takahashi
W. Ko
Jethro Tan
69
11
0
17 Oct 2017
Flow: A Modular Learning Framework for Mixed Autonomy Traffic
Cathy Wu
Abdul Rahman Kreidieh
Kanaad Parvate
Eugene Vinitsky
Alexandre M. Bayen
102
162
0
16 Oct 2017
Unsupervised Real-Time Control through Variational Empowerment
Maximilian Karl
Maximilian Soelch
Philip Becker-Ehmck
Djalel Benbouzid
Patrick van der Smagt
Justin Bayer
78
55
0
13 Oct 2017
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action Control
Seungyul Han
Y. Sung
OffRL
33
8
0
12 Oct 2017
Learning to Generalize: Meta-Learning for Domain Generalization
Da Li
Yongxin Yang
Yi-Zhe Song
Timothy M. Hospedales
OOD
111
1,436
0
10 Oct 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
86
273
0
28 Sep 2017
Multi-task Learning with Gradient Guided Policy Specialization
Wenhao Yu
Chenxi Liu
Greg Turk
30
2
0
23 Sep 2017
OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World
Tu-Hoa Pham
Giovanni De Magistris
Ryuki Tachibana
OffRL
71
143
0
22 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson
Wei-Di Chang
Pierre-Luc Bacon
David Meger
Joelle Pineau
Doina Precup
GAN
77
73
0
20 Sep 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
149
1,970
0
19 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
110
28
0
18 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
114
558
0
18 Sep 2017
Shapechanger: Environments for Transfer Learning
Sébastien M. R. Arnold
Tsam Kiu Pun
Théo-Tim J. Denisart
Francisco J. Valero Cuevas
3DPC
LM&Ro
26
0
0
15 Sep 2017
Shared Learning: Enhancing Reinforcement in Q-Ensembles
Rakesh R Menon
Balaraman Ravindran
33
0
0
14 Sep 2017
One-Shot Visual Imitation Learning via Meta-Learning
Chelsea Finn
Tianhe Yu
Tianhao Zhang
Pieter Abbeel
Sergey Levine
SSL
133
566
0
14 Sep 2017
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
3DH
OffRL
84
58
0
12 Sep 2017
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
92
45
0
01 Sep 2017
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
101
208
0
25 Aug 2017
Sim4CV: A Photo-Realistic Simulator for Computer Vision Applications
Matthias Muller
Vincent Casser
Jean Lahoud
Neil G. Smith
Guohao Li
VGen
74
181
0
19 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
143
2,830
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
135
630
0
17 Aug 2017
OpenML Benchmarking Suites
B. Bischl
Giuseppe Casalicchio
Matthias Feurer
Pieter Gijsbers
Frank Hutter
Michel Lang
R. G. Mantovani
Jan N. van Rijn
Joaquin Vanschoren
VLM
ELM
123
165
0
11 Aug 2017
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
BDL
OffRL
103
253
0
10 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
116
977
0
08 Aug 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
101
24
0
06 Aug 2017
Mutual Alignment Transfer Learning
Markus Wulfmeier
Ingmar Posner
Pieter Abbeel
152
61
0
25 Jul 2017
RAIL: Risk-Averse Imitation Learning
Anirban Santara
A. Naik
Balaraman Ravindran
Dipankar Das
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
82
18
0
20 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
678
19,343
0
20 Jul 2017
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems
James Harrison
Animesh Garg
Boris Ivanovic
Yuke Zhu
Silvio Savarese
Li Fei-Fei
Marco Pavone
76
25
0
15 Jul 2017
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games
Yuandong Tian
Qucheng Gong
Wenling Shang
Yuxin Wu
C. L. Zitnick
OffRL
74
126
0
04 Jul 2017
OPEB: Open Physical Environment Benchmark for Artificial Intelligence
H. Mirzaei
Mona Fathollahi
T. Givargis
21
2
0
04 Jul 2017
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning
Nick Erickson
Qi Zhao
CLL
OffRL
422
2
0
19 Jun 2017
Expected Policy Gradients
K. Ciosek
Shimon Whiteson
129
58
0
15 Jun 2017
Reinforcement Learning under Model Mismatch
Aurko Roy
Huan Xu
Sebastian Pokutta
OOD
87
80
0
15 Jun 2017
Symmetry Learning for Function Approximation in Reinforcement Learning
Anuj Mahajan
Theja Tulabandhula
67
31
0
09 Jun 2017
Parameter Space Noise for Exploration
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
111
597
0
06 Jun 2017
Deep learning evaluation using deep linguistic processing
A. Kuhnle
Ann A. Copestake
ELM
59
11
0
05 Jun 2017
Non-Markovian Control with Gated End-to-End Memory Policy Networks
J. Perez
T. Silander
OffRL
65
6
0
31 May 2017
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
6
2
0
31 May 2017