Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Multi-Agent Trust Region Policy Optimization
Hepeng Li
Haibo He
106
42
0
15 Oct 2020
Deep Learning of Koopman Representation for Control
Yiqiang Han
Wenjian Hao
Umesh Vaidya
AI4CE
57
110
0
15 Oct 2020
Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards
Zhixin Chen
Mengxiang Lin
58
6
0
14 Oct 2020
Efficient Wasserstein Natural Gradients for Reinforcement Learning
Theodore H. Moskovitz
Michael Arbel
Ferenc Huszár
Arthur Gretton
72
21
0
12 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
61
22
0
09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Philippe Preux
OffRL
72
1
0
09 Oct 2020
Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning
Daniele Reda
Tianxin Tao
M. van de Panne
AI4CE
109
53
0
09 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Ossama Ahmed
Frederik Trauble
Anirudh Goyal
Alexander Neitz
Yoshua Bengio
Bernhard Schölkopf
M. Wuthrich
Stefan Bauer
CML
118
123
0
08 Oct 2020
A novel control mode of bionic morphing tail based on deep reinforcement learning
Liming Zheng
Zhou Zhou
Peng Sun
Zhilin Zhang
Rui Wang
AI4CE
36
1
0
08 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
69
7
0
08 Oct 2020
Proximal Policy Optimization with Relative Pearson Divergence
Taisuke Kobayashi
43
17
0
07 Oct 2020
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
227
61
0
06 Oct 2020
Learning Diverse Options via InfoMax Termination Critic
Yuji Kanagawa
Tomoyuki Kaneko
64
1
0
06 Oct 2020
Active Feature Acquisition with Generative Surrogate Models
Yang Li
Junier B. Oliva
RALM
TPM
72
38
0
06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
152
222
0
06 Oct 2020
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
Shengyi Huang
Santiago Ontañón
63
10
0
05 Oct 2020
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
122
15
0
02 Oct 2020
MADRaS : Multi Agent Driving Simulator
Anirban Santara
S. Rudra
Sree Aditya Buridi
Meha Kaushik
A. Naik
Bharat Kaul
Balaraman Ravindran
72
30
0
02 Oct 2020
Deep Reinforcement Learning with Mixed Convolutional Network
Yanyu Zhang
SSL
10
2
0
01 Oct 2020
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive Control
Rel Guzman
Rafael Oliveira
F. Ramos
97
15
0
01 Oct 2020
Deep Reinforcement Learning for Efficient Measurement of Quantum Devices
Vu-Linh Nguyen
S. B. Orbell
D. Lennon
H. Moon
F. Vigneau
...
D. Zumbuhl
G. Briggs
Michael A. Osborne
D. Sejdinovic
N. Ares
55
41
0
30 Sep 2020
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces
Marlesson R. O. Santana
Luckeciano C. Melo
Fernando H. F. Camargo
Bruno Brandão
Anderson Soares
Renan M. Oliveira
Sandor Caetano
OffRL
48
15
0
30 Sep 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
40
0
0
29 Sep 2020
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
Haotian Fu
Hongyao Tang
Jianye Hao
Chong Chen
Xidong Feng
Dong Li
Wulong Liu
OffRL
83
50
0
29 Sep 2020
Cross Learning in Deep Q-Networks
Xing Wang
A. Vinel
25
2
0
29 Sep 2020
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
90
45
0
28 Sep 2020
Agent Environment Cycle Games
J. K. Terry
Nathaniel Grammel
Benjamin Black
Ananth Hari
Caroline Horsch
L. Santos
65
7
0
28 Sep 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
256
743
0
24 Sep 2020
Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle in Virtual Open Space with Static Obstacles
Sanghyun Kim
Jongmin Park
Jae-Kwan Yun
Jiwon Seo
28
17
0
24 Sep 2020
The Agent Web Model -- Modelling web hacking for reinforcement learning
L. Erdődi
Fabio Massimo Zennaro
24
3
0
23 Sep 2020
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn
Anjukan Kathirgamanathan
Kacper Twardowski
E. Mangina
D. Finn
63
21
0
22 Sep 2020
Learning Task-Agnostic Action Spaces for Movement Optimization
Amin Babadi
M. van de Panne
Caren Liu
Perttu Hämäläinen
51
2
0
22 Sep 2020
CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models
Anirudh Vemula
J. Andrew Bagnell
Maxim Likhachev
88
9
0
21 Sep 2020
Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization
Johannes Dornheim
L. Morand
Samuel Zeitvogel
Tarek Iraki
Norbert Link
Dirk Helm
58
21
0
21 Sep 2020
RL STaR Platform: Reinforcement Learning for Simulation based Training of Robots
Tamir Blum
Gabin Paillet
Mickaël Laîné
Kazuya Yoshida
43
7
0
21 Sep 2020
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization
Feng Tao
Yongcan Cao
99
2
0
21 Sep 2020
Multiplayer Support for the Arcade Learning Environment
J. K. Terry
Benjamin Black
Luis Santos
74
13
0
20 Sep 2020
Measuring the Complexity of Domains Used to Evaluate AI Systems
Christopher Pereyda
Lawrence Holder
25
3
0
18 Sep 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic
Lin Shao
Yifan You
Mengyuan Yan
Qingyun Sun
Jeannette Bohg
89
24
0
18 Sep 2020
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents
Nicola Milano
S. Nolfi
46
0
0
15 Sep 2020
Extended Radial Basis Function Controller for Reinforcement Learning
Nicholas Capel
Naifu Zhang
35
1
0
12 Sep 2020
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
M. Berk Mirza
Andrew Jaegle
Jonathan J. Hunt
A. Guez
S. Tunyasuvunakool
...
Peter Karkus
S. Racanière
Lars Buesing
Timothy Lillicrap
N. Heess
AI4CE
79
12
0
11 Sep 2020
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments
Tom Bewley
J. Lawry
FAtt
74
27
0
10 Sep 2020
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning
M. Arango
Lyudmil Pelov
57
17
0
10 Sep 2020
Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents
J. Tani
Andrea F. Daniele
Gianmarco Bernasconi
Amaury Camus
Aleksandar Petrov
...
Tomasz Zaluska
Matthew R. Walter
Emilio Frazzoli
Liam Paull
A. Censi
50
8
0
09 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control
V. M. Alvarez
R. Rosca
Cristian G. Falcutescu
63
11
0
09 Sep 2020
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization
Recep Yusuf Bekci
M. Gümüş
29
4
0
04 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
111
96
0
04 Sep 2020
SEDRo: A Simulated Environment for Developmental Robotics
Aishwarya Pothula
Md Ashaduzzaman Rubel Mondol
Sanath Narasimhan
Sm Mazharul Islam
Deokgun Park
36
5
0
03 Sep 2020
Adaptive Risk Sensitive Model Predictive Control with Stochastic Search
Ziyi Wang
Oswin So
Keuntaek Lee
Camilo A. Duarte
Evangelos A. Theodorou
61
2
0
02 Sep 2020
Previous
1
2
3
...
32
33
34
...
50
51
52
Next