Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Liquid Time-constant Networks
Ramin Hasani
Mathias Lechner
Alexander Amini
Daniela Rus
Radu Grosu
AI4TS
AI4CE
93
230
0
08 Jun 2020
Learning Long-Term Dependencies in Irregularly-Sampled Time Series
Mathias Lechner
Ramin Hasani
AI4TS
73
132
0
08 Jun 2020
Implications of Human Irrationality for Reinforcement Learning
Haiyang Chen
H. Chang
Andrew Howes
47
1
0
07 Jun 2020
Dual Policy Distillation
Kwei-Herng Lai
Daochen Zha
Yuening Li
Helen Zhou
OffRL
118
9
0
07 Jun 2020
TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive Midpoint Adaptations in Cartesian Space
Jonas C. Kiemel
Pascal Meissner
Torsten Kröger
48
6
0
05 Jun 2020
Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion
Josh Roy
George Konidaris
73
16
0
04 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
21
4
0
04 Jun 2020
Interferobot: aligning an optical interferometer by a reinforcement learning agent
Dmitry Sorokin
Alexander Ulanov
E. A. Sazhina
A. Lvovsky
60
17
0
03 Jun 2020
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
Seungyul Han
Y. Sung
45
26
0
02 Jun 2020
Crowd simulation for crisis management: the outcomes of the last decade
George K. Sidiropoulos
C. Kiourt
Lefteris Moussiades
54
20
0
01 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
112
53
0
30 May 2020
Active Measure Reinforcement Learning for Observation Cost Minimization
C. Bellinger
Rory Coles
Mark Crowley
Isaac Tamblyn
OffRL
45
24
0
26 May 2020
Modeling Penetration Testing with Reinforcement Learning Using Capture-the-Flag Challenges: Trade-offs between Model-free Learning and A Priori Knowledge
Fabio Massimo Zennaro
L. Erdődi
47
18
0
26 May 2020
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
97
11
0
25 May 2020
Policy Entropy for Out-of-Distribution Classification
Andreas Sedlmeier
Robert Muller
Steffen Illium
Claudia Linnhoff-Popien
OODD
OffRL
59
14
0
25 May 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
136
13
0
21 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
28
98
0
20 May 2020
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
142
87
0
20 May 2020
The Second Type of Uncertainty in Monte Carlo Tree Search
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
45
3
0
19 May 2020
Lifelong Control of Off-grid Microgrid with Model Based Reinforcement Learning
Simone Totaro
Ioannis Boukas
Anders Jonsson
Bertrand Cornélusse
27
31
0
16 May 2020
Learning Transferable Concepts in Deep Reinforcement Learning
Diego Gomez
Nicanor Quijano
Luis Felipe Giraldo
CLL
SSL
31
0
0
16 May 2020
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning
Thomas M. Moerland
Anna Deichler
S. Baldi
Joost Broekens
Catholijn M. Jonker
OffRL
42
10
0
15 May 2020
Probabilistic Guarantees for Safe Deep Reinforcement Learning
E. Bacci
David Parker
91
27
0
14 May 2020
Explainable Reinforcement Learning: A Survey
Erika Puiutta
Eric M. S. P. Veith
XAI
108
248
0
13 May 2020
Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning
Han Cha
Jihong Park
Hyesung Kim
M. Bennis
Seong-Lyun Kim
71
26
0
13 May 2020
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
124
679
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
81
58
0
12 May 2020
Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments
Baiming Chen
Mengdi Xu
Zuxin Liu
Liang-Sheng Li
Ding Zhao
70
37
0
11 May 2020
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Baiming Chen
Mengdi Xu
Liang-Sheng Li
Ding Zhao
OffRL
143
65
0
11 May 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
246
198
0
08 May 2020
LGSVL Simulator: A High Fidelity Simulator for Autonomous Driving
Guodong Rong
B. Shin
Hadi Tabatabaee
Q. Lu
Steve Lemke
...
Eric Sterner
Keunhae Ushiroda
Michael Reyes
Dmitry Zelenkovsky
Seonman Kim
105
405
0
07 May 2020
Planning from Images with Deep Latent Gaussian Process Dynamics
Nathanael Bosch
Jan Achterhold
Laura Leal-Taixé
J. Stückler
54
1
0
07 May 2020
Playing Minecraft with Behavioural Cloning
Anssi Kanervisto
Janne Karttunen
Ville Hautamaki
72
12
0
07 May 2020
CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
Ying-Sheng Luo
Jonathan Hans Soeseno
Trista Pei-chun Chen
Wei-Chao Chen
81
15
0
07 May 2020
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems
Anthony Corso
Robert J. Moss
Mark Koren
Ritchie Lee
Mykel J. Kochenderfer
97
176
0
06 May 2020
Active Preference-Based Gaussian Process Regression for Reward Learning
Erdem Biyik
Nicolas Huynh
Mykel J. Kochenderfer
Dorsa Sadigh
GP
96
110
0
06 May 2020
Discrete-to-Deep Supervised Policy Learning
B. Kurniawan
Peter Vamplew
Michael Papasimeon
Richard Dazeley
Cameron Foale
OffRL
16
3
0
05 May 2020
SIGVerse: A cloud-based VR platform for research on social and embodied human-robot interaction
T. Inamura
Y. Mizuchi
41
9
0
02 May 2020
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Junyao Chen
Li Xia
Jun Yang
Qianchuan Zhao
Zhengyuan Zhou
80
17
0
30 Apr 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. DÓro
Wojciech Ja'skowski
OffRL
94
27
0
29 Apr 2020
Actor-Critic Reinforcement Learning for Control with Stability Guarantee
Minghao Han
Lixian Zhang
Jun Wang
Wei Pan
95
113
0
29 Apr 2020
Augmented Behavioral Cloning from Observation
Juarez Monteiro
Nathan Gavenski
R. Granada
Felipe Meneguzzi
Rodrigo C. Barros
51
12
0
28 Apr 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
108
365
0
27 Apr 2020
GymFG: A Framework with a Gym Interface for FlightGear
A. Wood
Ali Sydney
Peter Chin
B. Thapa
Ryan Ross
6
2
0
26 Apr 2020
Self-Paced Deep Reinforcement Learning
Pascal Klink
Carlo DÉramo
Jan Peters
Joni Pajarinen
ODL
96
54
0
24 Apr 2020
Evolution of Q Values for Deep Q Learning in Stable Baselines
M. Andrews
Cemil Dibek
Karina Palyutina
40
3
0
24 Apr 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
110
38
0
22 Apr 2020
Policy Gradient from Demonstration and Curiosity
Jie Chen
Wenjun Xu
124
12
0
22 Apr 2020
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion
Aditi Mavalankar
OffRL
59
7
0
17 Apr 2020
Previous
1
2
3
...
36
37
38
...
50
51
52
Next