ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Liquid Time-constant Networks
Liquid Time-constant Networks
Ramin Hasani
Mathias Lechner
Alexander Amini
Daniela Rus
Radu Grosu
AI4TSAI4CE
93
230
0
08 Jun 2020
Learning Long-Term Dependencies in Irregularly-Sampled Time Series
Learning Long-Term Dependencies in Irregularly-Sampled Time Series
Mathias Lechner
Ramin Hasani
AI4TS
73
132
0
08 Jun 2020
Implications of Human Irrationality for Reinforcement Learning
Implications of Human Irrationality for Reinforcement Learning
Haiyang Chen
H. Chang
Andrew Howes
47
1
0
07 Jun 2020
Dual Policy Distillation
Dual Policy Distillation
Kwei-Herng Lai
Daochen Zha
Yuening Li
Helen Zhou
OffRL
118
9
0
07 Jun 2020
TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive
  Midpoint Adaptations in Cartesian Space
TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive Midpoint Adaptations in Cartesian Space
Jonas C. Kiemel
Pascal Meissner
Torsten Kröger
48
6
0
05 Jun 2020
Visual Transfer for Reinforcement Learning via Wasserstein Domain
  Confusion
Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion
Josh Roy
George Konidaris
73
16
0
04 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning
  Machines
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
21
4
0
04 Jun 2020
Interferobot: aligning an optical interferometer by a reinforcement
  learning agent
Interferobot: aligning an optical interferometer by a reinforcement learning agent
Dmitry Sorokin
Alexander Ulanov
E. A. Sazhina
A. Lvovsky
60
17
0
03 Jun 2020
Diversity Actor-Critic: Sample-Aware Entropy Regularization for
  Sample-Efficient Exploration
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
Seungyul Han
Y. Sung
45
26
0
02 Jun 2020
Crowd simulation for crisis management: the outcomes of the last decade
Crowd simulation for crisis management: the outcomes of the last decade
George K. Sidiropoulos
C. Kiourt
Lefteris Moussiades
54
20
0
01 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES)
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
112
53
0
30 May 2020
Active Measure Reinforcement Learning for Observation Cost Minimization
Active Measure Reinforcement Learning for Observation Cost Minimization
C. Bellinger
Rory Coles
Mark Crowley
Isaac Tamblyn
OffRL
45
24
0
26 May 2020
Modeling Penetration Testing with Reinforcement Learning Using
  Capture-the-Flag Challenges: Trade-offs between Model-free Learning and A
  Priori Knowledge
Modeling Penetration Testing with Reinforcement Learning Using Capture-the-Flag Challenges: Trade-offs between Model-free Learning and A Priori Knowledge
Fabio Massimo Zennaro
L. Erdődi
47
18
0
26 May 2020
Gradient Monitored Reinforcement Learning
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
97
11
0
25 May 2020
Policy Entropy for Out-of-Distribution Classification
Policy Entropy for Out-of-Distribution Classification
Andreas Sedlmeier
Robert Muller
Steffen Illium
Claudia Linnhoff-Popien
OODDOffRL
59
14
0
25 May 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
136
13
0
21 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR
  Control in Active Distribution Networks
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
28
98
0
20 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
142
87
0
20 May 2020
The Second Type of Uncertainty in Monte Carlo Tree Search
The Second Type of Uncertainty in Monte Carlo Tree Search
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
45
3
0
19 May 2020
Lifelong Control of Off-grid Microgrid with Model Based Reinforcement
  Learning
Lifelong Control of Off-grid Microgrid with Model Based Reinforcement Learning
Simone Totaro
Ioannis Boukas
Anders Jonsson
Bertrand Cornélusse
27
31
0
16 May 2020
Learning Transferable Concepts in Deep Reinforcement Learning
Learning Transferable Concepts in Deep Reinforcement Learning
Diego Gomez
Nicanor Quijano
Luis Felipe Giraldo
CLLSSL
31
0
0
16 May 2020
Think Too Fast Nor Too Slow: The Computational Trade-off Between
  Planning And Reinforcement Learning
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning
Thomas M. Moerland
Anna Deichler
S. Baldi
Joost Broekens
Catholijn M. Jonker
OffRL
42
10
0
15 May 2020
Probabilistic Guarantees for Safe Deep Reinforcement Learning
Probabilistic Guarantees for Safe Deep Reinforcement Learning
E. Bacci
David Parker
91
27
0
14 May 2020
Explainable Reinforcement Learning: A Survey
Explainable Reinforcement Learning: A Survey
Erika Puiutta
Eric M. S. P. Veith
XAI
108
248
0
13 May 2020
Proxy Experience Replay: Federated Distillation for Distributed
  Reinforcement Learning
Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning
Han Cha
Jihong Park
Hyesung Kim
M. Bennis
Seong-Lyun Kim
71
26
0
13 May 2020
MOReL : Model-Based Offline Reinforcement Learning
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
124
679
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
81
58
0
12 May 2020
Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and
  Competitive Environments
Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments
Baiming Chen
Mengdi Xu
Zuxin Liu
Liang-Sheng Li
Ding Zhao
70
37
0
11 May 2020
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Baiming Chen
Mengdi Xu
Liang-Sheng Li
Ding Zhao
OffRL
143
65
0
11 May 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous
  Distributional Quantile Critics
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
246
198
0
08 May 2020
LGSVL Simulator: A High Fidelity Simulator for Autonomous Driving
LGSVL Simulator: A High Fidelity Simulator for Autonomous Driving
Guodong Rong
B. Shin
Hadi Tabatabaee
Q. Lu
Steve Lemke
...
Eric Sterner
Keunhae Ushiroda
Michael Reyes
Dmitry Zelenkovsky
Seonman Kim
105
405
0
07 May 2020
Planning from Images with Deep Latent Gaussian Process Dynamics
Planning from Images with Deep Latent Gaussian Process Dynamics
Nathanael Bosch
Jan Achterhold
Laura Leal-Taixé
J. Stückler
54
1
0
07 May 2020
Playing Minecraft with Behavioural Cloning
Playing Minecraft with Behavioural Cloning
Anssi Kanervisto
Janne Karttunen
Ville Hautamaki
72
12
0
07 May 2020
CARL: Controllable Agent with Reinforcement Learning for Quadruped
  Locomotion
CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
Ying-Sheng Luo
Jonathan Hans Soeseno
Trista Pei-chun Chen
Wei-Chao Chen
81
15
0
07 May 2020
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical
  Systems
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems
Anthony Corso
Robert J. Moss
Mark Koren
Ritchie Lee
Mykel J. Kochenderfer
97
176
0
06 May 2020
Active Preference-Based Gaussian Process Regression for Reward Learning
Active Preference-Based Gaussian Process Regression for Reward Learning
Erdem Biyik
Nicolas Huynh
Mykel J. Kochenderfer
Dorsa Sadigh
GP
96
110
0
06 May 2020
Discrete-to-Deep Supervised Policy Learning
Discrete-to-Deep Supervised Policy Learning
B. Kurniawan
Peter Vamplew
Michael Papasimeon
Richard Dazeley
Cameron Foale
OffRL
16
3
0
05 May 2020
SIGVerse: A cloud-based VR platform for research on social and embodied
  human-robot interaction
SIGVerse: A cloud-based VR platform for research on social and embodied human-robot interaction
T. Inamura
Y. Mizuchi
41
9
0
02 May 2020
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Junyao Chen
Li Xia
Jun Yang
Qianchuan Zhao
Zhengyuan Zhou
80
17
0
30 Apr 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator
  Policy Optimization
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. DÓro
Wojciech Ja'skowski
OffRL
94
27
0
29 Apr 2020
Actor-Critic Reinforcement Learning for Control with Stability Guarantee
Actor-Critic Reinforcement Learning for Control with Stability Guarantee
Minghao Han
Lixian Zhang
Jun Wang
Wei Pan
95
113
0
29 Apr 2020
Augmented Behavioral Cloning from Observation
Augmented Behavioral Cloning from Observation
Juarez Monteiro
Nathan Gavenski
R. Granada
Felipe Meneguzzi
Rodrigo C. Barros
51
12
0
28 Apr 2020
First return, then explore
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
108
365
0
27 Apr 2020
GymFG: A Framework with a Gym Interface for FlightGear
GymFG: A Framework with a Gym Interface for FlightGear
A. Wood
Ali Sydney
Peter Chin
B. Thapa
Ryan Ross
6
2
0
26 Apr 2020
Self-Paced Deep Reinforcement Learning
Self-Paced Deep Reinforcement Learning
Pascal Klink
Carlo DÉramo
Jan Peters
Joni Pajarinen
ODL
96
54
0
24 Apr 2020
Evolution of Q Values for Deep Q Learning in Stable Baselines
Evolution of Q Values for Deep Q Learning in Stable Baselines
M. Andrews
Cemil Dibek
Karina Palyutina
40
3
0
24 Apr 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
110
38
0
22 Apr 2020
Policy Gradient from Demonstration and Curiosity
Policy Gradient from Demonstration and Curiosity
Jie Chen
Wenjun Xu
124
12
0
22 Apr 2020
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant
  Locomotion
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion
Aditi Mavalankar
OffRL
59
7
0
17 Apr 2020
Previous
123...363738...505152
Next