ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic
  Learning
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning
Brian Yang
Jesse Zhang
Vitchyr H. Pong
Sergey Levine
Dinesh Jayaraman
77
37
0
17 May 2019
Mastering the Game of Sungka from Random Play
Mastering the Game of Sungka from Random Play
Darwin Bautista
Raimarc S. Dionido
19
0
0
17 May 2019
Bias-Reduced Hindsight Experience Replay with Virtual Goal
  Prioritization
Bias-Reduced Hindsight Experience Replay with Virtual Goal Prioritization
Binyamin Manela
Armin Biess
44
23
0
14 May 2019
Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised
  Action Spaces
Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised Action Spaces
Craig J. Bester
Steven D. James
George Konidaris
56
57
0
10 May 2019
Autonomous Management of Energy-Harvesting IoT Nodes Using Deep
  Reinforcement Learning
Autonomous Management of Energy-Harvesting IoT Nodes Using Deep Reinforcement Learning
Abdulmajid Murad
F. Kraemer
Kerstin Bach
Gavin Taylor
48
26
0
10 May 2019
Smoothing Policies and Safe Policy Gradients
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
80
31
0
08 May 2019
Longitudinal Dynamic versus Kinematic Models for Car-Following Control
  Using Deep Reinforcement Learning
Longitudinal Dynamic versus Kinematic Models for Car-Following Control Using Deep Reinforcement Learning
Yuan Lin
J. McPhee
N. L. Azad
AI4CE
66
34
0
07 May 2019
A Complementary Learning Systems Approach to Temporal Difference
  Learning
A Complementary Learning Systems Approach to Temporal Difference Learning
Sam Blakeman
D. Mareschal
54
42
0
07 May 2019
P3O: Policy-on Policy-off Policy Optimization
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
93
56
0
05 May 2019
Hierarchical Policy Learning is Sensitive to Goal Space Design
Hierarchical Policy Learning is Sensitive to Goal Space Design
Zach Dwiel
Madhavun Candadai
Mariano Phielipp
Arjun K. Bansal
79
15
0
04 May 2019
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient
  Backpropagation Through Categorical Variables
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Mingzhang Yin
Yuguang Yue
Mingyuan Zhou
66
23
0
04 May 2019
Collaborative Evolutionary Reinforcement Learning
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
64
100
0
02 May 2019
Coevo: a collaborative design platform with artificial agents
Coevo: a collaborative design platform with artificial agents
Gerard Serra
D. Miralles
8
2
0
30 Apr 2019
Generative Adversarial Imagination for Sample Efficient Deep
  Reinforcement Learning
Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning
Kacper Kielak
GAN
23
0
0
30 Apr 2019
DAC: The Double Actor-Critic Architecture for Learning Options
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
149
73
0
29 Apr 2019
Reinforcement Learning Based Orchestration for Elastic Services
Reinforcement Learning Based Orchestration for Elastic Services
Mauricio Fadel Argerich
Bin Cheng
Jonathan Fürst
32
7
0
26 Apr 2019
Self Training Autonomous Driving Agent
Self Training Autonomous Driving Agent
Shashank Kotyan
Danilo Vasconcellos Vargas
Venkanna Udutalapally
29
4
0
26 Apr 2019
Meta-Sim: Learning to Generate Synthetic Datasets
Meta-Sim: Learning to Generate Synthetic Datasets
Amlan Kar
Aayush Prakash
Ming-Yuan Liu
Eric Cameracci
Justin Yuan
Matt Rusiniak
David Acuna
Antonio Torralba
Sanja Fidler
144
252
0
25 Apr 2019
Evolving Neural Networks in Reinforcement Learning by means of UMDAc
Evolving Neural Networks in Reinforcement Learning by means of UMDAc
Mikel Malagón
Josu Ceberio
54
2
0
24 Apr 2019
Generative Exploration and Exploitation
Generative Exploration and Exploitation
Jiechuan Jiang
Zongqing Lu
39
6
0
21 Apr 2019
Learning Manipulation under Physics Constraints with Visual Perception
Learning Manipulation under Physics Constraints with Visual Perception
Wenbin Li
A. Leonardis
Jeannette Bohg
Mario Fritz
SSLOCL
31
7
0
19 Apr 2019
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa
Tomoyuki Kaneko
73
14
0
17 Apr 2019
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement
  Learning Algorithms
A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
77
64
0
15 Apr 2019
Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic
  Manipulation
Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation
Benjamin Beyret
A. Shafti
A. Faisal
142
74
0
14 Apr 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement
  Learning from Observations
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
121
358
0
12 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost
  RL
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
47
2
0
08 Apr 2019
Reducing catastrophic forgetting when evolving neural networks
Reducing catastrophic forgetting when evolving neural networks
Joseph Early
23
2
0
05 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning
  Without a Supercomputer
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRLLRM
76
25
0
03 Apr 2019
VRGym: A Virtual Testbed for Physical and Interactive AI
VRGym: A Virtual Testbed for Physical and Interactive AI
Xu Xie
Hangxin Liu
Zhenliang Zhang
Yuxing Qiu
Feng Gao
Siyuan Qi
Yixin Zhu
Song-Chun Zhu
58
27
0
02 Apr 2019
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
Christian Rupprecht
Cyril Ibrahim
C. Pal
96
32
0
02 Apr 2019
Personalized Cancer Chemotherapy Schedule: a numerical comparison of
  performance and robustness in model-based and model-free scheduling
  methodologies
Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies
J. Tordesillas
Juncal Arbelaiz
OffRL
34
3
0
02 Apr 2019
Multitask Soft Option Learning
Multitask Soft Option Learning
Maximilian Igl
Andrew Gambardella
Jinke He
Nantas Nardelli
N. Siddharth
Wendelin Bohmer
Shimon Whiteson
187
26
0
01 Apr 2019
Guided Meta-Policy Search
Guided Meta-Policy Search
Russell Mendonca
Abhishek Gupta
Rosen Kralev
Pieter Abbeel
Sergey Levine
Chelsea Finn
68
57
0
01 Apr 2019
Generalized Off-Policy Actor-Critic
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRLCML
151
43
0
27 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
97
29
0
25 Mar 2019
Using RGB Image as Visual Input for Mapless Robot Navigation
Using RGB Image as Visual Input for Mapless Robot Navigation
Liulong Ma
Yanjie Liu
Jiao Chen
SSL
92
17
0
24 Mar 2019
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based
  Algorithms on Mobile Robots
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based Algorithms on Mobile Robots
Tingguang Li
Danny Ho
Chenming Li
Delong Zhu
Chaoqun Wang
Max Meng
3DV
66
57
0
23 Mar 2019
Improving Safety in Reinforcement Learning Using Model-Based
  Architectures and Human Intervention
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention
Bharat Prakash
Mohit Khatwani
Nicholas R. Waytowich
T. Mohsenin
OffRL
58
19
0
22 Mar 2019
DQN with model-based exploration: efficient learning on environments
  with sparse rewards
DQN with model-based exploration: efficient learning on environments with sparse rewards
Stephen Gou
Yuyang Liu
52
14
0
22 Mar 2019
Adaptive Genomic Evolution of Neural Network Topologies (AGENT) for
  State-to-Action Mapping in Autonomous Agents
Adaptive Genomic Evolution of Neural Network Topologies (AGENT) for State-to-Action Mapping in Autonomous Agents
A. Behjat
Sharat Chidambaran
Souma Chowdhury
48
14
0
17 Mar 2019
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and Gazebo
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and Gazebo
N. G. Lopez
Y. Nuin
Elias Barba Moral
Lander Usategui San Juan
A. Rueda
Víctor Mayoral-Vilches
R. Kojcev
OffRL
50
35
0
14 Mar 2019
Deep Reinforcement Learning with Feedback-based Exploration
Deep Reinforcement Learning with Feedback-based Exploration
Jan Scholten
Daan Wout
C. Celemin
Jens Kober
62
4
0
14 Mar 2019
Learning Gaussian Policies from Corrective Human Feedback
Learning Gaussian Policies from Corrective Human Feedback
Daan Wout
Jan Scholten
C. Celemin
Jens Kober
93
2
0
12 Mar 2019
Universally Slimmable Networks and Improved Training Techniques
Universally Slimmable Networks and Improved Training Techniques
Jiahui Yu
Thomas Huang
92
389
0
12 Mar 2019
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to
  Multiple Quadrotors
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
Artem Molchanov
Tao Chen
Wolfgang Hönig
James A. Preiss
Nora Ayanian
Gaurav Sukhatme
159
111
0
11 Mar 2019
Hybrid Reinforcement Learning with Expert State Sequences
Hybrid Reinforcement Learning with Expert State Sequences
Xiaoxiao Guo
Shiyu Chang
Mo Yu
Gerald Tesauro
Murray Campbell
OffRL
54
33
0
11 Mar 2019
Orthogonal Estimation of Wasserstein Distances
Orthogonal Estimation of Wasserstein Distances
Mark Rowland
Jiri Hron
Yunhao Tang
K. Choromanski
Tamás Sarlós
Adrian Weller
91
43
0
09 Mar 2019
Adaptive Power System Emergency Control using Deep Reinforcement
  Learning
Adaptive Power System Emergency Control using Deep Reinforcement Learning
Qiuhua Huang
Renke Huang
Weituo Hao
Jie Tan
Rui Fan
Zhenyu Huang
111
279
0
09 Mar 2019
Dyna-AIL : Adversarial Imitation Learning by Planning
Dyna-AIL : Adversarial Imitation Learning by Planning
Vaibhav Saxena
Srinivasan Sivanandan
Pulkit Mathur
41
1
0
08 Mar 2019
Provably Robust Blackbox Optimization for Reinforcement Learning
Provably Robust Blackbox Optimization for Reinforcement Learning
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
Deepali Jain
Yuxiang Yang
Atil Iscen
Jasmine Hsu
Vikas Sindhwani
57
5
0
07 Mar 2019
Previous
123...444546...505152
Next