ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Continual Learning Using World Models for Pseudo-Rehearsal
Continual Learning Using World Models for Pseudo-Rehearsal
Nicholas A. Ketz
Soheil Kolouri
Praveen K. Pilly
KELMCLL
51
7
0
06 Mar 2019
Safety-Guided Deep Reinforcement Learning via Online Gaussian Process
  Estimation
Safety-Guided Deep Reinforcement Learning via Online Gaussian Process Estimation
Jiameng Fan
Wenchao Li
OffRLOnRLGP
77
18
0
06 Mar 2019
The AI Driving Olympics at NeurIPS 2018
The AI Driving Olympics at NeurIPS 2018
J. Zilly
J. Tani
Breandan Considine
Bhairav Mehta
Andrea F. Daniele
...
R. Hristov
S. Mallya
Emilio Frazzoli
A. Censi
Liam Paull
85
14
0
06 Mar 2019
Training in Task Space to Speed Up and Guide Reinforcement Learning
Training in Task Space to Speed Up and Guide Reinforcement Learning
Guillaume Bellegarda
Katie Byl
51
19
0
06 Mar 2019
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Open-Sourced Reinforcement Learning Environments for Surgical Robotics
Florian Richter
Ryan K. Orosco
Michael C. Yip
OffRL
68
82
0
05 Mar 2019
The StreetLearn Environment and Dataset
The StreetLearn Environment and Dataset
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
Denis Teplyashin
Karl Moritz Hermann
...
Matthew Koichi Grimes
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
3DV
75
66
0
04 Mar 2019
NoRML: No-Reward Meta Learning
NoRML: No-Reward Meta Learning
Yuxiang Yang
Ken Caluwaerts
Atil Iscen
Jie Tan
Chelsea Finn
77
27
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards
  Continuous Control in Computationally Complex Environments
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
93
61
0
03 Mar 2019
Neural MMO: A Massively Multiagent Game Environment for Training and
  Evaluating Intelligent Agents
Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents
Joseph Suárez
Yilun Du
Phillip Isola
Igor Mordatch
77
71
0
02 Mar 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention
  across Neural Network Layers
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
59
2
0
27 Feb 2019
Deep Variational Koopman Models: Inferring Koopman Observations for
  Uncertainty-Aware Dynamics Modeling and Control
Deep Variational Koopman Models: Inferring Koopman Observations for Uncertainty-Aware Dynamics Modeling and Control
Jeremy Morton
F. Witherden
Mykel J Kochenderfer
85
47
0
26 Feb 2019
Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing
  Robots and Animals
Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and Animals
Fan Fei
Zhan Tu
Yilun Yang
Jian Zhang
Xinyan Deng
88
32
0
25 Feb 2019
Curiosity-Driven Experience Prioritization via Density Estimation
Curiosity-Driven Experience Prioritization via Density Estimation
Rui Zhao
Volker Tresp
134
55
0
20 Feb 2019
Emergent Coordination Through Competition
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
123
151
0
19 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
72
32
0
18 Feb 2019
Realizing Continual Learning through Modeling a Learning System as a
  Fiber Bundle
Realizing Continual Learning through Modeling a Learning System as a Fiber Bundle
Zhenfeng Cao
40
2
0
16 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy
  Observations
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
55
10
0
15 Feb 2019
Network Offloading Policies for Cloud Robotics: a Learning-based
  Approach
Network Offloading Policies for Cloud Robotics: a Learning-based Approach
Sandeep P. Chinchali
Apoorva Sharma
James Harrison
Amine Elhafsi
Daniel Kang
Evgenya Pergament
Eyal Cidon
Sachin Katti
Marco Pavone
OffRL
66
107
0
15 Feb 2019
Learning to Control Self-Assembling Morphologies: A Study of
  Generalization via Modularity
Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity
Deepak Pathak
Chris Xiaoxuan Lu
Trevor Darrell
Phillip Isola
Alexei A. Efros
53
135
0
14 Feb 2019
ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero
ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero
Yuandong Tian
Jerry Ma
Qucheng Gong
Shubho Sengupta
Zhuoyuan Chen
James Pinkerton
C. L. Zitnick
102
110
0
12 Feb 2019
VERIFAI: A Toolkit for the Design and Analysis of Artificial
  Intelligence-Based Systems
VERIFAI: A Toolkit for the Design and Analysis of Artificial Intelligence-Based Systems
T. Dreossi
Daniel J. Fremont
Shromona Ghosh
Edward J. Kim
H. Ravanbakhsh
Marcell Vazquez-Chanlatte
Sanjit A. Seshia
81
29
0
12 Feb 2019
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Andrew Cohen
Xingye Qiao
Lei Yu
E. Way
Xiangrong Tong
60
9
0
10 Feb 2019
Metaoptimization on a Distributed System for Deep Reinforcement Learning
Metaoptimization on a Distributed System for Deep Reinforcement Learning
Greg Heinrich
I. Frosio
OffRL
28
2
0
07 Feb 2019
The Actor-Advisor: Policy Gradient With Off-Policy Advice
The Actor-Advisor: Policy Gradient With Off-Policy Advice
Hélène Plisnier
Denis Steckelmacher
D. Roijers
A. Nowé
CMLOffRL
27
6
0
07 Feb 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and
  Planning
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning
Arthur Juliani
Ahmed Khalifa
Vincent-Pierre Berges
Jonathan Harper
Ervin Teng
Hunter Henry
A. Crespi
Julian Togelius
Danny Lange
78
144
0
04 Feb 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco M. Garcia
Philip S. Thomas
102
41
0
03 Feb 2019
Certified Reinforcement Learning with Logic Guidance
Certified Reinforcement Learning with Logic Guidance
Mohammadhosein Hasanbeig
Daniel Kroening
Alessandro Abate
127
57
0
02 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
100
355
0
01 Feb 2019
Policy Consolidation for Continual Reinforcement Learning
Policy Consolidation for Continual Reinforcement Learning
Christos Kaplanis
Murray Shanahan
Claudia Clopath
CLLOffRL
78
51
0
01 Feb 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order
  Optimization Perspective
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
73
40
0
31 Jan 2019
Improving Evolutionary Strategies with Generative Neural Networks
Improving Evolutionary Strategies with Generative Neural Networks
Louis Faury
Clément Calauzènes
Olivier Fercoq
Syrine Krichene
67
13
0
31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
113
370
0
30 Jan 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
107
124
0
29 Jan 2019
Trust Region-Guided Proximal Policy Optimization
Trust Region-Guided Proximal Policy Optimization
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
89
57
0
29 Jan 2019
Modularization of End-to-End Learning: Case Study in Arcade Games
Modularization of End-to-End Learning: Case Study in Arcade Games
Andrew Melnik
Sascha Fleer
M. Schilling
Helge J. Ritter
OffRL
71
12
0
27 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
61
8
0
23 Jan 2019
Neuroflight: Next Generation Flight Control Firmware
Neuroflight: Next Generation Flight Control Firmware
W. Koch
R. Mancuso
Azer Bestavros
72
30
0
19 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep
  Reinforcement Learning
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
Ameer Haj-Ali
Qijing Huang
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
49
36
0
15 Jan 2019
Motion Perception in Reinforcement Learning with Dynamic Objects
Motion Perception in Reinforcement Learning with Dynamic Objects
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
69
35
0
10 Jan 2019
Model-Predictive Policy Learning with Uncertainty Regularization for
  Driving in Dense Traffic
Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic
Mikael Henaff
A. Canziani
Yann LeCun
OOD
111
123
0
08 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly
  Complex and Diverse Learning Environments and Their Solutions
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Rui Wang
Joel Lehman
Jeff Clune
Kenneth O. Stanley
123
250
0
07 Jan 2019
Hierarchical Reinforcement Learning via Advantage-Weighted Information
  Maximization
Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
OffRL
62
27
0
05 Jan 2019
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Chunyuan Li
Ke Bai
Jianqiao Li
Guoyin Wang
Changyou Chen
Lawrence Carin
150
10
0
03 Jan 2019
Complementary reinforcement learning towards explainable agents
Complementary reinforcement learning towards explainable agents
J. H. Lee
53
12
0
01 Jan 2019
Deconfounding Reinforcement Learning in Observational Settings
Deconfounding Reinforcement Learning in Observational Settings
Chaochao Lu
Bernhard Schölkopf
José Miguel Hernández-Lobato
CMLOOD
173
75
0
26 Dec 2018
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
137
442
0
26 Dec 2018
Iroko: A Framework to Prototype Reinforcement Learning for Data Center
  Traffic Control
Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control
Fabian Ruffy
Michael Przystupa
Ivan Beschastnikh
48
31
0
24 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for
  Model-based Control
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
65
5
0
24 Dec 2018
NADPEx: An on-policy temporally consistent exploration method for deep
  reinforcement learning
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wayne Zhang
Liang Lin
57
8
0
21 Dec 2018
Pre-training with Non-expert Human Demonstration for Deep Reinforcement
  Learning
Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
51
26
0
21 Dec 2018
Previous
123...454647...505152
Next