ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded
  Invention of Learning Challenges and their Solutions
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
22
125
0
19 Mar 2020
Neuroevolution of Self-Interpretable Agents
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
34
111
0
18 Mar 2020
Pretraining Image Encoders without Reconstruction via Feature Prediction
  Loss
Pretraining Image Encoders without Reconstruction via Feature Prediction Loss
G. Pihlgren
Fredrik Sandin
Marcus Liwicki
18
3
0
16 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement
  Learning
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Zhuoyuan Wang
Tingyu Lin
Cheng Wu
SSL
28
18
0
16 Mar 2020
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based
  Agents
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based Agents
Pablo V. A. Barros
Anne C. Bloem
Inge M. Hootsmans
Lena M. Opheij
Romain H. A. Toebosch
E. Barakova
A. Sciutti
17
9
0
12 Mar 2020
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
OffRL
34
46
0
11 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models
Explore and Exploit with Heterotic Line Bundle Models
Magdalena Larfors
Robin Schneider
41
38
0
10 Mar 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
36
17
0
09 Mar 2020
q-VAE for Disentangled Representation Learning and Latent Dynamical
  Systems
q-VAE for Disentangled Representation Learning and Latent Dynamical Systems
Taisuke Kobayashis
BDL
DRL
22
17
0
04 Mar 2020
Hierarchically Decoupled Imitation for Morphological Transfer
Hierarchically Decoupled Imitation for Morphological Transfer
D. Hejna
Pieter Abbeel
Lerrel Pinto
LM&Ro
25
41
0
03 Mar 2020
Exploration-efficient Deep Reinforcement Learning with Demonstration
  Guidance for Robot Control
Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control
Ke Lin
Liang Gong
Xudong Li
Te Sun
Binhao Chen
Chengliang Liu
Zhengfeng Zhang
Jian Pu
Junping Zhang
24
8
0
27 Feb 2020
Plannable Approximations to MDP Homomorphisms: Equivariance under
  Actions
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
Elise van der Pol
Thomas Kipf
F. Oliehoek
Max Welling
25
77
0
27 Feb 2020
Policy Evaluation Networks
Policy Evaluation Networks
J. Harb
Tom Schaul
Doina Precup
Pierre-Luc Bacon
OffRL
20
36
0
26 Feb 2020
Simultaneously Evolving Deep Reinforcement Learning Models using
  Multifactorial Optimization
Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization
Aritz D. Martinez
E. Osaba
Javier Del Ser
Francisco Herrera
22
10
0
25 Feb 2020
Off-Policy Deep Reinforcement Learning with Analogous Disentangled
  Exploration
Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration
Hoang Trung-Dung
Yitao Liang
Guy Van den Broeck
OffRL
22
3
0
25 Feb 2020
Informative Path Planning for Mobile Sensing with Reinforcement Learning
Informative Path Planning for Mobile Sensing with Reinforcement Learning
Yongyong Wei
Rong Zheng
26
34
0
18 Feb 2020
Universal Value Density Estimation for Imitation Learning and
  Goal-Conditioned Reinforcement Learning
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
36
12
0
15 Feb 2020
PDDLGym: Gym Environments from PDDL Problems
PDDLGym: Gym Environments from PDDL Problems
Tom Silver
Rohan Chitnis
AI4CE
25
56
0
15 Feb 2020
Applying Depth-Sensing to Automated Surgical Manipulation with a da
  Vinci Robot
Applying Depth-Sensing to Automated Surgical Manipulation with a da Vinci Robot
M. Hwang
Daniel Seita
Brijen Thananjeyan
Jeffrey Ichnowski
Samuel Paradis
Danyal Fer
Thomas Low
Ken Goldberg
8
31
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin
  Dynamics
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
V. Cevher
31
60
0
14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control
  Tasks with Path Planning
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
30
25
0
14 Feb 2020
XCS Classifier System with Experience Replay
XCS Classifier System with Experience Replay
Anthony Stein
Roland Maier
Lukas Rosenbauer
J. Hähner
BDL
28
21
0
13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted
  Prescription
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription
Olivier Francon
Santiago Gonzalez
B. Hodjat
Elliot Meyerson
Risto Miikkulainen
Xin Qiu
H. Shahrzad
26
16
0
13 Feb 2020
Representation of Reinforcement Learning Policies in Reproducing Kernel
  Hilbert Spaces
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces
Bogdan Mazoure
T. Doan
Tianyu Li
V. Makarenkov
Joelle Pineau
Doina Precup
Guillaume Rabusseau
OffRL
21
1
0
07 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic
  with Advantage Weighted Mixture Policy(SAC-AWMP)
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
27
15
0
07 Feb 2020
Ready Policy One: World Building Through Active Learning
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
22
158
0
03 Feb 2020
Evolving Neural Networks through a Reverse Encoding Tree
Evolving Neural Networks through a Reverse Encoding Tree
Haoling Zhang
Chao-Han Huck Yang
Hector Zenil
N. Kiani
Yue-Hong Shen
Jesper N. Tegnér
19
5
0
03 Feb 2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under
  Lipschitz Assumptions
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions
Samuele Tosatto
R. Akrour
Jan Peters
15
4
0
29 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization
Rotation, Translation, and Cropping for Zero-Shot Generalization
Chang Ye
Ahmed Khalifa
Philip Bontrager
Julian Togelius
32
38
0
27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning
PCGRL: Procedural Content Generation via Reinforcement Learning
Ahmed Khalifa
Philip Bontrager
Sam Earle
Julian Togelius
21
143
0
24 Jan 2020
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
Dor Livne
Kobi Cohen
34
50
0
14 Jan 2020
Improving Image Autoencoder Embeddings with Perceptual Loss
Improving Image Autoencoder Embeddings with Perceptual Loss
G. Pihlgren
Fredrik Sandin
Marcus Liwicki
25
33
0
10 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
25
174
0
09 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via
  Reward Network Distillation
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
30
38
0
02 Jan 2020
Model Inversion Networks for Model-Based Optimization
Model Inversion Networks for Model-Based Optimization
Aviral Kumar
Sergey Levine
OffRL
38
93
0
31 Dec 2019
Uncertainty-Based Out-of-Distribution Classification in Deep
  Reinforcement Learning
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning
Andreas Sedlmeier
Thomas Gabor
Thomy Phan
Lenz Belzner
Claudia Linnhoff-Popien
21
25
0
31 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
20
6
0
23 Dec 2019
Taming an autonomous surface vehicle for path following and collision
  avoidance using deep reinforcement learning
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
33
65
0
18 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
46
1,799
0
13 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
29
26
0
13 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit
  order book
Adversarial recovery of agent rewards from latent spaces of the limit order book
Jacobo Roa-Vicens
Yuanbo Wang
Virgile Mison
Y. Gal
Ricardo M. A. Silva
23
3
0
09 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRL
OnRL
CLL
11
15
0
03 Dec 2019
IKEA Furniture Assembly Environment for Long-Horizon Complex
  Manipulation Tasks
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
36
122
0
17 Nov 2019
Empirical Study of Off-Policy Policy Evaluation for Reinforcement
  Learning
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
30
152
0
15 Nov 2019
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using
  Proximal Policy Optimization
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Eivind Bøhn
E. M. Coates
Signe Moe
T. Johansen
25
129
0
13 Nov 2019
Multi-Agent Connected Autonomous Driving using Deep Reinforcement
  Learning
Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
Praveen Palanisamy
45
142
0
11 Nov 2019
Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning
Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning
Matthew Benatan
Edward O. Pyzer-Knapp
BDL
24
6
0
08 Nov 2019
Experience Sharing Between Cooperative Reinforcement Learning Agents
Experience Sharing Between Cooperative Reinforcement Learning Agents
Lucas O. Souza
G. Ramos
C. Ralha
27
9
0
06 Nov 2019
Previous
123...272829...323334
Next