ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
88
183
0
09 Jan 2020
Sample-based Distributional Policy Gradient
Sample-based Distributional Policy Gradient
Rahul Singh
Keuntaek Lee
Yongxin Chen
64
19
0
08 Jan 2020
Blue River Controls: A toolkit for Reinforcement Learning Control
  Systems on Hardware
Blue River Controls: A toolkit for Reinforcement Learning Control Systems on Hardware
Kirill Polzounov
R. Sundar
L. Redden
38
10
0
07 Jan 2020
Reanalysis of Variance Reduced Temporal Difference Learning
Reanalysis of Variance Reduced Temporal Difference Learning
Tengyu Xu
Zhe Wang
Yi Zhou
Yingbin Liang
OffRL
104
39
0
07 Jan 2020
Learning Reusable Options for Multi-Task Reinforcement Learning
Learning Reusable Options for Multi-Task Reinforcement Learning
Francisco M. Garcia
Chris Nota
Philip S. Thomas
26
4
0
06 Jan 2020
Universal Successor Features for Transfer Reinforcement Learning
Universal Successor Features for Transfer Reinforcement Learning
Chen Ma
Dylan R. Ashley
Junfeng Wen
Yoshua Bengio
OffRL
58
26
0
05 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via
  Reward Network Distillation
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
116
39
0
02 Jan 2020
Model Inversion Networks for Model-Based Optimization
Model Inversion Networks for Model-Based Optimization
Aviral Kumar
Sergey Levine
OffRL
97
100
0
31 Dec 2019
Uncertainty-Based Out-of-Distribution Classification in Deep
  Reinforcement Learning
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning
Andreas Sedlmeier
Thomas Gabor
Thomy Phan
Lenz Belzner
Claudia Linnhoff-Popien
46
25
0
31 Dec 2019
Information Theoretic Model Predictive Q-Learning
Information Theoretic Model Predictive Q-Learning
M. Bhardwaj
Ankur Handa
Dieter Fox
Byron Boots
80
23
0
31 Dec 2019
Augmented Replay Memory in Reinforcement Learning With Continuous
  Control
Augmented Replay Memory in Reinforcement Learning With Continuous Control
Mirza Ramicic
Andrea Bonarini
KELMCLLOffRL
28
1
0
29 Dec 2019
Quasi-Newton Trust Region Policy Optimization
Quasi-Newton Trust Region Policy Optimization
Devesh K. Jha
A. Raghunathan
Diego Romeres
57
9
0
26 Dec 2019
Learning an Interpretable Traffic Signal Control Policy
Learning an Interpretable Traffic Signal Control Policy
James Ault
Josiah P. Hanna
Guni Sharon
45
50
0
23 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRLAI4TS
131
193
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
114
6
0
23 Dec 2019
Taming an autonomous surface vehicle for path following and collision
  avoidance using deep reinforcement learning
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
65
66
0
18 Dec 2019
Relational Mimic for Visual Adversarial Imitation Learning
Relational Mimic for Visual Adversarial Imitation Learning
Lionel Blondé
Yichuan Tang
Jian Zhang
Russ Webb
40
0
0
18 Dec 2019
Learning to grow: control of material self-assembly using evolutionary
  reinforcement learning
Learning to grow: control of material self-assembly using evolutionary reinforcement learning
S. Whitelam
Isaac Tamblyn
53
34
0
18 Dec 2019
Analysing Deep Reinforcement Learning Agents Trained with Domain
  Randomisation
Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation
Tianhong Dai
Kai Arulkumaran
Tamara Gerbert
Samyakh Tukra
Feryal M. P. Behbahani
Anil Anthony Bharath
87
28
0
18 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNNVLMCLLAI4CELRM
181
1,839
0
13 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
72
26
0
13 Dec 2019
Zero-shot generalization using cascaded system-representations
Zero-shot generalization using cascaded system-representations
A. Malik
OffRL
24
2
0
11 Dec 2019
Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control
  Optimization
Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization
Paolo Pagliuca
Nicola Milano
S. Nolfi
80
31
0
11 Dec 2019
Efficient Robotic Task Generalization Using Deep Model Fusion
  Reinforcement Learning
Efficient Robotic Task Generalization Using Deep Model Fusion Reinforcement Learning
Tianying Wang
Hao Zhang
Wei Qi Toh
Erik Cambria
Cheston Tan
Yan Wu
Yong Liu
Wei Jing
33
6
0
11 Dec 2019
Marginalized State Distribution Entropy Regularization in Policy
  Optimization
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam
Zafarali Ahmed
Doina Precup
59
17
0
11 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit
  order book
Adversarial recovery of agent rewards from latent spaces of the limit order book
Jacobo Roa-Vicens
Yuanbo Wang
Virgile Mison
Y. Gal
Ricardo M. A. Silva
51
3
0
09 Dec 2019
Learning Latent State Spaces for Planning through Reward Prediction
Learning Latent State Spaces for Planning through Reward Prediction
Aaron J. Havens
Ouyang Yi
P. Nagarajan
Yasuhiro Fujita
36
7
0
09 Dec 2019
Transformer Based Reinforcement Learning For Games
Transformer Based Reinforcement Learning For Games
Uddeshya Upadhyay
Nikunj Shah
Sucheta Ravikanti
Mayanka Medhe
OffRL
52
11
0
09 Dec 2019
Value-of-Information based Arbitration between Model-based and
  Model-free Control
Value-of-Information based Arbitration between Model-based and Model-free Control
Krishn Bera
Yash Mandilwar
R. Bapi
11
3
0
08 Dec 2019
VALAN: Vision and Language Agent Navigation
VALAN: Vision and Language Agent Navigation
L. Lansing
Vihan Jain
Harsh Mehta
Haoshuo Huang
Eugene Ie
LM&RoAI4TS
58
8
0
06 Dec 2019
Training Agents using Upside-Down Reinforcement Learning
Training Agents using Upside-Down Reinforcement Learning
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
93
126
0
05 Dec 2019
Reinforcement Learning with Convolutional Reservoir Computing
Reinforcement Learning with Convolutional Reservoir Computing
Hanten Chang
K. Futagami
51
22
0
05 Dec 2019
Learning Human Objectives by Evaluating Hypothetical Behavior
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
87
77
0
05 Dec 2019
Optimizing Norm-Bounded Weighted Ambiguity Sets for Robust MDPs
Optimizing Norm-Bounded Weighted Ambiguity Sets for Robust MDPs
R. Russel
Bahram Behzadian
Marek Petrik
43
3
0
04 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRLOnRLCLL
73
15
0
03 Dec 2019
Flow Rate Control in Smart District Heating Systems Using Deep
  Reinforcement Learning
Flow Rate Control in Smart District Heating Systems Using Deep Reinforcement Learning
Tinghao Zhang
Jing Luo
Ping Chen
Jie Liu
AI4CE
47
5
0
01 Dec 2019
Playing Games in the Dark: An approach for cross-modality transfer in
  reinforcement learning
Playing Games in the Dark: An approach for cross-modality transfer in reinforcement learning
Rui Silva
Miguel Vasco
Francisco S. Melo
Ana Paiva
Manuela Veloso
OffRL
41
14
0
28 Nov 2019
LeRoP: A Learning-Based Modular Robot Photography Framework
LeRoP: A Learning-Based Modular Robot Photography Framework
Hao Kang
Jianming Zhang
Haoxiang Li
Zhe Lin
TJ Rhodes
Bedrich Benes
42
4
0
28 Nov 2019
Adversarial Deep Reinforcement Learning based Adaptive Moving Target
  Defense
Adversarial Deep Reinforcement Learning based Adaptive Moving Target Defense
Taha Eghtesad
Yevgeniy Vorobeychik
Aron Laszka
AAML
49
8
0
27 Nov 2019
Biologically inspired architectures for sample-efficient deep
  reinforcement learning
Biologically inspired architectures for sample-efficient deep reinforcement learning
Pierre Harvey Richemond
Arinbjorn Kolbeinsson
Yike Guo
59
2
0
25 Nov 2019
ORL: Reinforcement Learning Benchmarks for Online Stochastic
  Optimization Problems
ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems
Bharathan Balaji
Jordan Bell-Masterson
Enes Bilgin
Andreas C. Damianou
Pablo Moreno Garcia
Arpit Jain
Runfei Luo
Alvaro Maggiar
Balakrishnan Narayanaswamy
Chun Jimmie Ye
OffRL
63
32
0
24 Nov 2019
DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep
  Reinforcement Learning
DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning
Mohammadhosein Hasanbeig
N. Jeppu
Alessandro Abate
T. Melham
Daniel Kroening
95
20
0
22 Nov 2019
Actively Learning Gaussian Process Dynamics
Actively Learning Gaussian Process Dynamics
Mona Buisson-Fenet
Friedrich Solowjow
Sebastian Trimpe
GP
103
64
0
22 Nov 2019
Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax
  Episodic Control
Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control
Marta Sarrico
Kai Arulkumaran
A. Agostinelli
Pierre Harvey Richemond
Anil Anthony Bharath
24
2
0
21 Nov 2019
Memory-Efficient Episodic Control Reinforcement Learning with Dynamic
  Online k-means
Memory-Efficient Episodic Control Reinforcement Learning with Dynamic Online k-means
A. Agostinelli
Kai Arulkumaran
Marta Sarrico
Pierre Harvey Richemond
Anil Anthony Bharath
OffRL
22
4
0
21 Nov 2019
Agent Probing Interaction Policies
Agent Probing Interaction Policies
Siddharth Ghiya
Oluwafemi Azeez
Brendan Miller
20
0
0
21 Nov 2019
Unsupervised Object Segmentation with Explicit Localization Module
Unsupervised Object Segmentation with Explicit Localization Module
Weitang Liu
Lifeng Wei
James Sharpnack
John Douglas Owens
SSeg
23
4
0
21 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of
  arbitrary future tasks
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
56
2
0
20 Nov 2019
MANGA: Method Agnostic Neural-policy Generalization and Adaptation
MANGA: Method Agnostic Neural-policy Generalization and Adaptation
Homanga Bharadhwaj
Shoichiro Yamaguchi
S. Maeda
43
4
0
19 Nov 2019
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse
  Representations Online
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Yangchen Pan
Kirby Banman
Martha White
22
0
0
19 Nov 2019
Previous
123...394041...505152
Next