ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Multi-Agent Trust Region Policy Optimization
Multi-Agent Trust Region Policy Optimization
Hepeng Li
Haibo He
106
42
0
15 Oct 2020
Deep Learning of Koopman Representation for Control
Deep Learning of Koopman Representation for Control
Yiqiang Han
Wenjian Hao
Umesh Vaidya
AI4CE
57
110
0
15 Oct 2020
Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards
Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards
Zhixin Chen
Mengxiang Lin
58
6
0
14 Oct 2020
Efficient Wasserstein Natural Gradients for Reinforcement Learning
Efficient Wasserstein Natural Gradients for Reinforcement Learning
Theodore H. Moskovitz
Michael Arbel
Ferenc Huszár
Arthur Gretton
72
21
0
12 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in
  Epidemiological Models
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
61
22
0
09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual
  Variance
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Philippe Preux
OffRL
72
1
0
09 Oct 2020
Learning to Locomote: Understanding How Environment Design Matters for
  Deep Reinforcement Learning
Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning
Daniele Reda
Tianxin Tao
M. van de Panne
AI4CE
109
53
0
09 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and
  Transfer Learning
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Ossama Ahmed
Frederik Trauble
Anirudh Goyal
Alexander Neitz
Yoshua Bengio
Bernhard Schölkopf
M. Wuthrich
Stefan Bauer
CML
118
123
0
08 Oct 2020
A novel control mode of bionic morphing tail based on deep reinforcement
  learning
A novel control mode of bionic morphing tail based on deep reinforcement learning
Liming Zheng
Zhou Zhou
Peng Sun
Zhilin Zhang
Rui Wang
AI4CE
36
1
0
08 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
69
7
0
08 Oct 2020
Proximal Policy Optimization with Relative Pearson Divergence
Proximal Policy Optimization with Relative Pearson Divergence
Taisuke Kobayashi
43
17
0
07 Oct 2020
Reinforcement Learning with Random Delays
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
227
61
0
06 Oct 2020
Learning Diverse Options via InfoMax Termination Critic
Learning Diverse Options via InfoMax Termination Critic
Yuji Kanagawa
Tomoyuki Kaneko
64
1
0
06 Oct 2020
Active Feature Acquisition with Generative Surrogate Models
Active Feature Acquisition with Generative Surrogate Models
Yang Li
Junier B. Oliva
RALMTPM
72
38
0
06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement
  Learning
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
152
222
0
06 Oct 2020
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards
  for Real-time Strategy Games
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
Shengyi Huang
Santiago Ontañón
63
10
0
05 Oct 2020
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
122
15
0
02 Oct 2020
MADRaS : Multi Agent Driving Simulator
MADRaS : Multi Agent Driving Simulator
Anirban Santara
S. Rudra
Sree Aditya Buridi
Meha Kaushik
A. Naik
Bharat Kaul
Balaraman Ravindran
72
30
0
02 Oct 2020
Deep Reinforcement Learning with Mixed Convolutional Network
Deep Reinforcement Learning with Mixed Convolutional Network
Yanyu Zhang
SSL
10
2
0
01 Oct 2020
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive
  Control
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive Control
Rel Guzman
Rafael Oliveira
F. Ramos
97
15
0
01 Oct 2020
Deep Reinforcement Learning for Efficient Measurement of Quantum Devices
Deep Reinforcement Learning for Efficient Measurement of Quantum Devices
Vu-Linh Nguyen
S. B. Orbell
D. Lennon
H. Moon
F. Vigneau
...
D. Zumbuhl
G. Briggs
Michael A. Osborne
D. Sejdinovic
N. Ares
55
41
0
30 Sep 2020
MARS-Gym: A Gym framework to model, train, and evaluate Recommender
  Systems for Marketplaces
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces
Marlesson R. O. Santana
Luckeciano C. Melo
Fernando H. F. Camargo
Bruno Brandão
Anderson Soares
Renan M. Oliveira
Sandor Caetano
OffRL
48
15
0
30 Sep 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep
  Q-Network
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
40
0
0
29 Sep 2020
Towards Effective Context for Meta-Reinforcement Learning: an Approach
  based on Contrastive Learning
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
Haotian Fu
Hongyao Tang
Jianye Hao
Chong Chen
Xidong Feng
Dong Li
Wulong Liu
OffRL
83
50
0
29 Sep 2020
Cross Learning in Deep Q-Networks
Cross Learning in Deep Q-Networks
Xing Wang
A. Vinel
25
2
0
29 Sep 2020
Novelty Search in Representational Space for Sample Efficient
  Exploration
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
90
45
0
28 Sep 2020
Agent Environment Cycle Games
J. K. Terry
Nathaniel Grammel
Benjamin Black
Ananth Hari
Caroline Horsch
L. Santos
65
7
0
28 Sep 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a
  Survey
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
256
743
0
24 Sep 2020
Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle
  in Virtual Open Space with Static Obstacles
Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle in Virtual Open Space with Static Obstacles
Sanghyun Kim
Jongmin Park
Jae-Kwan Yun
Jiwon Seo
28
17
0
24 Sep 2020
The Agent Web Model -- Modelling web hacking for reinforcement learning
The Agent Web Model -- Modelling web hacking for reinforcement learning
L. Erdődi
Fabio Massimo Zennaro
24
3
0
23 Sep 2020
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to
  District Demand Side Management through CityLearn
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn
Anjukan Kathirgamanathan
Kacper Twardowski
E. Mangina
D. Finn
63
21
0
22 Sep 2020
Learning Task-Agnostic Action Spaces for Movement Optimization
Learning Task-Agnostic Action Spaces for Movement Optimization
Amin Babadi
M. van de Panne
Caren Liu
Perttu Hämäläinen
51
2
0
22 Sep 2020
CMAX++ : Leveraging Experience in Planning and Execution using
  Inaccurate Models
CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models
Anirudh Vemula
J. Andrew Bagnell
Maxim Likhachev
88
9
0
21 Sep 2020
Deep Reinforcement Learning Methods for Structure-Guided Processing Path
  Optimization
Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization
Johannes Dornheim
L. Morand
Samuel Zeitvogel
Tarek Iraki
Norbert Link
Dirk Helm
58
21
0
21 Sep 2020
RL STaR Platform: Reinforcement Learning for Simulation based Training
  of Robots
RL STaR Platform: Reinforcement Learning for Simulation based Training of Robots
Tamir Blum
Gabin Paillet
Mickaël Laîné
Kazuya Yoshida
43
7
0
21 Sep 2020
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent
  Policy Optimization
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization
Feng Tao
Yongcan Cao
99
2
0
21 Sep 2020
Multiplayer Support for the Arcade Learning Environment
Multiplayer Support for the Arcade Learning Environment
J. K. Terry
Benjamin Black
Luis Santos
74
13
0
20 Sep 2020
Measuring the Complexity of Domains Used to Evaluate AI Systems
Measuring the Complexity of Domains Used to Evaluate AI Systems
Christopher Pereyda
Lawrence Holder
25
3
0
18 Sep 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic
GRAC: Self-Guided and Self-Regularized Actor-Critic
Lin Shao
Yifan You
Mengyuan Yan
Qingyun Sun
Jeannette Bohg
89
24
0
18 Sep 2020
Autonomous Learning of Features for Control: Experiments with Embodied
  and Situated Agents
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents
Nicola Milano
S. Nolfi
46
0
0
15 Sep 2020
Extended Radial Basis Function Controller for Reinforcement Learning
Extended Radial Basis Function Controller for Reinforcement Learning
Nicholas Capel
Naifu Zhang
35
1
0
12 Sep 2020
Physically Embedded Planning Problems: New Challenges for Reinforcement
  Learning
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
M. Berk Mirza
Andrew Jaegle
Jonathan J. Hunt
A. Guez
S. Tunyasuvunakool
...
Peter Karkus
S. Racanière
Lars Buesing
Timothy Lillicrap
N. Heess
AI4CE
79
12
0
11 Sep 2020
TripleTree: A Versatile Interpretable Representation of Black Box Agents
  and their Environments
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments
Tom Bewley
J. Lawry
FAtt
74
27
0
10 Sep 2020
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement
  Learning
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning
M. Arango
Lyudmil Pelov
57
17
0
10 Sep 2020
Integrated Benchmarking and Design for Reproducible and Accessible
  Evaluation of Robotic Agents
Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents
J. Tani
Andrea F. Daniele
Gianmarco Bernasconi
Amaury Camus
Aleksandar Petrov
...
Tomasz Zaluska
Matthew R. Walter
Emilio Frazzoli
Liam Paull
A. Censi
50
8
0
09 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in
  Continuous Control
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control
V. M. Alvarez
R. Rosca
Cristian G. Falcutescu
63
11
0
09 Sep 2020
Visualizing the Loss Landscape of Actor Critic Methods with Applications
  in Inventory Optimization
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization
Recep Yusuf Bekci
M. Gümüş
29
4
0
04 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators
  using Reinforcement Learning
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
111
96
0
04 Sep 2020
SEDRo: A Simulated Environment for Developmental Robotics
SEDRo: A Simulated Environment for Developmental Robotics
Aishwarya Pothula
Md Ashaduzzaman Rubel Mondol
Sanath Narasimhan
Sm Mazharul Islam
Deokgun Park
36
5
0
03 Sep 2020
Adaptive Risk Sensitive Model Predictive Control with Stochastic Search
Adaptive Risk Sensitive Model Predictive Control with Stochastic Search
Ziyi Wang
Oswin So
Keuntaek Lee
Camilo A. Duarte
Evangelos A. Theodorou
61
2
0
02 Sep 2020
Previous
123...323334...505152
Next