ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,677 papers shown
Title
Perceptive Locomotion with Controllable Pace and Natural Gait
  Transitions Over Uneven Terrains
Perceptive Locomotion with Controllable Pace and Natural Gait Transitions Over Uneven Terrains
Daniel C.H. Tan
Jenny Zhang
Michael
M. Chuah
Zhibin Li
31
2
0
26 Jan 2023
Evaluating Deception and Moving Target Defense with Network Attack
  Simulation
Evaluating Deception and Moving Target Defense with Network Attack Simulation
Daniel Reti
Karina Elzer
Daniel Fraunholz
Daniel Schneider
Hans D. Schotten
AAML
26
7
0
25 Jan 2023
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Keshav Iyengar
Sarah Spurgeon
Danail Stoyanov
MedIm
26
4
0
22 Jan 2023
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal
  Execution in Limit Order Book Markets
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets
Peer Nagy
Jan-Peter Calliess
S. Zohren
34
3
0
20 Jan 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement
  Learning
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
25
2
0
20 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
42
125
0
19 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative
  Reward Co-Training
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
35
1
0
18 Jan 2023
Adversarial Robust Deep Reinforcement Learning Requires Redefining
  Robustness
Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness
Ezgi Korkmaz
29
27
0
17 Jan 2023
Asynchronous training of quantum reinforcement learning
Asynchronous training of quantum reinforcement learning
Samuel Yen-Chi Chen
OffRL
37
20
0
12 Jan 2023
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics
  Models
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics Models
Yi Liu
Gaurav Datta
Ellen R. Novoseller
Daniel S. Brown
46
22
0
11 Jan 2023
schlably: A Python Framework for Deep Reinforcement Learning Based
  Scheduling Experiments
schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments
Constantin Waubert de Puiseau
Jannik Peters
Christian Dörpelkus
Hasan Tercan
Tobias Meisen
OffRL
22
7
0
10 Jan 2023
Learning to Perceive in Deep Model-Free Reinforcement Learning
Learning to Perceive in Deep Model-Free Reinforcement Learning
Gonccalo Querido
Alberto Sardinha
Francisco S. Melo
27
0
0
10 Jan 2023
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion
  Detection
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion Detection
Caroline Strickland
Chandrika Saha
Muhammad Zakar
Sareh Nejad
Noshin Tasnim
D. Lizotte
Anwar Haque
37
10
0
05 Jan 2023
Character Simulation Using Imitation Learning With Game Engine Physics
Character Simulation Using Imitation Learning With Game Engine Physics
Joao Rodrigues
R. Nóbrega
AI4CE
24
2
0
05 Jan 2023
Genetic Imitation Learning by Reward Extrapolation
Genetic Imitation Learning by Reward Extrapolation
Boyuan Zheng
Jianlong Zhou
Fang Chen
24
0
0
03 Jan 2023
Explaining Imitation Learning through Frames
Explaining Imitation Learning through Frames
Boyuan Zheng
Jianlong Zhou
Chun-Hao Liu
Yiqiao Li
Fang Chen
24
0
0
03 Jan 2023
A Policy Optimization Method Towards Optimal-time Stability
A Policy Optimization Method Towards Optimal-time Stability
Shengjie Wang
Lan Fengb
Xiang Zheng
Yu-wen Cao
Oluwatosin Oseni
Haotian Xu
Tao Zhang
Yang Gao
39
1
0
02 Jan 2023
On the Challenges of using Reinforcement Learning in Precision Drug
  Dosing: Delay and Prolongedness of Action Effects
On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects
Sumana Basu
M. Legault
Adriana Romero Soriano
Doina Precup
OffRL
28
3
0
02 Jan 2023
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in
  Spiking Policy Network
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Qingyu Wang
Bo Xu
OffRL
224
5
0
29 Dec 2022
Backward Curriculum Reinforcement Learning
Backward Curriculum Reinforcement Learning
Kyungmin Ko
OnRL
17
0
0
29 Dec 2022
Falsification of Learning-Based Controllers through Multi-Fidelity
  Bayesian Optimization
Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization
Zahra Shahrooei
Mykel J. Kochenderfer
Ali Baheri
48
6
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
26
0
0
27 Dec 2022
Off-Policy Reinforcement Learning with Loss Function Weighted by
  Temporal Difference Error
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park
Taeyoung Kim
Woohyeon Moon
L. Vecchietti
Dongsoo Har
OffRL
39
2
0
26 Dec 2022
Novel Reinforcement Learning Algorithm for Suppressing Synchronization
  in Closed Loop Deep Brain Stimulators
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harshali Agarwal
Heena Rathore
38
3
0
25 Dec 2022
NARS vs. Reinforcement learning: ONA vs. Q-Learning
NARS vs. Reinforcement learning: ONA vs. Q-Learning
Ali Beikmohammadi
21
0
0
23 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with
  Robotic and Human Co-Workers
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
24
17
0
22 Dec 2022
Hyperparameters in Contextual RL are Highly Situational
Hyperparameters in Contextual RL are Highly Situational
Theresa Eimer
C. Benjamins
Marius Lindauer
26
4
0
21 Dec 2022
Neighboring state-based RL Exploration
Neighboring state-based RL Exploration
Jeffery Cheng
Kevin Li
Justin Lin
Pedro Pachuca
OffRL
15
0
0
21 Dec 2022
Enhancing Cyber Resilience of Networked Microgrids using Vertical
  Federated Reinforcement Learning
Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning
Sayak Mukherjee
Ramij-Raja Hossain
Yuan Liu
W. Du
Veronica Adetola
Sheik M. Mohiuddin
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
31
4
0
17 Dec 2022
Training Robots to Evaluate Robots: Example-Based Interactive Reward
  Functions for Policy Learning
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
43
5
0
17 Dec 2022
Emergent Behaviors in Multi-Agent Target Acquisition
Emergent Behaviors in Multi-Agent Target Acquisition
P. Sharma
Erin G. Zaroukian
Derrik E. Asher
Bryson Howell
37
1
0
15 Dec 2022
Robust Policy Optimization in Deep Reinforcement Learning
Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
25
8
0
14 Dec 2022
Scaling Marginalized Importance Sampling to High-Dimensional
  State-Spaces via State Abstraction
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
Brahma S. Pavse
Josiah P. Hanna
OffRL
37
7
0
14 Dec 2022
Reinforcement Learning in System Identification
Reinforcement Learning in System Identification
J. Antonio
Martin H Oscar Fernández
Sergio Pérez
Anas Belfadil
C. Ibáñez-Llano
Freddy José Perozo
Javier Valle
Javier Arechalde Pelaz
20
0
0
14 Dec 2022
Efficient Exploration in Resource-Restricted Reinforcement Learning
Efficient Exploration in Resource-Restricted Reinforcement Learning
Zhihai Wang
Taoxing Pan
Qi Zhou
Jie Wang
OffRL
20
10
0
14 Dec 2022
Quantum Policy Gradient Algorithm with Optimized Action Decoding
Quantum Policy Gradient Algorithm with Optimized Action Decoding
Nico Meyer
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
M. Hartmann
29
20
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
25
0
0
13 Dec 2022
Minimax Optimal Estimation of Stability Under Distribution Shift
Minimax Optimal Estimation of Stability Under Distribution Shift
Hongseok Namkoong
Yuanzhe Ma
Peter Glynn
42
6
0
13 Dec 2022
AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin
  Ecosystem for Enhancing Autonomous Driving Research and Education
AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin Ecosystem for Enhancing Autonomous Driving Research and Education
Tanmay Vilas Samak
Chinmay Vilas Samak
Sivanathan Kandhasamy
Venkat Krovi
Mingjuan Xie
35
23
0
10 Dec 2022
Compiler Optimization for Quantum Computing Using Reinforcement Learning
Compiler Optimization for Quantum Computing Using Reinforcement Learning
Nils Quetschlich
Lukas Burgholzer
Robert Wille
48
26
0
08 Dec 2022
Model-based trajectory stitching for improved behavioural cloning and
  its applications
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
37
5
0
08 Dec 2022
Design and Planning of Flexible Mobile Micro-Grids Using Deep
  Reinforcement Learning
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Cesare Caputo
Michel-Alexandre Cardin
Pudong Ge
Fei Teng
A. Korre
Ehecatl Antonio del Rio Chanona
22
18
0
08 Dec 2022
Tight Performance Guarantees of Imitator Policies with Continuous
  Actions
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
33
5
0
07 Dec 2022
Collision-tolerant Aerial Robots: A Survey
Collision-tolerant Aerial Robots: A Survey
Paolo De Petris
S. Carlson
C. Papachristos
Kostas Alexis
47
4
0
06 Dec 2022
State Space Closure: Revisiting Endless Online Level Generation via
  Reinforcement Learning
State Space Closure: Revisiting Endless Online Level Generation via Reinforcement Learning
Ziqi Wang
Tianye Shu
Jialin Liu
OffRL
29
1
0
06 Dec 2022
A Novel Deep Reinforcement Learning Based Automated Stock Trading System
  Using Cascaded LSTM Networks
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks
Jie Zou
Jiashu Lou
Baohua Wang
Sixue Liu
AIFin
29
28
0
06 Dec 2022
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce
  Order Fraud Evaluation
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
27
1
0
05 Dec 2022
A Machine with Short-Term, Episodic, and Semantic Memory Systems
A Machine with Short-Term, Episodic, and Semantic Memory Systems
Taewoon Kim
Michael Cochez
Vincent Franccois-Lavet
Mark Antonius Neerincx
Piek Vossen
45
5
0
05 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D
  Shapes
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
45
14
0
05 Dec 2022
Previous
123...121314...323334
Next