ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in
  Spiking Policy Network
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Qingyu Wang
Bo Xu
OffRL
379
5
0
29 Dec 2022
Backward Curriculum Reinforcement Learning
Backward Curriculum Reinforcement Learning
Kyungmin Ko
OnRL
40
0
0
29 Dec 2022
Falsification of Learning-Based Controllers through Multi-Fidelity
  Bayesian Optimization
Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization
Zahra Shahrooei
Mykel J. Kochenderfer
Ali Baheri
85
7
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
52
0
0
27 Dec 2022
Off-Policy Reinforcement Learning with Loss Function Weighted by
  Temporal Difference Error
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park
Taeyoung Kim
Woohyeon Moon
L. Vecchietti
Dongsoo Har
OffRL
62
2
0
26 Dec 2022
Novel Reinforcement Learning Algorithm for Suppressing Synchronization
  in Closed Loop Deep Brain Stimulators
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harshali Agarwal
Heena Rathore
58
3
0
25 Dec 2022
NARS vs. Reinforcement learning: ONA vs. Q-Learning
NARS vs. Reinforcement learning: ONA vs. Q-Learning
Ali Beikmohammadi
108
0
0
23 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with
  Robotic and Human Co-Workers
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
86
18
0
22 Dec 2022
Hyperparameters in Contextual RL are Highly Situational
Hyperparameters in Contextual RL are Highly Situational
Theresa Eimer
C. Benjamins
Marius Lindauer
143
4
0
21 Dec 2022
Neighboring state-based RL Exploration
Neighboring state-based RL Exploration
Jeffery Cheng
Kevin Li
Justin Lin
Pedro Pachuca
OffRL
22
0
0
21 Dec 2022
Enhancing Cyber Resilience of Networked Microgrids using Vertical
  Federated Reinforcement Learning
Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning
Sayak Mukherjee
Ramij-Raja Hossain
Yuan Liu
W. Du
Veronica Adetola
Sheik M. Mohiuddin
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
56
5
0
17 Dec 2022
Training Robots to Evaluate Robots: Example-Based Interactive Reward
  Functions for Policy Learning
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
111
5
0
17 Dec 2022
Emergent Behaviors in Multi-Agent Target Acquisition
Emergent Behaviors in Multi-Agent Target Acquisition
P. Sharma
Erin G. Zaroukian
Derrik E. Asher
Bryson Howell
104
1
0
15 Dec 2022
Robust Policy Optimization in Deep Reinforcement Learning
Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
49
12
0
14 Dec 2022
Scaling Marginalized Importance Sampling to High-Dimensional
  State-Spaces via State Abstraction
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
Brahma S. Pavse
Josiah P. Hanna
OffRL
69
7
0
14 Dec 2022
Reinforcement Learning in System Identification
Reinforcement Learning in System Identification
J. Antonio
Martin H Oscar Fernández
Sergio Pérez
Anas Belfadil
C. Ibáñez-Llano
Freddy José Perozo
Javier Valle
Javier Arechalde Pelaz
49
0
0
14 Dec 2022
Efficient Exploration in Resource-Restricted Reinforcement Learning
Efficient Exploration in Resource-Restricted Reinforcement Learning
Zhihai Wang
Taoxing Pan
Qi Zhou
Jie Wang
OffRL
54
12
0
14 Dec 2022
Quantum Policy Gradient Algorithm with Optimized Action Decoding
Quantum Policy Gradient Algorithm with Optimized Action Decoding
Nico Meyer
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
M. Hartmann
61
22
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
44
0
0
13 Dec 2022
Minimax Optimal Estimation of Stability Under Distribution Shift
Minimax Optimal Estimation of Stability Under Distribution Shift
Hongseok Namkoong
Yuanzhe Ma
Peter Glynn
202
6
0
13 Dec 2022
AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin
  Ecosystem for Enhancing Autonomous Driving Research and Education
AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin Ecosystem for Enhancing Autonomous Driving Research and Education
Tanmay Vilas Samak
Chinmay Vilas Samak
Sivanathan Kandhasamy
Venkat Krovi
Mingjuan Xie
89
24
0
10 Dec 2022
Compiler Optimization for Quantum Computing Using Reinforcement Learning
Compiler Optimization for Quantum Computing Using Reinforcement Learning
Nils Quetschlich
Lukas Burgholzer
Robert Wille
76
27
0
08 Dec 2022
Model-based trajectory stitching for improved behavioural cloning and
  its applications
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
81
7
0
08 Dec 2022
Design and Planning of Flexible Mobile Micro-Grids Using Deep
  Reinforcement Learning
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Cesare Caputo
Michel-Alexandre Cardin
Pudong Ge
Fei Teng
A. Korre
Ehecatl Antonio del Rio Chanona
39
18
0
08 Dec 2022
Tight Performance Guarantees of Imitator Policies with Continuous
  Actions
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
79
5
0
07 Dec 2022
Collision-tolerant Aerial Robots: A Survey
Collision-tolerant Aerial Robots: A Survey
Paolo De Petris
S. Carlson
C. Papachristos
Kostas Alexis
105
4
0
06 Dec 2022
State Space Closure: Revisiting Endless Online Level Generation via
  Reinforcement Learning
State Space Closure: Revisiting Endless Online Level Generation via Reinforcement Learning
Ziqi Wang
Tianye Shu
Jialin Liu
OffRL
53
1
0
06 Dec 2022
A Novel Deep Reinforcement Learning Based Automated Stock Trading System
  Using Cascaded LSTM Networks
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks
Jie Zou
Jiashu Lou
Baohua Wang
Sixue Liu
AIFin
87
32
0
06 Dec 2022
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce
  Order Fraud Evaluation
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
61
1
0
05 Dec 2022
A Machine with Short-Term, Episodic, and Semantic Memory Systems
A Machine with Short-Term, Episodic, and Semantic Memory Systems
Taewoon Kim
Michael Cochez
Vincent Franccois-Lavet
Mark Antonius Neerincx
Piek Vossen
80
5
0
05 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D
  Shapes
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
80
15
0
05 Dec 2022
Differentiated Federated Reinforcement Learning Based Traffic Offloading
  on Space-Air-Ground Integrated Networks
Differentiated Federated Reinforcement Learning Based Traffic Offloading on Space-Air-Ground Integrated Networks
Yeguang Qin
Yilin Yang
Fengxiao Tang
Xin Yao
Mingde Zhao
Nei Kato
69
6
0
05 Dec 2022
Automata Learning meets Shielding
Automata Learning meets Shielding
Martin Tappler
Stefan Pranger
Bettina Könighofer
Edi Muškardin
Roderick Bloem
Kim G. Larsen
72
5
0
04 Dec 2022
RLogist: Fast Observation Strategy on Whole-slide Images with Deep
  Reinforcement Learning
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao
Jun Zhang
Deheng Ye
Jiancheng Cao
Xiao Han
Qiang Fu
Wei Yang
OffRL
91
10
0
04 Dec 2022
Selecting Mechanical Parameters of a Monopode Jumping System with
  Reinforcement Learning
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning
Andrew S. Albright
J. Vaughan
75
1
0
02 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in
  Entropy-Regularized Reinforcement Learning
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
72
11
0
02 Dec 2022
STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning
STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning
Nikhil Kumar Singh
Indranil Saha
48
6
0
02 Dec 2022
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task
  Environments
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments
Christian Bitter
Timo Thun
Tobias Meisen
96
1
0
01 Dec 2022
Symmetry Detection in Trajectory Data for More Meaningful Reinforcement
  Learning Representations
Symmetry Detection in Trajectory Data for More Meaningful Reinforcement Learning Representations
Marissa DÁlonzo
Rebecca L. Russell
106
0
0
29 Nov 2022
Learning and Understanding a Disentangled Feature Representation for
  Hidden Parameters in Reinforcement Learning
Learning and Understanding a Disentangled Feature Representation for Hidden Parameters in Reinforcement Learning
Christopher P. Reale
Rebecca L. Russell
87
1
0
29 Nov 2022
Closing the gap between SVRG and TD-SVRG with Gradient Splitting
Closing the gap between SVRG and TD-SVRG with Gradient Splitting
Arsenii Mustafin
Alexander Olshevsky
I. Paschalidis
47
1
0
29 Nov 2022
Learning from Good Trajectories in Offline Multi-Agent Reinforcement
  Learning
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
Qiangxing Tian
Kun Kuang
Furui Liu
Baoxiang Wang
OffRL
75
11
0
28 Nov 2022
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement
  Learning
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon
Alberto Pozanco
Daniel Borrajo
Sumitra Ganesh
OffRL
58
0
0
28 Nov 2022
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse
  Reinforcement Learning
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Tuan-Duong Trinh
Haoyu Chen
Daniel S. Brown
OffRL
72
8
0
28 Nov 2022
Causal Deep Reinforcement Learning Using Observational Data
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CMLOffRL
73
5
0
28 Nov 2022
Quantile Constrained Reinforcement Learning: A Reinforcement Learning
  Framework Constraining Outage Probability
Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability
Whiyoung Jung
Myungsik Cho
Jongeui Park
Young-Jin Sung
85
4
0
28 Nov 2022
QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market
  Making Protocols
QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols
Dev Churiwala
Bhaskar Krishnamachari
16
4
0
28 Nov 2022
Domain Generalization for Robust Model-Based Offline Reinforcement
  Learning
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OODOffRL
70
0
0
27 Nov 2022
Generalizing Gaussian Smoothing for Random Search
Generalizing Gaussian Smoothing for Random Search
Katelyn Gao
Ozan Sener
87
14
0
27 Nov 2022
How Crucial is Transformer in Decision Transformer?
How Crucial is Transformer in Decision Transformer?
Max Siebenborn
Boris Belousov
Junning Huang
Jan Peters
54
15
0
26 Nov 2022
Previous
123...131415...505152
Next