Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,677 papers shown
Title
Perceptive Locomotion with Controllable Pace and Natural Gait Transitions Over Uneven Terrains
Daniel C.H. Tan
Jenny Zhang
Michael
M. Chuah
Zhibin Li
31
2
0
26 Jan 2023
Evaluating Deception and Moving Target Defense with Network Attack Simulation
Daniel Reti
Karina Elzer
Daniel Fraunholz
Daniel Schneider
Hans D. Schotten
AAML
26
7
0
25 Jan 2023
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Keshav Iyengar
Sarah Spurgeon
Danail Stoyanov
MedIm
26
4
0
22 Jan 2023
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets
Peer Nagy
Jan-Peter Calliess
S. Zohren
34
3
0
20 Jan 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
25
2
0
20 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
42
125
0
19 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
35
1
0
18 Jan 2023
Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness
Ezgi Korkmaz
29
27
0
17 Jan 2023
Asynchronous training of quantum reinforcement learning
Samuel Yen-Chi Chen
OffRL
37
20
0
12 Jan 2023
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics Models
Yi Liu
Gaurav Datta
Ellen R. Novoseller
Daniel S. Brown
46
22
0
11 Jan 2023
schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments
Constantin Waubert de Puiseau
Jannik Peters
Christian Dörpelkus
Hasan Tercan
Tobias Meisen
OffRL
22
7
0
10 Jan 2023
Learning to Perceive in Deep Model-Free Reinforcement Learning
Gonccalo Querido
Alberto Sardinha
Francisco S. Melo
27
0
0
10 Jan 2023
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion Detection
Caroline Strickland
Chandrika Saha
Muhammad Zakar
Sareh Nejad
Noshin Tasnim
D. Lizotte
Anwar Haque
37
10
0
05 Jan 2023
Character Simulation Using Imitation Learning With Game Engine Physics
Joao Rodrigues
R. Nóbrega
AI4CE
24
2
0
05 Jan 2023
Genetic Imitation Learning by Reward Extrapolation
Boyuan Zheng
Jianlong Zhou
Fang Chen
24
0
0
03 Jan 2023
Explaining Imitation Learning through Frames
Boyuan Zheng
Jianlong Zhou
Chun-Hao Liu
Yiqiao Li
Fang Chen
24
0
0
03 Jan 2023
A Policy Optimization Method Towards Optimal-time Stability
Shengjie Wang
Lan Fengb
Xiang Zheng
Yu-wen Cao
Oluwatosin Oseni
Haotian Xu
Tao Zhang
Yang Gao
39
1
0
02 Jan 2023
On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects
Sumana Basu
M. Legault
Adriana Romero Soriano
Doina Precup
OffRL
28
3
0
02 Jan 2023
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Qingyu Wang
Bo Xu
OffRL
224
5
0
29 Dec 2022
Backward Curriculum Reinforcement Learning
Kyungmin Ko
OnRL
17
0
0
29 Dec 2022
Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization
Zahra Shahrooei
Mykel J. Kochenderfer
Ali Baheri
48
6
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
26
0
0
27 Dec 2022
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park
Taeyoung Kim
Woohyeon Moon
L. Vecchietti
Dongsoo Har
OffRL
39
2
0
26 Dec 2022
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harshali Agarwal
Heena Rathore
38
3
0
25 Dec 2022
NARS vs. Reinforcement learning: ONA vs. Q-Learning
Ali Beikmohammadi
21
0
0
23 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
24
17
0
22 Dec 2022
Hyperparameters in Contextual RL are Highly Situational
Theresa Eimer
C. Benjamins
Marius Lindauer
26
4
0
21 Dec 2022
Neighboring state-based RL Exploration
Jeffery Cheng
Kevin Li
Justin Lin
Pedro Pachuca
OffRL
15
0
0
21 Dec 2022
Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning
Sayak Mukherjee
Ramij-Raja Hossain
Yuan Liu
W. Du
Veronica Adetola
Sheik M. Mohiuddin
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
31
4
0
17 Dec 2022
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
43
5
0
17 Dec 2022
Emergent Behaviors in Multi-Agent Target Acquisition
P. Sharma
Erin G. Zaroukian
Derrik E. Asher
Bryson Howell
37
1
0
15 Dec 2022
Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
25
8
0
14 Dec 2022
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
Brahma S. Pavse
Josiah P. Hanna
OffRL
37
7
0
14 Dec 2022
Reinforcement Learning in System Identification
J. Antonio
Martin H Oscar Fernández
Sergio Pérez
Anas Belfadil
C. Ibáñez-Llano
Freddy José Perozo
Javier Valle
Javier Arechalde Pelaz
20
0
0
14 Dec 2022
Efficient Exploration in Resource-Restricted Reinforcement Learning
Zhihai Wang
Taoxing Pan
Qi Zhou
Jie Wang
OffRL
20
10
0
14 Dec 2022
Quantum Policy Gradient Algorithm with Optimized Action Decoding
Nico Meyer
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
M. Hartmann
29
20
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
25
0
0
13 Dec 2022
Minimax Optimal Estimation of Stability Under Distribution Shift
Hongseok Namkoong
Yuanzhe Ma
Peter Glynn
42
6
0
13 Dec 2022
AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin Ecosystem for Enhancing Autonomous Driving Research and Education
Tanmay Vilas Samak
Chinmay Vilas Samak
Sivanathan Kandhasamy
Venkat Krovi
Mingjuan Xie
35
23
0
10 Dec 2022
Compiler Optimization for Quantum Computing Using Reinforcement Learning
Nils Quetschlich
Lukas Burgholzer
Robert Wille
48
26
0
08 Dec 2022
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
37
5
0
08 Dec 2022
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Cesare Caputo
Michel-Alexandre Cardin
Pudong Ge
Fei Teng
A. Korre
Ehecatl Antonio del Rio Chanona
22
18
0
08 Dec 2022
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
33
5
0
07 Dec 2022
Collision-tolerant Aerial Robots: A Survey
Paolo De Petris
S. Carlson
C. Papachristos
Kostas Alexis
47
4
0
06 Dec 2022
State Space Closure: Revisiting Endless Online Level Generation via Reinforcement Learning
Ziqi Wang
Tianye Shu
Jialin Liu
OffRL
29
1
0
06 Dec 2022
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks
Jie Zou
Jiashu Lou
Baohua Wang
Sixue Liu
AIFin
29
28
0
06 Dec 2022
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
27
1
0
05 Dec 2022
A Machine with Short-Term, Episodic, and Semantic Memory Systems
Taewoon Kim
Michael Cochez
Vincent Franccois-Lavet
Mark Antonius Neerincx
Piek Vossen
45
5
0
05 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
45
14
0
05 Dec 2022
Previous
1
2
3
...
12
13
14
...
32
33
34
Next