Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Qingyu Wang
Bo Xu
OffRL
379
5
0
29 Dec 2022
Backward Curriculum Reinforcement Learning
Kyungmin Ko
OnRL
40
0
0
29 Dec 2022
Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization
Zahra Shahrooei
Mykel J. Kochenderfer
Ali Baheri
85
7
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
52
0
0
27 Dec 2022
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park
Taeyoung Kim
Woohyeon Moon
L. Vecchietti
Dongsoo Har
OffRL
62
2
0
26 Dec 2022
Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators
Harshali Agarwal
Heena Rathore
58
3
0
25 Dec 2022
NARS vs. Reinforcement learning: ONA vs. Q-Learning
Ali Beikmohammadi
108
0
0
23 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
86
18
0
22 Dec 2022
Hyperparameters in Contextual RL are Highly Situational
Theresa Eimer
C. Benjamins
Marius Lindauer
143
4
0
21 Dec 2022
Neighboring state-based RL Exploration
Jeffery Cheng
Kevin Li
Justin Lin
Pedro Pachuca
OffRL
22
0
0
21 Dec 2022
Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning
Sayak Mukherjee
Ramij-Raja Hossain
Yuan Liu
W. Du
Veronica Adetola
Sheik M. Mohiuddin
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
56
5
0
17 Dec 2022
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
111
5
0
17 Dec 2022
Emergent Behaviors in Multi-Agent Target Acquisition
P. Sharma
Erin G. Zaroukian
Derrik E. Asher
Bryson Howell
104
1
0
15 Dec 2022
Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
49
12
0
14 Dec 2022
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
Brahma S. Pavse
Josiah P. Hanna
OffRL
69
7
0
14 Dec 2022
Reinforcement Learning in System Identification
J. Antonio
Martin H Oscar Fernández
Sergio Pérez
Anas Belfadil
C. Ibáñez-Llano
Freddy José Perozo
Javier Valle
Javier Arechalde Pelaz
49
0
0
14 Dec 2022
Efficient Exploration in Resource-Restricted Reinforcement Learning
Zhihai Wang
Taoxing Pan
Qi Zhou
Jie Wang
OffRL
54
12
0
14 Dec 2022
Quantum Policy Gradient Algorithm with Optimized Action Decoding
Nico Meyer
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
M. Hartmann
61
22
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
44
0
0
13 Dec 2022
Minimax Optimal Estimation of Stability Under Distribution Shift
Hongseok Namkoong
Yuanzhe Ma
Peter Glynn
202
6
0
13 Dec 2022
AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin Ecosystem for Enhancing Autonomous Driving Research and Education
Tanmay Vilas Samak
Chinmay Vilas Samak
Sivanathan Kandhasamy
Venkat Krovi
Mingjuan Xie
89
24
0
10 Dec 2022
Compiler Optimization for Quantum Computing Using Reinforcement Learning
Nils Quetschlich
Lukas Burgholzer
Robert Wille
76
27
0
08 Dec 2022
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
81
7
0
08 Dec 2022
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Cesare Caputo
Michel-Alexandre Cardin
Pudong Ge
Fei Teng
A. Korre
Ehecatl Antonio del Rio Chanona
39
18
0
08 Dec 2022
Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran
Alberto Maria Metelli
Marcello Restelli
OffRL
79
5
0
07 Dec 2022
Collision-tolerant Aerial Robots: A Survey
Paolo De Petris
S. Carlson
C. Papachristos
Kostas Alexis
105
4
0
06 Dec 2022
State Space Closure: Revisiting Endless Online Level Generation via Reinforcement Learning
Ziqi Wang
Tianye Shu
Jialin Liu
OffRL
53
1
0
06 Dec 2022
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks
Jie Zou
Jiashu Lou
Baohua Wang
Sixue Liu
AIFin
87
32
0
06 Dec 2022
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
61
1
0
05 Dec 2022
A Machine with Short-Term, Episodic, and Semantic Memory Systems
Taewoon Kim
Michael Cochez
Vincent Franccois-Lavet
Mark Antonius Neerincx
Piek Vossen
80
5
0
05 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
80
15
0
05 Dec 2022
Differentiated Federated Reinforcement Learning Based Traffic Offloading on Space-Air-Ground Integrated Networks
Yeguang Qin
Yilin Yang
Fengxiao Tang
Xin Yao
Mingde Zhao
Nei Kato
69
6
0
05 Dec 2022
Automata Learning meets Shielding
Martin Tappler
Stefan Pranger
Bettina Könighofer
Edi Muškardin
Roderick Bloem
Kim G. Larsen
72
5
0
04 Dec 2022
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao
Jun Zhang
Deheng Ye
Jiancheng Cao
Xiao Han
Qiang Fu
Wei Yang
OffRL
91
10
0
04 Dec 2022
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning
Andrew S. Albright
J. Vaughan
75
1
0
02 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
72
11
0
02 Dec 2022
STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning
Nikhil Kumar Singh
Indranil Saha
48
6
0
02 Dec 2022
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments
Christian Bitter
Timo Thun
Tobias Meisen
96
1
0
01 Dec 2022
Symmetry Detection in Trajectory Data for More Meaningful Reinforcement Learning Representations
Marissa DÁlonzo
Rebecca L. Russell
106
0
0
29 Nov 2022
Learning and Understanding a Disentangled Feature Representation for Hidden Parameters in Reinforcement Learning
Christopher P. Reale
Rebecca L. Russell
87
1
0
29 Nov 2022
Closing the gap between SVRG and TD-SVRG with Gradient Splitting
Arsenii Mustafin
Alexander Olshevsky
I. Paschalidis
47
1
0
29 Nov 2022
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
Qiangxing Tian
Kun Kuang
Furui Liu
Baoxiang Wang
OffRL
75
11
0
28 Nov 2022
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon
Alberto Pozanco
Daniel Borrajo
Sumitra Ganesh
OffRL
58
0
0
28 Nov 2022
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Tuan-Duong Trinh
Haoyu Chen
Daniel S. Brown
OffRL
72
8
0
28 Nov 2022
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CML
OffRL
73
5
0
28 Nov 2022
Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability
Whiyoung Jung
Myungsik Cho
Jongeui Park
Young-Jin Sung
85
4
0
28 Nov 2022
QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols
Dev Churiwala
Bhaskar Krishnamachari
16
4
0
28 Nov 2022
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
70
0
0
27 Nov 2022
Generalizing Gaussian Smoothing for Random Search
Katelyn Gao
Ozan Sener
87
14
0
27 Nov 2022
How Crucial is Transformer in Decision Transformer?
Max Siebenborn
Boris Belousov
Junning Huang
Jan Peters
54
15
0
26 Nov 2022
Previous
1
2
3
...
13
14
15
...
50
51
52
Next