1606.01540
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 2,578 papers shown
Title
Causal Repair of Learning-enabled Cyber-physical Systems
Pengyuan Lu
I. Ruchkin
Matthew Cleaveland
O. Sokolsky
Insup Lee
52
2
0
06 Apr 2023
AutoRL Hyperparameter Landscapes
Aditya Mohan
C. Benjamins
Konrad Wienecke
A. Dockhorn
Marius Lindauer
136
8
0
05 Apr 2023
Online augmentation of learned grasp sequence policies for more adaptable and data-efficient in-hand manipulation
E. Gordon
Rana Soltani-Zarrin
OffRL
58
6
0
04 Apr 2023
Quantum Imitation Learning
Zhihao Cheng
Kaining Zhang
Li Shen
Dacheng Tao
67
1
0
04 Apr 2023
PyFlyt -- UAV Simulation Environments for Reinforcement Learning Research
Jun Jet Tai
J. Wong
M. Innocente
N. Horri
J. Brusey
S. K. Phang
120
10
0
03 Apr 2023
Managing power grids through topology actions: A comparative study between advanced rule-based and reinforcement learning agents
Malte Lehna
J. Viebahn
Christoph Scholz
Antoine Marot
Sven Tomforde
77
22
0
03 Apr 2023
Physical Deep Reinforcement Learning Towards Safety Guarantee
H. Cao
Y. Mao
L. Sha
Marco Caccamo
AI4CE
24
5
0
29 Mar 2023
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations
Li Haofeng
C. Yiwen
Tan Jiayi
Marcelo H. Ang Jr
OffRL
35
2
0
29 Mar 2023
A Survey of Machine Learning-Based Ride-Hailing Planning
Dacheng Wen
Yupeng Li
F. Lau
68
4
0
26 Mar 2023
Sequential Knockoffs for Variable Selection in Reinforcement Learning
Tao Ma
Hengrui Cai
Zhengling Qi
C. Shi
Eric B. Laber
97
3
0
24 Mar 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework
K. Mehta
Anuj Mahajan
Kiran Ravish
109
3
0
24 Mar 2023
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
...
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
J. Miller
Rohin Shah
59
16
0
23 Mar 2023
Reinforcement Learning with Exogenous States and Rewards
George Trimponias
Thomas G. Dietterich
OffRL
65
2
0
22 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
81
0
0
22 Mar 2023
A multi-functional simulation platform for on-demand ride service operations
Siyuan Feng
Taijie Chen
Yuhao Zhang
Jintao Ke
Zhengfei Zheng
Hai Yang
35
5
0
22 Mar 2023
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning
Qiming Ye
Yuxiang Feng
Jose Javier Escribano Macias
M. Stettler
Panagiotis Angeloudis
44
4
0
22 Mar 2023
ChatGPT and a New Academic Reality: Artificial Intelligence-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing
Brady Lund
Ting Wang
Nishith Reddy Mannuru
Bing Nie
S. Shimray
Ziang Wang
AI4CE
94
529
0
21 Mar 2023
Affective Workload Allocation for Multi-human Multi-robot Teams
Wonse Jo
Ruiqi Wang
B. Yang
D. Foti
M. Rastgaar
Byung-Cheol Min
33
1
0
18 Mar 2023
Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms
Ali Beikmohammadi
Sindri Magnússon
50
3
0
17 Mar 2023
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP
A. Falah
Shibashis Guha
Ashutosh Trivedi
36
0
0
16 Mar 2023
Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning
Junqi Qian
Paul Weng
Chenmien Tan
71
1
0
16 Mar 2023
MAHTM: A Multi-Agent Framework for Hierarchical Transactive Microgrids
Nicolas Mauricio Cuadrado
Roberto Gutiérrez
Yongli Zhu
Martin Takáč
72
2
0
15 Mar 2023
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring
Merlijn Krale
T. D. Simão
N. Jansen
OffRL
57
7
0
14 Mar 2023
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications
Marco Kemmerling
Daniel Lutticke
Robert H. Schmitt
74
15
0
14 Mar 2023
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication
Payam Parvizi
Runnan Zou
C. Bellinger
R. Cheriton
D. Spinello
42
2
0
13 Mar 2023
Kernel Density Bayesian Inverse Reinforcement Learning
Aishwarya Mandyam
Didong Li
Jiayu Yao
Diana Cai
Andrew Jones
Barbara E. Engelhardt
OffRL
BDL
90
3
0
13 Mar 2023
Synthetic Experience Replay
Cong Lu
Philip J. Ball
Yee Whye Teh
Jack Parker-Holder
OffRL
170
80
0
12 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
53
1
0
09 Mar 2023
On the Benefits of Biophysical Synapses
Julian Lemmel
Radu Grosu
18
0
0
08 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning
Nick Bührer
Zhejun Zhang
Alexander Liniger
Feng Yu
Luc Van Gool
67
1
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
142
56
0
07 Mar 2023
Learning to Backdoor Federated Learning
Henger Li
Chen Wu
Senchun Zhu
Zizhan Zheng
FedML
82
10
0
06 Mar 2023
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
91
41
0
05 Mar 2023
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks
Zijian Hu
Wei-Ying Ma
39
5
0
04 Mar 2023
CoRL: Environment Creation and Management Focused on System Integration
J. D. Merrick
Benjamin K. Heiner
Cameron Long
Brian Stieber
Steve Fierro
Vardaan Gangal
Madison Blake
Joshua Blackburn
AI4CE
76
2
0
03 Mar 2023
Synthetic Data Generator for Adaptive Interventions in Global Health
Aditya Rastogi
J. F. Garamendi
Ana Fernández del Río
Anna Guitart
Moiz Hassan Khan
Dexian Tang
África Periánez
63
0
0
03 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents
Nicholas Conlon
Nisar R. Ahmed
D. Szafir
49
3
0
03 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
69
1
0
02 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement
Petr Bobák
Ladislav Čmolík
Martin Cadík
OffRL
68
3
0
02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
83
14
0
01 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W. Tsang
Yew-Soon Ong
56
0
0
28 Feb 2023
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
58
1
0
27 Feb 2023
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
Sagar Patel
Sangeetha Abdu Jyothi
Nina Narodytska
OffRL
54
0
0
27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
75
2
0
26 Feb 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
C. Cunha
Wen Liu
T. French
Ajmal Mian
71
1
0
26 Feb 2023
DeepCPG Policies for Robot Locomotion
Aditya M. Deshpande
Eric Hurd
A. Minai
Manish Kumar
74
9
0
25 Feb 2023
Autonomous Exploration and Mapping for Mobile Robots via Cumulative Curriculum Reinforcement Learning
Zehan Li
Jinghao Xin
Ning Li
67
5
0
25 Feb 2023
EvoTorch: Scalable Evolutionary Computation in Python
N. E. Toklu
Timothy James Atkinson
Vojtěch Micka
Paweł Liskowski
R. Srivastava
94
14
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
97
8
0
24 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
49
9
0
20 Feb 2023