OpenAI Gym

5 June 2016

Papers citing "OpenAI Gym"

50 / 2,578 papers shown

Title
Causal Repair of Learning-enabled Cyber-physical Systems Pengyuan Lu I. Ruchkin Matthew Cleaveland O. Sokolsky Insup Lee 52 2 0 06 Apr 2023
AutoRL Hyperparameter Landscapes Aditya Mohan C. Benjamins Konrad Wienecke A. Dockhorn Marius Lindauer 136 8 0 05 Apr 2023
Online augmentation of learned grasp sequence policies for more adaptable and data-efficient in-hand manipulation E. Gordon Rana Soltani-Zarrin OffRL 58 6 0 04 Apr 2023
Quantum Imitation Learning Zhihao Cheng Kaining Zhang Li Shen Dacheng Tao 67 1 0 04 Apr 2023
PyFlyt -- UAV Simulation Environments for Reinforcement Learning Research Jun Jet Tai J. Wong M. Innocente N. Horri J. Brusey S. K. Phang 120 10 0 03 Apr 2023
Managing power grids through topology actions: A comparative study between advanced rule-based and reinforcement learning agents Malte Lehna J. Viebahn Christoph Scholz Antoine Marot Sven Tomforde 77 22 0 03 Apr 2023
Physical Deep Reinforcement Learning Towards Safety Guarantee H. Cao Y. Mao L. Sha Marco Caccamo AI4CE 24 5 0 29 Mar 2023
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations Li Haofeng C. Yiwen Tan Jiayi Marcelo H. Ang Jr OffRL 35 2 0 29 Mar 2023
A Survey of Machine Learning-Based Ride-Hailing Planning Dacheng Wen Yupeng Li F. Lau 68 4 0 26 Mar 2023
Sequential Knockoffs for Variable Selection in Reinforcement Learning Tao Ma Hengrui Cai Zhengling Qi C. Shi Eric B. Laber 97 3 0 24 Mar 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework K. Mehta Anuj Mahajan Kiran Ravish 109 3 0 24 Mar 2023
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition Stephanie Milani Anssi Kanervisto Karolis Ramanauskas Sander Schulhoff Brandon Houghton ... Vinicius G. Goecks Nicholas R. Waytowich David Watkins J. Miller Rohin Shah 59 16 0 23 Mar 2023
Reinforcement Learning with Exogenous States and Rewards George Trimponias Thomas G. Dietterich OffRL 65 2 0 22 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality François Ged M. H. Veiga 81 0 0 22 Mar 2023
A multi-functional simulation platform for on-demand ride service operations Siyuan Feng Taijie Chen Yuhao Zhang Jintao Ke Zhengfei Zheng Hai Yang 35 5 0 22 Mar 2023
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning Qiming Ye Yuxiang Feng Jose Javier Escribano Macias M. Stettler Panagiotis Angeloudis 44 4 0 22 Mar 2023
ChatGPT and a New Academic Reality: Artificial Intelligence-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing Brady Lund Ting Wang Nishith Reddy Mannuru Bing Nie S. Shimray Ziang Wang AI4CE 94 529 0 21 Mar 2023
Affective Workload Allocation for Multi-human Multi-robot Teams Wonse Jo Ruiqi Wang B. Yang D. Foti M. Rastgaar Byung-Cheol Min 33 1 0 18 Mar 2023
Comparing NARS and Reinforcement Learning: An Analysis of ONA and $Q$ -Learning Algorithms Ali Beikmohammadi Sindri Magnússon 50 3 0 17 Mar 2023
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP A. Falah Shibashis Guha Ashutosh Trivedi 36 0 0 16 Mar 2023
Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning Junqi Qian Paul Weng Chenmien Tan 71 1 0 16 Mar 2023
MAHTM: A Multi-Agent Framework for Hierarchical Transactive Microgrids Nicolas Mauricio Cuadrado Roberto Gutiérrez Yongli Zhu Martin Takáč 72 2 0 15 Mar 2023
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring Merlijn Krale T. D. Simão N. Jansen OffRL 57 7 0 14 Mar 2023
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications Marco Kemmerling Daniel Lutticke Robert H. Schmitt 74 15 0 14 Mar 2023
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication Payam Parvizi Runnan Zou C. Bellinger R. Cheriton D. Spinello 42 2 0 13 Mar 2023
Kernel Density Bayesian Inverse Reinforcement Learning Aishwarya Mandyam Didong Li Jiayu Yao Diana Cai Andrew Jones Barbara E. Engelhardt OffRL BDL 90 3 0 13 Mar 2023
Synthetic Experience Replay Cong Lu Philip J. Ball Yee Whye Teh Jack Parker-Holder OffRL 170 80 0 12 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning Zhengmao Zhu Yu-Ren Liu Hong Tian Yang Yu Kun Zhang OffRL 53 1 0 09 Mar 2023
On the Benefits of Biophysical Synapses Julian Lemmel Radu Grosu 18 0 0 08 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning Nick Bührer Zhejun Zhang Alexander Liniger Feng Yu Luc Van Gool 67 1 0 07 Mar 2023
Evolutionary Reinforcement Learning: A Survey Hui Bai Ran Cheng Yaochu Jin OffRL 142 56 0 07 Mar 2023
Learning to Backdoor Federated Learning Henger Li Chen Wu Senchun Zhu Zizhan Zheng FedML 82 10 0 06 Mar 2023
Ensemble Reinforcement Learning: A Survey Yanjie Song Ponnuthurai Nagaratnam Suganthan Witold Pedrycz Junwei Ou Yongming He Y. Chen Yutong Wu OffRL 91 41 0 05 Mar 2023
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks Zijian Hu Wei-Ying Ma 39 5 0 04 Mar 2023
CoRL: Environment Creation and Management Focused on System Integration J. D. Merrick Benjamin K. Heiner Cameron Long Brian Stieber Steve Fierro Vardaan Gangal Madison Blake Joshua Blackburn AI4CE 76 2 0 03 Mar 2023
Synthetic Data Generator for Adaptive Interventions in Global Health Aditya Rastogi J. F. Garamendi Ana Fernández del Río Anna Guitart Moiz Hassan Khan Dexian Tang África Periánez 63 0 0 03 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents Nicholas Conlon Nisar R. Ahmed D. Szafir 49 3 0 03 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting Hongyao Tang Hao Fei Jianye Hao 69 1 0 02 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement Petr Bobák Ladislav Čmolík Martin Cadík OffRL 68 3 0 02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms Anirudh Vemula Yuda Song Aarti Singh J. Andrew Bagnell Sanjiban Choudhury OffRL 83 14 0 01 Mar 2023
Policy Dispersion in Non-Markovian Environment B. Qu Xiaofeng Cao Jielong Yang Hechang Chen Chang Yi Ivor W.Tsang Yew-Soon Ong 56 0 0 28 Feb 2023
Taylor TD-learning Michele Garibbo Maxime Robeyns Laurence Aitchison OffRL 58 1 0 27 Feb 2023
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems Sagar Patel Sangeetha Abdu Jyothi Nina Narodytska OffRL 54 0 0 27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement Weiming Qu Tianlin Liu D. Luo 75 2 0 26 Feb 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework C. Cunha Wen Liu T. French Ajmal Mian 71 1 0 26 Feb 2023
DeepCPG Policies for Robot Locomotion Aditya M. Deshpande Eric Hurd A. Minai Manish Kumar 74 9 0 25 Feb 2023
Autonomous Exploration and Mapping for Mobile Robots via Cumulative Curriculum Reinforcement Learning Zehan Li Jinghao Xin Ning Li 67 5 0 25 Feb 2023
EvoTorch: Scalable Evolutionary Computation in Python N. E. Toklu Timothy James Atkinson Vojtvech Micka Paweł Liskowski R. Srivastava 94 14 0 24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods Matt Fellows Matthew Smith Shimon Whiteson OOD AAML 97 8 0 24 Feb 2023
Improving Deep Policy Gradients with Value Function Search Enrico Marchesini Chris Amato 49 9 0 20 Feb 2023