OpenAI Gym

5 June 2016

Papers citing "OpenAI Gym"

50 / 2,578 papers shown

Title
μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning Technique for Service Offloading in Fog computing Environments M. Goudarzi M. A. Rodriguez Majid Sarvi Rajkumar Buyya OffRL 79 3 0 13 Oct 2023
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations Lu Li Yuxin Pan Ruobing Chen Jie Liu Zilin Wang Yu Liu Zhiheng Li 125 0 0 13 Oct 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Jingkang Yang Yuhao Dong Shuai Liu Yue Liu Ziyue Wang ... Haoran Tan Jiamu Kang Yuanhan Zhang Kaiyang Zhou Ziwei Liu LM&Ro 89 49 0 12 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios Yazhe Niu Yuan Pu Zhenjie Yang Xueyan Li Tong Zhou Jiyuan Ren Shuai Hu Hongsheng Li Yu Liu 139 15 0 12 Oct 2023
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples Hao Sun Alihan Huyuk Daniel Jarrett M. Schaar OffRL 113 8 0 11 Oct 2023
RANS: Highly-Parallelised Simulator for Reinforcement Learning based Autonomous Navigating Spacecrafts Matteo El Hariry Antoine Richard Miguel Olivares-Mendez 74 4 0 11 Oct 2023
Imitation Learning from Purified Demonstration Yunke Wang Minjing Dong Bo Du Chang Xu 68 1 0 11 Oct 2023
RoboHive: A Unified Framework for Robot Learning Vikash Kumar Rutav Shah Gaoyue Zhou Vincent Moens Vittorio Caggiano Jay Vakil Abhishek Gupta Aravind Rajeswaran 67 25 0 10 Oct 2023
Realizing Stabilized Landing for Computation-Limited Reusable Rockets: A Quantum Reinforcement Learning Approach Gyusun Kim Jaehyun Chung Soohyun Park 51 8 0 10 Oct 2023
Initial Task Assignment in Multi-Human Multi-Robot Teams: An Attention-enhanced Hierarchical Reinforcement Learning Approach Ruiqi Wang Dezhong Zhao Arjun Gupte Byung-Cheol Min 40 1 0 08 Oct 2023
Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robots Samuel Schmidgall Axel Krieger Jason K. Eshraghian OOD 88 16 0 07 Oct 2023
Searching for Optimal Runtime Assurance via Reachability and Reinforcement Learning Kristina Miller Christopher K. Zeitler William Shen Kerianne L. Hobbs Sayan Mitra John Schierman Mahesh Viswanathan 49 0 0 06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison Moritz Lange Noah Krystiniak Raphael C. Engelhardt Wolfgang Konen Laurenz Wiskott OffRL 59 1 0 06 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement Learning Hanlin Zhu Baihe Huang Stuart Russell OffRL 76 4 0 03 Oct 2023
Imitation Learning from Observation through Optimal Transport Wei-Di Chang Scott Fujimoto David Meger Gregory Dudek 61 4 0 02 Oct 2023
Accurate Simulation and Parameter Identification of Deformable Linear Objects using Discrete Elastic Rods in Generalized Coordinates Qi Jing Chen Timothy Bretl AI4CE 48 0 0 02 Oct 2023
Optimizing with Low Budgets: a Comparison on the Black-box Optimization Benchmarking Suite and OpenAI Gym Elena Raponi Nathanaël Carraz Rakotonirina Jérémy Rapin Carola Doerr O. Teytaud 106 6 0 29 Sep 2023
HyperPPO: A scalable method for finding small policies for robotic control Luming Tang Zhehui Huang Gaurav Sukhatme 65 4 0 28 Sep 2023
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture Zixuan Chen Ze Ji Shuyang Liu Jing Huo Yiyu Chen Yang Gao 54 1 0 28 Sep 2023
Stackelberg Batch Policy Learning Wenzhuo Zhou Annie Qu OffRL 78 1 0 28 Sep 2023
Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation Zihang Wang Maowei Jiang AI4CE 78 0 0 25 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework Wenzhuo Zhou Yuhan Li Ruoqing Zhu Annie Qu OffRL 83 5 0 23 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization Hai Zhang Hang Yu Junqiao Zhao Di Zhang Chang Huang Hongtu Zhou Xiao Zhang Chen Ye 87 10 0 22 Sep 2023
Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor Robin Göransson Volker Krueger 29 0 0 22 Sep 2023
Trip Planning for Autonomous Vehicles with Wireless Data Transfer Needs Using Reinforcement Learning Yousef AlSaqabi Bhaskar Krishnamachari 55 2 0 21 Sep 2023
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding Devleena Das Sonia Chernova Been Kim LRM LLMAG 114 24 0 21 Sep 2023
Learning to Recover for Safe Reinforcement Learning Haoyu Wang Xin Yuan Qinqing Ren 56 0 0 21 Sep 2023
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling Wenjun Huang Yunduan Cui Huiyun Li Xin Wu MU 121 0 0 20 Sep 2023
Monte-Carlo tree search with uncertainty propagation via optimal transport Tuan Dam Pascal Stenger Lukas Schneider Joni Pajarinen Carlo DÉramo Odalric-Ambrym Maillard 46 1 0 19 Sep 2023
gym-saturation: Gymnasium environments for saturation provers (System description) Boris Shminke 71 1 0 16 Sep 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning Xiao-Yin Liu Xiao-Hu Zhou Xiaoliang Xie Shiqi Liu Zhen-Qiu Feng Hao Li Mei-Jiang Gui Tian-Yu Xiang De-Xing Huang Zeng-Guang Hou OffRL OOD 84 5 0 16 Sep 2023
Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing Hadar Szostak Kobi Cohen 59 4 0 14 Sep 2023
Stable In-hand Manipulation with Finger Specific Multi-agent Shadow Reward Lingfeng Tao Jiucai Zhang Xiaoli Zhang 56 0 0 13 Sep 2023
Investigating the Impact of Action Representations in Policy Gradient Algorithms Jan Schneider-Barnes Pierre Schumacher Daniel Haeufle Bernhard Scholkopf Le Chen OffRL 41 2 0 13 Sep 2023
Attention Loss Adjusted Prioritized Experience Replay Zhuoying Chen Huiping Li Rizhong Wang 53 2 0 13 Sep 2023
Fitness Approximation through Machine Learning Itai Tzruia Tomer Halperin Moshe Sipper Achiya Elyasaf 45 2 0 06 Sep 2023
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning L. Du Min Chen Mingyang Sun Shouling Ji Peng Cheng Jiming Chen Zhikun Zhang OffRL 101 9 0 06 Sep 2023
Representation Learning for Sequential Volumetric Design Tasks Md Ferdous Alam Yi Wang Linh Tran Chin-Yi Cheng Jieliang Luo 3DV 91 2 0 05 Sep 2023
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces Shyam Sundhar Ramesh Pier Giuseppe Sessa Yifan Hu Andreas Krause Ilija Bogunovic OOD 78 12 0 05 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation Pulkit Katdare Nan Jiang Katherine Driggs-Campbell OffRL 89 4 0 04 Sep 2023
Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning Qisen Yang Huanqian Wang Mukun Tong Wenjie Shi Gao Huang Shiji Song 72 5 0 04 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance Qisen Yang Shenzhi Wang Qihang Zhang Gao Huang Shiji Song OffRL OnRL 79 8 0 04 Sep 2023
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization Uri Gadot E. Derman Navdeep Kumar Maxence Mohamed Elfatihi Kfir Y. Levy Shie Mannor 76 7 0 03 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey Kamal Acharya Waleed Raza Carlos Dourado Alvaro Velasquez Houbing Song NAI OffRL 90 17 0 02 Sep 2023
Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles Yuhang Yang Kalle Kujanpää Amin Babadi Joni Pajarinen Alexander Ilin 69 3 0 01 Sep 2023
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning M. Rigaki Sebastian Garcia AAML 53 4 0 31 Aug 2023
DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving Yinda Xu Lidong Yu 64 7 0 30 Aug 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics Zexin Li Aritra Samanta Yufei Li Andrea Soltoggio Hyoseung Kim Cong Liu 113 7 0 29 Aug 2023
Target-independent XLA optimization using Reinforcement Learning Milan Ganai Haichen Li Theodore Enns Yida Wang Randy Huang 74 0 0 28 Aug 2023
Distributionally Robust Statistical Verification with Imprecise Neural Networks Souradeep Dutta Michele Caprio Vivian Lin Matthew Cleaveland Kuk Jin Jang I. Ruchkin O. Sokolsky Insup Lee OOD AAML 217 8 0 28 Aug 2023