OpenAI Gym

5 June 2016

Papers citing "OpenAI Gym"

50 / 2,578 papers shown

Multi-Agent Trust Region Policy Optimization Hepeng Li Haibo He 106 42 0 15 Oct 2020
Deep Learning of Koopman Representation for Control Yiqiang Han Wenjian Hao Umesh Vaidya AI4CE 57 110 0 15 Oct 2020
Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards Zhixin Chen Mengxiang Lin 58 6 0 14 Oct 2020
Efficient Wasserstein Natural Gradients for Reinforcement Learning Theodore H. Moskovitz Michael Arbel Ferenc Huszár Arthur Gretton 72 21 0 12 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models Cédric Colas B. Hejblum S. Rouillon R. Thiébaut Pierre-Yves Oudeyer Clément Moulin-Frier M. Prague 61 22 0 09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual Variance Yannis Flet-Berliac Reda Ouhamma Odalric-Ambrym Maillard Philippe Preux OffRL 72 1 0 09 Oct 2020
Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning Daniele Reda Tianxin Tao M. van de Panne AI4CE 109 53 0 09 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning Ossama Ahmed Frederik Trauble Anirudh Goyal Alexander Neitz Yoshua Bengio Bernhard Schölkopf M. Wuthrich Stefan Bauer CML 118 123 0 08 Oct 2020
A novel control mode of bionic morphing tail based on deep reinforcement learning Liming Zheng Zhou Zhou Peng Sun Zhilin Zhang Rui Wang AI4CE 36 1 0 08 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning Hassam Sheikh Shauharda Khadka Santiago Miret Somdeb Majumdar OffRL 69 7 0 08 Oct 2020
Proximal Policy Optimization with Relative Pearson Divergence Taisuke Kobayashi 43 17 0 07 Oct 2020
Reinforcement Learning with Random Delays Simon Ramstedt Yann Bouteiller Giovanni Beltrame C. Pal Jonathan Binas 227 61 0 06 Oct 2020
Learning Diverse Options via InfoMax Termination Critic Yuji Kanagawa Tomoyuki Kaneko 64 1 0 06 Oct 2020
Active Feature Acquisition with Generative Surrogate Models Yang Li Junier B. Oliva RALM TPM 72 38 0 06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning Rodrigo Toro Icarte Toryn Q. Klassen Richard Valenzano Sheila A. McIlraith OffRL 152 222 0 06 Oct 2020
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games Shengyi Huang Santiago Ontañón 63 10 0 05 Oct 2020
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms Shangtong Zhang Romain Laroche H. V. Seijen Shimon Whiteson Rémi Tachet des Combes 122 15 0 02 Oct 2020
MADRaS : Multi Agent Driving Simulator Anirban Santara S. Rudra Sree Aditya Buridi Meha Kaushik A. Naik Bharat Kaul Balaraman Ravindran 72 30 0 02 Oct 2020
Deep Reinforcement Learning with Mixed Convolutional Network Yanyu Zhang SSL 10 2 0 01 Oct 2020
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive Control Rel Guzman Rafael Oliveira F. Ramos 97 15 0 01 Oct 2020
Deep Reinforcement Learning for Efficient Measurement of Quantum Devices Vu-Linh Nguyen S. B. Orbell D. Lennon H. Moon F. Vigneau ... D. Zumbuhl G. Briggs Michael A. Osborne D. Sejdinovic N. Ares 55 41 0 30 Sep 2020
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces Marlesson R. O. Santana Luckeciano C. Melo Fernando H. F. Camargo Bruno Brandão Anderson Soares Renan M. Oliveira Sandor Caetano OffRL 48 15 0 30 Sep 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network Xing Wang A. Vinel 40 0 0 29 Sep 2020
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning Haotian Fu Hongyao Tang Jianye Hao Chong Chen Xidong Feng Dong Li Wulong Liu OffRL 83 50 0 29 Sep 2020
Cross Learning in Deep Q-Networks Xing Wang A. Vinel 25 2 0 29 Sep 2020
Novelty Search in Representational Space for Sample Efficient Exploration Ruo Yu Tao Vincent François-Lavet Joelle Pineau 90 45 0 28 Sep 2020
Agent Environment Cycle Games J. K. Terry Nathaniel Grammel Benjamin Black Ananth Hari Caroline Horsch L. Santos 65 7 0 28 Sep 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey Wenshuai Zhao Jorge Peña Queralta Tomi Westerlund OffRL 256 743 0 24 Sep 2020
Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle in Virtual Open Space with Static Obstacles Sanghyun Kim Jongmin Park Jae-Kwan Yun Jiwon Seo 28 17 0 24 Sep 2020
The Agent Web Model -- Modelling web hacking for reinforcement learning L. Erdődi Fabio Massimo Zennaro 24 3 0 23 Sep 2020
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn Anjukan Kathirgamanathan Kacper Twardowski E. Mangina D. Finn 63 21 0 22 Sep 2020
Learning Task-Agnostic Action Spaces for Movement Optimization Amin Babadi M. van de Panne Caren Liu Perttu Hämäläinen 51 2 0 22 Sep 2020
CMAX++ : Leveraging Experience in Planning and Execution using Inaccurate Models Anirudh Vemula J. Andrew Bagnell Maxim Likhachev 88 9 0 21 Sep 2020
Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization Johannes Dornheim L. Morand Samuel Zeitvogel Tarek Iraki Norbert Link Dirk Helm 58 21 0 21 Sep 2020
RL STaR Platform: Reinforcement Learning for Simulation based Training of Robots Tamir Blum Gabin Paillet Mickaël Laîné Kazuya Yoshida 43 7 0 21 Sep 2020
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization Feng Tao Yongcan Cao 99 2 0 21 Sep 2020
Multiplayer Support for the Arcade Learning Environment J. K. Terry Benjamin Black Luis Santos 74 13 0 20 Sep 2020
Measuring the Complexity of Domains Used to Evaluate AI Systems Christopher Pereyda Lawrence Holder 25 3 0 18 Sep 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic Lin Shao Yifan You Mengyuan Yan Qingyun Sun Jeannette Bohg 89 24 0 18 Sep 2020
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents Nicola Milano S. Nolfi 46 0 0 15 Sep 2020
Extended Radial Basis Function Controller for Reinforcement Learning Nicholas Capel Naifu Zhang 35 1 0 12 Sep 2020
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning M. Berk Mirza Andrew Jaegle Jonathan J. Hunt A. Guez S. Tunyasuvunakool ... Peter Karkus S. Racanière Lars Buesing Timothy Lillicrap N. Heess AI4CE 79 12 0 11 Sep 2020
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments Tom Bewley J. Lawry FAtt 74 27 0 10 Sep 2020
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning M. Arango Lyudmil Pelov 57 17 0 10 Sep 2020
Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents J. Tani Andrea F. Daniele Gianmarco Bernasconi Amaury Camus Aleksandar Petrov ... Tomasz Zaluska Matthew R. Walter Emilio Frazzoli Liam Paull A. Censi 50 8 0 09 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control V. M. Alvarez R. Rosca Cristian G. Falcutescu 63 11 0 09 Sep 2020
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization Recep Yusuf Bekci M. Gümüş 29 4 0 04 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning Sheng-Chun Kao Geonhwa Jeong T. Krishna 111 96 0 04 Sep 2020
SEDRo: A Simulated Environment for Developmental Robotics Aishwarya Pothula Md Ashaduzzaman Rubel Mondol Sanath Narasimhan Sm Mazharul Islam Deokgun Park 36 5 0 03 Sep 2020
Adaptive Risk Sensitive Model Predictive Control with Stochastic Search Ziyi Wang Oswin So Keuntaek Lee Camilo A. Duarte Evangelos A. Theodorou 61 2 0 02 Sep 2020