OpenAI Gym

5 June 2016

Papers citing "OpenAI Gym"

50 / 2,578 papers shown

Title
Generalized Hindsight for Reinforcement Learning Alexander C. Li Lerrel Pinto Pieter Abbeel 67 70 0 26 Feb 2020
Whole-Body Control of a Mobile Manipulator using End-to-End Reinforcement Learning Julien Kindle Fadri Furrer Tonci Novkovic Jen Jen Chung Roland Siegwart Juan I. Nieto OffRL 96 30 0 25 Feb 2020
TanksWorld: A Multi-Agent Environment for AI Safety Research Corban G. Rivera Olivia Lyons Arielle Summitt Ayman Fatima J. Pak ... R. Chalmers Aryeh Englander Edward W. Staley I. Wang Ashley J. Llorens 23 2 0 25 Feb 2020
Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization Aritz D. Martinez E. Osaba Javier Del Ser Francisco Herrera 71 10 0 25 Feb 2020
Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration Hoang Trung-Dung Yitao Liang Guy Van den Broeck OffRL 71 3 0 25 Feb 2020
CybORG: An Autonomous Cyber Operations Research Gym Callum Baillie Maxwell Standen Jonathon Schwartz Michael Docking David Bowman Junae Kim 42 30 0 25 Feb 2020
Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach Subin Huh Insoon Yang 76 20 0 24 Feb 2020
Behavior Cloning in OpenAI using Case Based Reasoning C. Peters B. Esfandiari Mohamad Zalat Robert West OffRL 17 0 0 23 Feb 2020
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients Ashley D. Edwards Himanshu Sahni Rosanne Liu Jane Hung Ankit Jain Rui Wang Adrien Ecoffet Thomas Miconi Charles Isbell J. Yosinski OffRL 48 18 0 21 Feb 2020
On the Search for Feedback in Reinforcement Learning Ran A. Wang Karthikeya S. Parunandi Aayushman Sharma R. Goyal S. Chakravorty 53 9 0 21 Feb 2020
Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution Strategy Jiaxing Zhang Hoang Tran Guannan Zhang 68 9 0 21 Feb 2020
Support-weighted Adversarial Imitation Learning Ruohan Wang C. Ciliberto P. Amadori Y. Demiris 39 4 0 20 Feb 2020
Adaptive Temporal Difference Learning with Linear Function Approximation Tao Sun Han Shen Tianyi Chen Dongsheng Li 77 23 0 20 Feb 2020
Using AI for Mitigating the Impact of Network Delay in Cloud-based Intelligent Traffic Signal Control Rusheng Zhang Xinze Zhou Ozan K. Tonguz 36 1 0 19 Feb 2020
Informative Path Planning for Mobile Sensing with Reinforcement Learning Yongyong Wei Rong Zheng 81 34 0 18 Feb 2020
Adaptive Estimator Selection for Off-Policy Evaluation Yi-Hsun Su Pavithra Srinath A. Krishnamurthy OffRL 62 48 0 18 Feb 2020
DISCO: Double Likelihood-free Inference Stochastic Control Lucas Barcelos Rafael Oliveira Rafael Possas Lionel Ott Fabio Ramos 33 11 0 18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking Shirli Di-Castro Shashua Shie Mannor OffRL 50 12 0 17 Feb 2020
Adaptive Experience Selection for Policy Gradient S. Mohamad Giovanni Montana 104 0 0 17 Feb 2020
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning Yannick Schroecker Charles Isbell OffRL 88 13 0 15 Feb 2020
PDDLGym: Gym Environments from PDDL Problems Tom Silver Rohan Chitnis AI4CE 105 57 0 15 Feb 2020
Applying Depth-Sensing to Automated Surgical Manipulation with a da Vinci Robot M. Hwang Daniel Seita Brijen Thananjeyan Jeffrey Ichnowski Samuel Paradis Danyal Fer Thomas Low Ken Goldberg 125 31 0 15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics Parameswaran Kamalaruban Yu-ting Huang Ya-Ping Hsieh Paul Rolland C. Shi Volkan Cevher 103 61 0 14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning Sammy Christen Lukás Jendele Emre Aksan Otmar Hilliges OffRL 78 25 0 14 Feb 2020
A Framework for End-to-End Learning on Semantic Tree-Structured Data William Woof Ke Chen 40 3 0 13 Feb 2020
XCS Classifier System with Experience Replay Anthony Stein Roland Maier Lukas Rosenbauer J. Hähner BDL 35 21 0 13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription Olivier Francon Santiago Gonzalez Babak Hodjat Elliot Meyerson Risto Miikkulainen Xin Qiu Hormoz Shahrzad 80 17 0 13 Feb 2020
Learning to Generate Levels From Nothing Philip Bontrager Julian Togelius GAN 61 22 0 12 Feb 2020
Multi-task Reinforcement Learning with a Planning Quasi-Metric Vincent Micheli Karthigan Sinnathamby Franccois Fleuret 70 2 0 08 Feb 2020
Capsule Network Performance with Autonomous Navigation Tom Molnar Eugenio Culurciello 3DPC 25 2 0 08 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization Milovs Nikolić G. B. Hacene Ciaran Bannon Alberto Delmas Lascorz Matthieu Courbariaux Yoshua Bengio Vincent Gripon Andreas Moshovos MQ 66 25 0 08 Feb 2020
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces Bogdan Mazoure T. Doan Tianyu Li V. Makarenkov Joelle Pineau Doina Precup Guillaume Rabusseau OffRL 67 1 0 07 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP) Zhimin Hou Kuangen Zhang Yi Wan Dongyu Li Chenglong Fu Haoyong Yu 103 15 0 07 Feb 2020
Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning Sha Luo Hamidreza Kasaei Lambert Schomaker CLL 92 46 0 07 Feb 2020
Ready Policy One: World Building Through Active Learning Philip J. Ball Jack Parker-Holder Aldo Pacchiano K. Choromanski Stephen J. Roberts OffRL 92 49 0 07 Feb 2020
Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning Rui Zhao Yang Gao Pieter Abbeel Volker Tresp Wenyuan Xu SSL 57 4 0 05 Feb 2020
Deep Radial-Basis Value Functions for Continuous Control Kavosh Asadi Neev Parikh Ronald E. Parr George Konidaris Michael L. Littman 44 4 0 05 Feb 2020
Learning rewards for robotic ultrasound scanning using probabilistic temporal ranking Michael G. Burke Katie Lu Daniel Angelov Artūras Straižys Craig Innes Kartic Subr S. Ramamoorthy 54 11 0 04 Feb 2020
Effective Diversity in Population Based Reinforcement Learning Jack Parker-Holder Aldo Pacchiano K. Choromanski Stephen J. Roberts 130 165 0 03 Feb 2020
Evolving Neural Networks through a Reverse Encoding Tree Haoling Zhang Chao-Han Huck Yang Hector Zenil N. Kiani Yue-Hong Shen Jesper N. Tegnér 55 5 0 03 Feb 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values Shangtong Zhang Bo Liu Shimon Whiteson OffRL 116 103 0 29 Jan 2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions Samuele Tosatto R. Akrour Jan Peters 64 4 0 29 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization Chang Ye Ahmed Khalifa Philip Bontrager Julian Togelius 93 38 0 27 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning Inaam Ilahi Muhammad Usama Junaid Qadir M. Janjua Ala I. Al-Fuqaha D. Hoang Dusit Niyato AAML 147 137 0 27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning Ahmed Khalifa Philip Bontrager Sam Earle Julian Togelius 73 146 0 24 Jan 2020
On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning Ameya Pore G. Aragon-Camarasa 53 11 0 22 Jan 2020
Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO Mario S. Holubar M. Wiering 50 10 0 15 Jan 2020
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning Dor Livne Kobi Cohen 61 52 0 14 Jan 2020
Multi-Robot Formation Control Using Reinforcement Learning Abhay Rawat K. Karlapalem 36 4 0 13 Jan 2020
Improving Image Autoencoder Embeddings with Perceptual Loss G. Pihlgren Fredrik Sandin Marcus Liwicki 72 34 0 10 Jan 2020