Challenges of Real-World Reinforcement Learning

29 April 2019

Papers citing "Challenges of Real-World Reinforcement Learning"

50 / 108 papers shown

Title
ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation and Application to Financial Markets Selim Amrouni Aymeric Moulin Jared Vann Svitlana Vyetrenko T. Balch Manuela Veloso AI4CE 26 42 0 27 Oct 2021
Learning Robust Controllers Via Probabilistic Model-Based Policy Search V. Charvet B. S. Jensen R. Murray-Smith 19 2 0 26 Oct 2021
GrowSpace: Learning How to Shape Plants Yasmeen Hitti Ionelia Buzatu Manuel Del Verme M. Lefsrud Florian Golemo A. Durand 19 2 0 15 Oct 2021
Correct Me if I am Wrong: Interactive Learning for Robotic Manipulation Eugenio Chisari Tim Welschehold Joschka Boedecker Wolfram Burgard Abhinav Valada 19 37 0 07 Oct 2021
Adaptive control of a mechatronic system using constrained residual reinforcement learning Tom Staessens Tom Lefebvre Guillaume Crevecoeur 19 16 0 06 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom Jihoon Kweon Kyunghwan Kim Chaehyuk Lee Hwi Kwon Jinwoo Park ... Inwook Back J. Roh Y. Moon Jaesoon Choi Young-Hak Kim OnRL 18 33 0 05 Oct 2021
Deep Reinforcement Learning with Adjustments H. Khorasgani Haiyan Wang Chetan Gupta Susumu Serita 18 2 0 28 Sep 2021
Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving Gunmin Lee Wooseok Oh Seungyoung Shin Dohyeong Kim Jeongwoo Oh Jaeyeon Jeong Sungjoon Choi Songhwai Oh SSL 33 2 0 23 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural Language P. Osborne Heido Nomm André Freitas AI4CE 32 24 0 20 Sep 2021
Learning Robot Swarm Tactics over Complex Adversarial Environments A. Behjat Hemanth Manjunatha Prajit KrisshnaKumar Apurv Jani Leighton Collins ... Joseph P. Distefano David Doermann Karthik Dantu Ehsan Esfahani Souma Chowdhury 6 11 0 13 Sep 2021
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems Raphael Lamprecht Ferdinand Wurst Marco F. Huber 14 3 0 27 Aug 2021
Accelerating the Learning of TAMER with Counterfactual Explanations Jakob Karalus F. Lindner OffRL 29 4 0 03 Aug 2021
The Benchmark Lottery Mostafa Dehghani Yi Tay A. Gritsenko Zhe Zhao N. Houlsby Fernando Diaz Donald Metzler Oriol Vinyals 42 89 0 14 Jul 2021
RRL: Resnet as representation for Reinforcement Learning Rutav Shah Vikash Kumar OffRL 30 111 0 07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research J. Luis E. Crawley B. Cameron OffRL 25 6 0 07 Jul 2021
Unsupervised Skill Discovery with Bottleneck Option Learning Jaekyeom Kim Seohong Park Gunhee Kim 32 32 0 27 Jun 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs Tao-Wen Liu Ruida Zhou D. Kalathil P. R. Kumar Chao Tian 29 78 0 04 Jun 2021
Universal Off-Policy Evaluation Yash Chandak S. Niekum Bruno C. da Silva Erik Learned-Miller Emma Brunskill Philip S. Thomas OffRL ELM 32 52 0 26 Apr 2021
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks Omid Esrafilian Harald Bayerlein David Gesbert 16 6 0 21 Apr 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow John Mcleod Hrvoje Stojić Vincent Adam Dongho Kim Jordi Grau-Moya Peter Vrancx Felix Leibfried OffRL 21 2 0 26 Mar 2021
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study Jianlan Luo Oleg O. Sushkov Rugile Pevceviciute Wenzhao Lian Chang Su Mel Vecerík Ning Ye S. Schaal Jonathan Scholz OffRL 27 60 0 21 Mar 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning Sebastian Curi Ilija Bogunovic Andreas Krause 39 17 0 18 Mar 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems Martin Mladenov Chih-Wei Hsu Vihan Jain Eugene Ie Christopher Colby Nicolas Mayoraz H. Pham Dustin Tran Ivan Vendrov Craig Boutilier BDL 15 31 0 14 Mar 2021
Gym-ANM: Reinforcement Learning Environments for Active Network Management Tasks in Electricity Distribution Systems Robin Henry D. Ernst 21 34 0 14 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning Robert Dadashi Shideh Rezaeifar Nino Vieillard Léonard Hussenot Olivier Pietquin M. Geist OffRL 33 40 0 02 Mar 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning Juhyoung Lee Sangyeob Kim Sangjin Kim Wooyoung Jo H. Yoo OffRL 21 9 0 24 Jan 2021
Social NCE: Contrastive Learning of Socially-aware Motion Representations Yuejiang Liu Qi Yan Alexandre Alahi 29 101 0 21 Dec 2020
Resonance: Replacing Software Constants with Context-Aware Models in Real-time Communication J. Gupchup A. Aazami Yaran Fan Senja Filipi Tom Finley ... D. Perednya Sriram Srinivasan John Langford Ross Cutler J. Gehrke OffRL 19 1 0 23 Nov 2020
Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning Weixia Zhang Chao Ma Qi Wu Xiaokang Yang 39 44 0 22 Nov 2020
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning Anurag Ajay Aviral Kumar Pulkit Agrawal Sergey Levine Ofir Nachum OffRL OnRL 34 155 0 26 Oct 2020
Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning Harald Bayerlein Mirco Theile Marco Caccamo David Gesbert 29 120 0 23 Oct 2020
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing A. Delarue Ross Anderson Christian Tjandraatmadja 35 93 0 22 Oct 2020
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification D. Mankowitz D. A. Calian Rae Jeong Cosmin Paduraru N. Heess Sumanth Dathathri Martin Riedmiller Timothy A. Mann 24 11 0 20 Oct 2020
Artificial Intelligence for UAV-enabled Wireless Networks: A Survey Mohamed-Amine Lahmeri Mustafa A. Kishk Mohamed-Slim Alouini 26 102 0 24 Sep 2020
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning Adam Bignold Francisco Cruz Richard Dazeley Peter Vamplew Cameron Foale 22 18 0 21 Sep 2020
QPLEX: Duplex Dueling Multi-Agent Q-Learning Jianhao Wang Zhizhou Ren Terry Liu Yang Yu Chongjie Zhang OffRL 51 437 0 03 Aug 2020
Probabilistic Active Meta-Learning Jean Kaddour Steindór Sæmundsson M. Deisenroth 22 34 0 17 Jul 2020
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach Harald Bayerlein Mirco Theile Marco Caccamo David Gesbert 18 54 0 01 Jul 2020
Critic Regularized Regression Ziyun Wang Alexander Novikov Konrad Zolna Jost Tobias Springenberg Scott E. Reed ... Noah Y. Siegel J. Merel Çağlar Gülçehre N. Heess Nando de Freitas OffRL 36 317 0 26 Jun 2020
Learning to Play Table Tennis From Scratch using Muscular Robots Le Chen Simon Guist Roberto Calandra V. Berenz Bernhard Schölkopf Jan Peters 11 88 0 10 Jun 2020
Off-policy Learning for Remote Electrical Tilt Optimization Filippo Vannella Jaeseong Jeong Alexandre Proutière OffRL 14 14 0 21 May 2020
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments Sindhu Padakandla 25 144 0 19 May 2020
Optimizing for the Future in Non-Stationary MDPs Yash Chandak Georgios Theocharous Shiv Shankar Martha White Sridhar Mahadevan Philip S. Thomas OffRL 13 65 0 17 May 2020
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling Tegg Taekyong Sung J. Ha Jeewoo Kim Alex Yahja Chae-Bong Sohn Bo Ryu 21 9 0 15 May 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents Richard Meyes Moritz Schneider Tobias Meisen 28 2 0 07 Apr 2020
ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing M. Akbulut Erhan Öztop M. Yunus Seker Y. Nagai Ahmet E. Tekden Emre Ugur 14 2 0 25 Mar 2020
An empirical investigation of the challenges of real-world reinforcement learning Gabriel Dulac-Arnold Nir Levine D. Mankowitz Jerry Li Cosmin Paduraru Sven Gowal Todd Hester OffRL 34 120 0 24 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization Dongsheng Ding Xiaohan Wei Zhuoran Yang Zhaoran Wang M. Jovanović 20 159 0 01 Mar 2020
Learning in Markov Decision Processes under Constraints Rahul Singh Abhishek Gupta Ness B. Shroff 41 27 0 27 Feb 2020
Scalable Multi-Task Imitation Learning with Autonomous Improvement Avi Singh Eric Jang A. Irpan Daniel Kappler Murtaza Dalal Sergey Levine Mohi Khansari Chelsea Finn 48 35 0 25 Feb 2020