Optimizing for the Future in Non-Stationary MDPs

17 May 2020

Georgios Theocharous

Sridhar Mahadevan

Philip S. Thomas

Papers citing "Optimizing for the Future in Non-Stationary MDPs"

Title | Authors | Tags | Counts | Date
Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information | Ziyi Zhang, Yorie Nakahira, Guannan Qu | | 36 1 0 | 13 Sep 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning | Hyunin Lee, Ming Jin, Javad Lavaei, Somayeh Sojoudi | OffRL | 37 2 0 | 25 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine Learning | Ming Jin | | 38 2 0 | 18 May 2024
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments | Vincent Liu, Yash Chandak, Philip S. Thomas, Martha White | OffRL | 19 0 0 | 23 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Pouya Hamadanian, Arash Nasr-Esfahany, Malte Schwarzkopf, Siddartha Sen, MohammadIman Alizadeh | CLL, OffRL | 50 0 0 | 04 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments | Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskill, Philip S. Thomas | OffRL | 44 6 0 | 24 Jan 2023
General policy mapping: online continual reinforcement learning inspired on the insect brain | A. Yanguas-Gil, Sandeep Madireddy | CLL, OnRL | 16 0 0 | 30 Nov 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning | C. Steinparz, Thomas Schmied, Fabian Paischer, Marius-Constantin Dinu, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbalzadeh, Sepp Hochreiter | CLL | 24 11 0 | 12 Jul 2022
Factored Adaptation for Non-Stationary Reinforcement Learning | Fan Feng, Erdun Gao, Kun Zhang, Sara Magliacane | CML, OffRL | 45 32 0 | 30 Mar 2022
Review of Metrics to Measure the Stability, Robustness and Resilience of Reinforcement Learning | L. Pullum | | 13 2 0 | 22 Mar 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective | Ryan K. Tan, Yang Liu, Lei Xie | | 37 2 0 | 21 Jan 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking | Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn | OffRL | 44 26 0 | 17 Dec 2021
Continual Learning In Environments With Polynomial Mixing Times | Matthew D Riemer, Sharath Chandra Raparthy, Ignacio Cases, G. Subbaraj, M. P. Touzel, Irina Rish | CLL | 41 8 0 | 13 Dec 2021
ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision Medicine | Ilker Demirel, Ahmet Çelik, Cem Tekin | | 31 4 0 | 26 Nov 2021
Universal Off-Policy Evaluation | Yash Chandak, S. Niekum, Bruno C. da Silva, Erik Learned-Miller, Emma Brunskill, Philip S. Thomas | OffRL, ELM | 32 52 0 | 26 Apr 2021
Towards Safe Policy Improvement for Non-Stationary MDPs | Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas | OffRL | 71 33 0 | 23 Oct 2020
An Optimistic Acceleration of AMSGrad for Nonconvex Optimization | Jun-Kun Wang, Xiaoyun Li, Belhal Karimi, Ping Li | ODL | 18 1 0 | 04 Mar 2019