ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.08158
  4. Cited By
Optimizing for the Future in Non-Stationary MDPs

Optimizing for the Future in Non-Stationary MDPs

17 May 2020
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
    OffRL
ArXivPDFHTML

Papers citing "Optimizing for the Future in Non-Stationary MDPs"

17 / 17 papers shown
Title
Predictive Control and Regret Analysis of Non-Stationary MDP with
  Look-ahead Information
Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Ziyi Zhang
Yorie Nakahira
Guannan Qu
36
1
0
13 Sep 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
37
2
0
25 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine
  Learning
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
38
2
0
18 May 2024
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old
  Data in Nonstationary Environments
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Vincent Liu
Yash Chandak
Philip S. Thomas
Martha White
OffRL
19
0
0
23 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
50
0
0
04 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
44
6
0
24 Jan 2023
General policy mapping: online continual reinforcement learning inspired
  on the insect brain
General policy mapping: online continual reinforcement learning inspired on the insect brain
A. Yanguas-Gil
Sandeep Madireddy
CLL
OnRL
16
0
0
30 Nov 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Erdun Gao
Kun Zhang
Sara Magliacane
CML
OffRL
45
32
0
30 Mar 2022
Review of Metrics to Measure the Stability, Robustness and Resilience of
  Reinforcement Learning
Review of Metrics to Measure the Stability, Robustness and Resilience of Reinforcement Learning
L. Pullum
13
2
0
22 Mar 2022
Reinforcement Learning for Personalized Drug Discovery and Design for
  Complex Diseases: A Systems Pharmacology Perspective
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
37
2
0
21 Jan 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma
Kelvin Xu
Nikhil Sardana
Abhishek Gupta
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
44
26
0
17 Dec 2021
Continual Learning In Environments With Polynomial Mixing Times
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
41
8
0
13 Dec 2021
ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision
  Medicine
ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision Medicine
Ilker Demirel
Ahmet Çelik
Cem Tekin
31
4
0
26 Nov 2021
Universal Off-Policy Evaluation
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
32
52
0
26 Apr 2021
Towards Safe Policy Improvement for Non-Stationary MDPs
Towards Safe Policy Improvement for Non-Stationary MDPs
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
OffRL
71
33
0
23 Oct 2020
An Optimistic Acceleration of AMSGrad for Nonconvex Optimization
An Optimistic Acceleration of AMSGrad for Nonconvex Optimization
Jun-Kun Wang
Xiaoyun Li
Belhal Karimi
Ping Li
ODL
18
1
0
04 Mar 2019
1