ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.10090
  4. Cited By
Non-Stationary Markov Decision Processes, a Worst-Case Approach using
  Model-Based Reinforcement Learning, Extended version

Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning, Extended version

22 April 2019
Erwan Lecarpentier
Emmanuel Rachelson
ArXivPDFHTML

Papers citing "Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning, Extended version"

19 / 19 papers shown
Title
Tolerance of Reinforcement Learning Controllers against Deviations in
  Cyber Physical Systems
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
42
0
0
24 Jun 2024
Decision Making in Non-Stationary Environments with Policy-Augmented
  Search
Decision Making in Non-Stationary Environments with Policy-Augmented Search
Ava Pettet
Yunuo Zhang
Baiting Luo
Kyle Wray
Hendrik Baier
Aron Laszka
Abhishek Dubey
Ayan Mukhopadhyay
20
4
0
06 Jan 2024
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
50
48
0
06 Oct 2023
SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent
  Reinforcement Learning
SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning
Maxwell Standen
Junae Kim
Claudia Szabo
AAML
39
5
0
11 Jan 2023
Doubly Inhomogeneous Reinforcement Learning
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
31
2
0
08 Nov 2022
Prioritizing emergency evacuations under compounding levels of
  uncertainty
Prioritizing emergency evacuations under compounding levels of uncertainty
Lisa J. Einstein
Robert J. Moss
Mykel J. Kochenderfer
14
1
0
30 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo-wen Li
Ding Zhao
79
45
0
16 Sep 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
42
9
0
03 Mar 2022
Continual Learning In Environments With Polynomial Mixing Times
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
41
8
0
13 Dec 2021
Coarse-Grained Smoothness for RL in Metric Spaces
Coarse-Grained Smoothness for RL in Metric Spaces
Giorgio Giannone
Kavosh Asadi
Cameron Allen
Sam Lobel
George Konidaris
Michael Littman
44
3
0
23 Oct 2021
Blackwell Online Learning for Markov Decision Processes
Blackwell Online Learning for Markov Decision Processes
Tao Li
Guanze Peng
Quanyan Zhu
OffRL
19
16
0
28 Dec 2020
Control with adaptive Q-learning
Control with adaptive Q-learning
J. Araújo
Mário A. T. Figueiredo
M. Botto
33
2
0
03 Nov 2020
Towards Safe Policy Improvement for Non-Stationary MDPs
Towards Safe Policy Improvement for Non-Stationary MDPs
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
OffRL
71
33
0
23 Oct 2020
Single-partition adaptive Q-learning
Single-partition adaptive Q-learning
J. Araújo
Mário A. T. Figueiredo
M. Botto
OffRL
20
2
0
14 Jul 2020
A Survey of Reinforcement Learning Algorithms for Dynamically Varying
  Environments
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments
Sindhu Padakandla
28
145
0
19 May 2020
Optimizing for the Future in Non-Stationary MDPs
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
16
65
0
17 May 2020
Wasserstein Robust Reinforcement Learning
Wasserstein Robust Reinforcement Learning
Mohammed Abdullah
Hang Ren
Haitham Bou-Ammar
Vladimir Milenkovic
Rui Luo
Mingtian Zhang
Jun Wang
32
75
0
30 Jul 2019
Open Loop Execution of Tree-Search Algorithms, extended version
Open Loop Execution of Tree-Search Algorithms, extended version
Erwan Lecarpentier
G. Infantes
C. Lesire
Emmanuel Rachelson
19
8
0
03 May 2018
1