ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.03983
  4. Cited By
Doubly Inhomogeneous Reinforcement Learning

Doubly Inhomogeneous Reinforcement Learning

8 November 2022
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
    OffRL
ArXivPDFHTML

Papers citing "Doubly Inhomogeneous Reinforcement Learning"

20 / 20 papers shown
Title
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data
Rui Miao
Babak Shahbaba
Annie Qu
OffRL
75
0
0
14 May 2025
Off-policy Evaluation in Doubly Inhomogeneous Environments
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
47
7
0
14 Jun 2023
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
73
9
0
03 Mar 2022
Reinforcement Learning with Heterogeneous Data: Estimation and Inference
Reinforcement Learning with Heterogeneous Data: Estimation and Inference
Elynn Y. Chen
Rui Song
Michael I. Jordan
OffRL
51
10
0
31 Jan 2022
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via
  pT-Learning
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
53
22
0
20 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary
  MDPs
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
79
21
0
18 Oct 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
76
27
0
08 Aug 2021
Pattern Transfer Learning for Reinforcement Learning in Order
  Dispatching
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching
Runzhe Wan
Sheng Zhang
C. Shi
Shuang Luo
R. Song
AI4TS
23
3
0
27 May 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An
  Optimal Black-box Approach
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
105
104
0
10 Feb 2021
Transfer Learning in Deep Reinforcement Learning: A Survey
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL
LRM
100
577
0
16 Sep 2020
Batch Policy Learning in Average Reward Markov Decision Processes
Batch Policy Learning in Average Reward Markov Decision Processes
Peng Liao
Zhengling Qi
Runzhe Wan
P. Klasnja
Susan Murphy
OffRL
74
84
0
23 Jul 2020
Reinforcement Learning for Non-Stationary Markov Decision Processes: The
  Blessing of (More) Optimism
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
OffRL
45
93
0
24 Jun 2020
Does the Markov Decision Process Fit the Data: Testing for the Markov
  Property in Sequential Decision Making
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
C. Shi
Runzhe Wan
R. Song
Wenbin Lu
Ling Leng
43
38
0
05 Feb 2020
Statistical Inference of the Value Function for Reinforcement Learning
  in Infinite Horizon Settings
Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings
C. Shi
Shengyao Zhang
W. Lu
R. Song
OffRL
40
87
0
13 Jan 2020
Non-Stationary Markov Decision Processes, a Worst-Case Approach using
  Model-Based Reinforcement Learning, Extended version
Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning, Extended version
Erwan Lecarpentier
Emmanuel Rachelson
56
83
0
22 Apr 2019
Subgeometric ergodicity and $β$-mixing
Subgeometric ergodicity and βββ-mixing
Mika Meitz
P. Saikkonen
54
3
0
15 Apr 2019
Dynamic Assortment Optimization with Changing Contextual Information
Dynamic Assortment Optimization with Changing Contextual Information
Xi Chen
Yining Wang
Yuanshuo Zhou
28
50
0
31 Oct 2018
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning
Daniel J. Luckett
Eric B. Laber
A. Kahkoska
D. Maahs
E. Mayer‐Davis
Michael R. Kosorok
55
137
0
10 Nov 2016
Optimal Uniform Convergence Rates and Asymptotic Normality for Series
  Estimators Under Weak Dependence and Weak Conditions
Optimal Uniform Convergence Rates and Asymptotic Normality for Series Estimators Under Weak Dependence and Weak Conditions
Xiaohong Chen
T. Christensen
58
151
0
18 Dec 2014
Wild binary segmentation for multiple change-point detection
Wild binary segmentation for multiple change-point detection
Piotr Fryzlewicz
68
650
0
04 Nov 2014
1