Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.03983
Cited By
Doubly Inhomogeneous Reinforcement Learning
8 November 2022
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Doubly Inhomogeneous Reinforcement Learning"
20 / 20 papers shown
Title
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data
Rui Miao
Babak Shahbaba
Annie Qu
OffRL
75
0
0
14 May 2025
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
47
7
0
14 Jun 2023
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
73
9
0
03 Mar 2022
Reinforcement Learning with Heterogeneous Data: Estimation and Inference
Elynn Y. Chen
Rui Song
Michael I. Jordan
OffRL
51
10
0
31 Jan 2022
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
53
22
0
20 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
79
21
0
18 Oct 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
76
27
0
08 Aug 2021
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching
Runzhe Wan
Sheng Zhang
C. Shi
Shuang Luo
R. Song
AI4TS
23
3
0
27 May 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
105
104
0
10 Feb 2021
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL
LRM
100
577
0
16 Sep 2020
Batch Policy Learning in Average Reward Markov Decision Processes
Peng Liao
Zhengling Qi
Runzhe Wan
P. Klasnja
Susan Murphy
OffRL
74
84
0
23 Jul 2020
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
OffRL
45
93
0
24 Jun 2020
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
C. Shi
Runzhe Wan
R. Song
Wenbin Lu
Ling Leng
43
38
0
05 Feb 2020
Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings
C. Shi
Shengyao Zhang
W. Lu
R. Song
OffRL
40
87
0
13 Jan 2020
Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning, Extended version
Erwan Lecarpentier
Emmanuel Rachelson
56
83
0
22 Apr 2019
Subgeometric ergodicity and
β
β
β
-mixing
Mika Meitz
P. Saikkonen
54
3
0
15 Apr 2019
Dynamic Assortment Optimization with Changing Contextual Information
Xi Chen
Yining Wang
Yuanshuo Zhou
28
50
0
31 Oct 2018
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning
Daniel J. Luckett
Eric B. Laber
A. Kahkoska
D. Maahs
E. Mayer‐Davis
Michael R. Kosorok
55
137
0
10 Nov 2016
Optimal Uniform Convergence Rates and Asymptotic Normality for Series Estimators Under Weak Dependence and Weak Conditions
Xiaohong Chen
T. Christensen
58
151
0
18 Dec 2014
Wild binary segmentation for multiple change-point detection
Piotr Fryzlewicz
68
650
0
04 Nov 2014
1