ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.10719
  4. Cited By
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via
  pT-Learning

Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning

20 October 2021
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
ArXivPDFHTML

Papers citing "Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning"

19 / 19 papers shown
Title
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data
Rui Miao
Babak Shahbaba
Annie Qu
OffRL
31
0
0
14 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
165
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
69
0
0
22 Feb 2025
Low-Rank Online Dynamic Assortment with Dual Contextual Information
Low-Rank Online Dynamic Assortment with Dual Contextual Information
Seong Jin Lee
Will Wei Sun
Yufeng Liu
40
0
0
19 Apr 2024
AI in Pharma for Personalized Sequential Decision-Making: Methods,
  Applications and Opportunities
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities
Yuhan Li
Hongtao Zhang
Keaven M Anderson
Songzi Li
Ruoqing Zhu
27
0
0
30 Nov 2023
Stage-Aware Learning for Dynamic Treatments
Stage-Aware Learning for Dynamic Treatments
Han Ye
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
21
1
0
30 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
36
4
0
10 Oct 2023
Stackelberg Batch Policy Learning
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
35
0
0
28 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
  Error Quantification Framework
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
28
4
0
23 Sep 2023
Off-policy Evaluation in Doubly Inhomogeneous Environments
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
27
3
0
14 Jun 2023
Sequential Knockoffs for Variable Selection in Reinforcement Learning
Sequential Knockoffs for Variable Selection in Reinforcement Learning
Tao Ma
Hengrui Cai
Zhengling Qi
C. Shi
Eric B. Laber
29
3
0
24 Mar 2023
Quasi-optimal Reinforcement Learning with Continuous Actions
Quasi-optimal Reinforcement Learning with Continuous Actions
Yuhan Li
Wenzhuo Zhou
Ruoqing Zhu
OffRL
32
5
0
21 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health
Deep Spectral Q-learning with Application to Mobile Health
Yuhe Gao
C. Shi
R. Song
32
0
0
03 Jan 2023
Doubly Inhomogeneous Reinforcement Learning
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
31
2
0
08 Nov 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal
  Adaptive Interventions
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
30
5
0
04 Mar 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
42
9
0
03 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement
  Learning in Infinite Horizons
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRL
OnRL
32
10
0
26 Feb 2022
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation
  in Two-sided Markets
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets
C. Shi
Runzhe Wan
Ge Song
Shuang Luo
R. Song
Hongtu Zhu
OffRL
41
6
0
21 Feb 2022
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average
  Reward
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward
S. Murphy
Yanzhen Deng
Eric B. Laber
H. Maei
R. Sutton
K. Witkiewitz
OffRL
33
22
0
18 Jul 2016
1