ResearchTrend.AI
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons

26 February 2022
C. Shi
S. Luo
Yuan Le
Hongtu Zhu
R. Song
    OffRL
    OnRL

Papers citing "Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons"

7 / 7 papers shown
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
67
0
0
22 Feb 2025
Orthogonalized Estimation of Difference of $Q$-functions
Angela Zhou
36
0
0
12 Jun 2024
Inference on Optimal Dynamic Policies via Softmax Approximation
Qizhao Chen
Morgane Austern
Vasilis Syrgkanis
OffRL
29
1
0
08 Mar 2023
Quasi-optimal Reinforcement Learning with Continuous Actions
Yuhan Li
Wenzhuo Zhou
Ruoqing Zhu
OffRL
24
5
0
21 Jan 2023
Offline Reinforcement Learning for Safer Blood Glucose Control in People with Type 1 Diabetes
Harry Emerson
Matt Guy
Ryan McConville
OffRL
29
46
0
07 Apr 2022
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
38
181
0
22 Aug 2019