ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.04510
  4. Cited By
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling

ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling

9 March 2022
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
    OffRL
ArXivPDFHTML

Papers citing "ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling"

10 / 10 papers shown
Title
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive
  Approach
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
29
0
0
17 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
102
1
0
08 Oct 2024
Doubly Optimal Policy Evaluation for Reinforcement Learning
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
32
2
0
03 Oct 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in
  Tabular MDP
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
51
0
0
04 Jun 2024
On-Policy Policy Gradient Reinforcement Learning Without On-Policy
  Sampling
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
20
1
0
14 Nov 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Truncating Trajectories in Monte Carlo Reinforcement Learning
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
24
2
0
07 May 2023
Efficient Policy Evaluation with Offline Data Informed Behavior Policy
  Design
Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design
Shuze Liu
Shangtong Zhang
OffRL
30
3
0
31 Jan 2023
SPEED: Experimental Design for Policy Evaluation in Linear
  Heteroscedastic Bandits
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
53
5
0
29 Jan 2023
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in
  Reinforcement Learning
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
Rujie Zhong
Duohan Zhang
Lukas Schafer
Stefano V. Albrecht
Josiah P. Hanna
OOD
OffRL
11
12
0
29 Nov 2021
Minimax Number of Strata for Online Stratified Sampling given Noisy
  Samples
Minimax Number of Strata for Online Stratified Sampling given Noisy Samples
Alexandra Carpentier
Rémi Munos
49
13
0
18 May 2012
1