Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits


3 December 2023
Muhammad Faaiz Taufiq
Arnaud Doucet
Rob Cornish
Jean-François Ton
    OffRL

Papers citing "Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits"

7 / 7 papers shown
A General Framework for Off-Policy Learning with Partially-Observed Reward
Rikiya Takehi
Masahiro Asami
K. Kawakami
Yuta Saito
OffRL
33
0
0
17 Jun 2025
Clustering Context in Off-Policy Evaluation
Daniel Guzman-Olivares
Philipp Schmidt
Jacek Golebiowski
Artur Bekasov
CML OffRL
76
0
0
28 Feb 2025
Towards Representation Learning for Weighting Problems in Design-Based Causal Inference
Oscar Clivio
Avi Feller
Chris Holmes
CML OOD
70
3
0
24 Sep 2024
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
73
5
0
23 May 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
178
5
0
22 Feb 2024
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
Noveen Sachdeva
Lequn Wang
Dawen Liang
Nathan Kallus
Julian McAuley
OffRL
72
14
0
24 Oct 2023
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
201
75
0
17 Aug 2020