ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.06037
  4. Cited By
Policy Evaluation and Optimization with Continuous Treatments

Policy Evaluation and Optimization with Continuous Treatments

16 February 2018
Nathan Kallus
Angela Zhou
    OffRL
ArXivPDFHTML

Papers citing "Policy Evaluation and Optimization with Continuous Treatments"

50 / 79 papers shown
Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
248
0
0
27 Apr 2025
Prompt Optimization with Logged Bandit Data
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
86
0
0
03 Apr 2025
Off-policy estimation with adaptively collected data: the power of
  online learning
Off-policy estimation with adaptively collected data: the power of online learning
Jeonghwan Lee
Cong Ma
OffRL
81
0
0
19 Nov 2024
Uncertainty-Aware Optimal Treatment Selection for Clinical Time Series
Uncertainty-Aware Optimal Treatment Selection for Clinical Time Series
Thomas Schwarz
Cecilia Casolo
Niki Kilbertus
CML
46
0
0
11 Oct 2024
Probabilities of Causation for Continuous and Vector Variables
Probabilities of Causation for Continuous and Vector Variables
Yuta Kawakami
Manabu Kuroki
Jin Tian
48
4
0
30 May 2024
Kernel Metric Learning for In-Sample Off-Policy Evaluation of
  Deterministic RL Policies
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
Haanvid Lee
Tri Wahyu Guntara
Jongmin Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
37
1
0
29 May 2024
A Causal Framework for Evaluating Deferring Systems
A Causal Framework for Evaluating Deferring Systems
Filippo Palomba
Andrea Pugnana
Jose M. Alvarez
Salvatore Ruggieri
CML
59
3
0
29 May 2024
Contextual Linear Optimization with Bandit Feedback
Contextual Linear Optimization with Bandit Feedback
Yichun Hu
Nathan Kallus
Xiaojie Mao
Yanchen Wu
42
0
0
26 May 2024
Cross-Validated Off-Policy Evaluation
Cross-Validated Off-Policy Evaluation
Matej Cief
Branislav Kveton
Michal Kompan
OffRL
33
1
0
24 May 2024
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection
  and Learning
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
50
1
0
23 May 2024
Multi-Objective Recommendation via Multivariate Policy Learning
Multi-Objective Recommendation via Multivariate Policy Learning
Olivier Jeunen
Jatin Mandav
Ivan Potapov
Nakul Agarwal
Sourabh Vaid
Wenzhe Shi
Aleksei Ustimenko
OffRL
21
3
0
03 May 2024
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy
  Evaluation
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
34
9
0
30 Nov 2023
SCOPE-RL: A Python Library for Offline Reinforcement Learning and
  Off-Policy Evaluation
SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
ELM
42
4
0
30 Nov 2023
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
Noveen Sachdeva
Lequn Wang
Dawen Liang
Nathan Kallus
Julian McAuley
OffRL
43
12
0
24 Oct 2023
Optimal Exploration is no harder than Thompson Sampling
Optimal Exploration is no harder than Thompson Sampling
Zhaoqi Li
Kevin Jamieson
Lalit P. Jain
32
2
0
09 Oct 2023
Doubly Robust Proximal Causal Learning for Continuous Treatments
Doubly Robust Proximal Causal Learning for Continuous Treatments
Yong Wu
Yanwei Fu
Shouyan Wang
Xinwei Sun
28
1
0
22 Sep 2023
On the Actionability of Outcome Prediction
On the Actionability of Outcome Prediction
Lydia T. Liu
Solon Barocas
Jon Kleinberg
Karen Levy
OffRL
CML
13
7
0
08 Sep 2023
Causal Effect Estimation after Propensity Score Trimming with Continuous
  Treatments
Causal Effect Estimation after Propensity Score Trimming with Continuous Treatments
Zach Branson
Edward H. Kennedy
Sivaraman Balakrishnan
Larry Wasserman
CML
46
4
0
01 Sep 2023
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual
  Bandits
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits
Lequn Wang
A. Krishnamurthy
Aleksandrs Slivkins
OffRL
46
9
0
13 Jun 2023
Reliable Off-Policy Learning for Dosage Combinations
Reliable Off-Policy Learning for Dosage Combinations
Jonas Schweisthal
Dennis Frauen
Valentyn Melnychuk
Stefan Feuerriegel
OffRL
31
12
0
31 May 2023
Learning Action Embeddings for Off-Policy Evaluation
Learning Action Embeddings for Off-Policy Evaluation
Matej Cief
Jacek Golebiowski
Philipp Schmidt
Ziawasch Abedjan
Artur Bekasov
OffRL
6
5
0
06 May 2023
Fair Off-Policy Learning from Observational Data
Fair Off-Policy Learning from Observational Data
Dennis Frauen
Valentyn Melnychuk
Stefan Feuerriegel
FaML
OffRL
30
6
0
15 Mar 2023
Balanced Off-Policy Evaluation for Personalized Pricing
Balanced Off-Policy Evaluation for Personalized Pricing
Adam N. Elmachtoub
Vishal Gupta
Yunfan Zhao
OffRL
42
6
0
24 Feb 2023
Personalized Pricing with Invalid Instrumental Variables:
  Identification, Estimation, and Policy Learning
Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Rui Miao
Zhengling Qi
Cong Shi
Lin Lin
21
2
0
24 Feb 2023
Sequential Counterfactual Risk Minimization
Sequential Counterfactual Risk Minimization
Houssam Zenati
Eustache Diemert
Matthieu Martin
Julien Mairal
Pierre Gaillard
OffRL
29
3
0
23 Feb 2023
STEEL: Singularity-aware Reinforcement Learning
STEEL: Singularity-aware Reinforcement Learning
Xiaohong Chen
Zhengling Qi
Runzhe Wan
OffRL
32
2
0
30 Jan 2023
Causal Deep Reinforcement Learning Using Observational Data
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CML
OffRL
26
5
0
28 Nov 2022
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits
  with Continuous Actions
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Haanvid Lee
Jongmin Lee
Yunseon Choi
Wonseok Jeon
Byung-Jun Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
12
5
0
24 Oct 2022
PAC-Bayesian Offline Contextual Bandits With Guarantees
PAC-Bayesian Offline Contextual Bandits With Guarantees
Otmane Sakhi
Pierre Alquier
Nicolas Chopin
OffRL
34
12
0
24 Oct 2022
Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking
Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking
R. EshwarS
Shishir Kolathaya
Gugan Thoppe
24
0
0
22 Aug 2022
Conformal Off-policy Prediction
Conformal Off-policy Prediction
Yingying Zhang
C. Shi
Shuang Luo
OffRL
41
10
0
14 Jun 2022
Off-Policy Evaluation in Embedded Spaces
Off-Policy Evaluation in Embedded Spaces
Jaron J. R. Lee
David Arbour
Georgios Theocharous
OffRL
25
3
0
05 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement
  Learning in Infinite Horizons
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRL
OnRL
37
10
0
26 Feb 2022
Off-Policy Evaluation with Policy-Dependent Optimization Response
Off-Policy Evaluation with Policy-Dependent Optimization Response
Wenshuo Guo
Michael I. Jordan
Angela Zhou
CML
OffRL
34
3
0
25 Feb 2022
Policy Learning for Optimal Individualized Dose Intervals
Policy Learning for Optimal Individualized Dose Intervals
Guanhua Chen
Xiaomao Li
Menggang Yu
11
4
0
24 Feb 2022
Convex Surrogate Loss Functions for Contextual Pricing with Transaction
  Data
Convex Surrogate Loss Functions for Contextual Pricing with Transaction Data
Max Biggs
OffRL
23
1
0
16 Feb 2022
Off-Policy Evaluation for Large Action Spaces via Embeddings
Off-Policy Evaluation for Large Action Spaces via Embeddings
Yuta Saito
Thorsten Joachims
OffRL
33
43
0
13 Feb 2022
A nonparametric doubly robust test for a continuous treatment effect
A nonparametric doubly robust test for a continuous treatment effect
Charles R. Doss
Guangwei Weng
Lan Wang
I. Moscovice
T. Chantarat
21
2
0
07 Feb 2022
Interpretable Personalized Experimentation
Interpretable Personalized Experimentation
Han Wu
S. Tan
Weiwei Li
Mia Garrard
Adam Obeng
Drew Dimmery
Shaun Singh
Hanson Wang
Daniel R. Jiang
E. Bakshy
33
5
0
05 Nov 2021
Doubly Robust Interval Estimation for Optimal Policy Evaluation in
  Online Learning
Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
Ye Shen
Hengrui Cai
Rui Song
OffRL
40
2
0
29 Oct 2021
Invariant Policy Learning: A Causal Perspective
Invariant Policy Learning: A Causal Perspective
Sorawit Saengkyongam
Nikolaj Thams
J. Peters
Niklas Pfister
CML
OffRL
27
14
0
01 Jun 2021
Multiply Robust Causal Mediation Analysis with Continuous Treatments
Multiply Robust Causal Mediation Analysis with Continuous Treatments
Numair Sani
Yizhe Xu
AmirEmad Ghassami
I. Shpitser
12
7
0
19 May 2021
Automatic Double Machine Learning for Continuous Treatment Effects
Automatic Double Machine Learning for Continuous Treatment Effects
Sylvia Klosin
41
8
0
21 Apr 2021
Minimax Kernel Machine Learning for a Class of Doubly Robust Functionals
  with Application to Proximal Causal Inference
Minimax Kernel Machine Learning for a Class of Doubly Robust Functionals with Application to Proximal Causal Inference
AmirEmad Ghassami
Andrew Ying
I. Shpitser
E. T. Tchetgen
32
43
0
07 Apr 2021
Causal World Models by Unsupervised Deconfounding of Physical Dynamics
Causal World Models by Unsupervised Deconfounding of Physical Dynamics
Minne Li
Mengyue Yang
Furui Liu
Xu Chen
Zhitang Chen
Jun Wang
SyDa
CML
33
12
0
28 Dec 2020
Fairness, Welfare, and Equity in Personalized Pricing
Fairness, Welfare, and Equity in Personalized Pricing
Nathan Kallus
Angela Zhou
24
39
0
21 Dec 2020
Doubly Robust Off-Policy Learning on Low-Dimensional Manifolds by Deep
  Neural Networks
Doubly Robust Off-Policy Learning on Low-Dimensional Manifolds by Deep Neural Networks
Minshuo Chen
Hao Liu
Wenjing Liao
T. Zhao
CML
OOD
OffRL
20
7
0
03 Nov 2020
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment
  Settings
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings
Hengrui Cai
C. Shi
R. Song
Wenbin Lu
OffRL
10
13
0
29 Oct 2020
Kernel Methods for Causal Functions: Dose, Heterogeneous, and
  Incremental Response Curves
Kernel Methods for Causal Functions: Dose, Heterogeneous, and Incremental Response Curves
Rahul Singh
Liyuan Xu
Arthur Gretton
OffRL
68
28
0
10 Oct 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible
  Off-Policy Evaluation
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
24
73
0
17 Aug 2020
12
Next