ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.02362
  4. Cited By
Counterfactual Risk Minimization: Learning from Logged Bandit Feedback

Counterfactual Risk Minimization: Learning from Logged Bandit Feedback

9 February 2015
Adith Swaminathan
Thorsten Joachims
    OffRL
ArXivPDFHTML

Papers citing "Counterfactual Risk Minimization: Learning from Logged Bandit Feedback"

36 / 36 papers shown
Title
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
26
0
0
18 Jul 2024
Increasing Entropy to Boost Policy Gradient Performance on
  Personalization Tasks
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
24
0
0
09 Oct 2023
Model-based Constrained MDP for Budget Allocation in Sequential
  Incentive Marketing
Model-based Constrained MDP for Budget Allocation in Sequential Incentive Marketing
Shuai Xiao
Le Guo
Zaifan Jiang
Lei Lv
Yuanbo Chen
Jun Zhu
Shuang Yang
30
21
0
02 Mar 2023
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old
  Data in Nonstationary Environments
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
Vincent Liu
Yash Chandak
Philip S. Thomas
Martha White
OffRL
24
0
0
23 Feb 2023
How to select predictive models for causal inference?
How to select predictive models for causal inference?
M. Doutreligne
Gaël Varoquaux
ELM
CML
29
2
0
01 Feb 2023
Situating Recommender Systems in Practice: Towards Inductive Learning
  and Incremental Updates
Situating Recommender Systems in Practice: Towards Inductive Learning and Incremental Updates
Tobias Schnabel
Mengting Wan
Longqi Yang
HAI
27
8
0
11 Nov 2022
Simulated Contextual Bandits for Personalization Tasks from
  Recommendation Datasets
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Anton Dereventsov
A. Bibin
24
1
0
12 Oct 2022
Offline Policy Optimization with Eligible Actions
Offline Policy Optimization with Eligible Actions
Yao Liu
Yannis Flet-Berliac
Emma Brunskill
OffRL
31
5
0
01 Jul 2022
Assessing Fairness in the Presence of Missing Data
Assessing Fairness in the Presence of Missing Data
Yiliang Zhang
Q. Long
FaML
31
35
0
07 Dec 2021
Optimal Decision Rules Under Partial Identification
Optimal Decision Rules Under Partial Identification
Kohei Yata
41
19
0
09 Nov 2021
Interpretable Personalized Experimentation
Interpretable Personalized Experimentation
Han Wu
S. Tan
Weiwei Li
Mia Garrard
Adam Obeng
Drew Dimmery
Shaun Singh
Hanson Wang
Daniel R. Jiang
E. Bakshy
33
5
0
05 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes
  under Sequential Ignorability
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Yupeng Tang
Seung-seob Lee
OffRL
59
22
0
24 Oct 2021
Counterfactual Adversarial Learning with Representation Interpolation
Counterfactual Adversarial Learning with Representation Interpolation
Wen Wang
Wei Ping
Ning Shi
Jinfeng Li
Bingyu Zhu
Xiangyu Liu
Rongxin Zhang
AAML
OOD
CML
26
2
0
10 Sep 2021
Constrained Classification and Policy Learning
Constrained Classification and Policy Learning
T. Kitagawa
Shosei Sakaguchi
A. Tetenov
OffRL
37
12
0
24 Jun 2021
Human-AI Collaboration with Bandit Feedback
Human-AI Collaboration with Bandit Feedback
Ruijiang Gao
M. Saar-Tsechansky
Maria De-Arteaga
Ligong Han
Min Kyung Lee
Matthew Lease
54
49
0
22 May 2021
Understanding the role of importance weighting for deep learning
Understanding the role of importance weighting for deep learning
Da Xu
Yuting Ye
Chuanwei Ruan
FAtt
39
43
0
28 Mar 2021
The Causal Learning of Retail Delinquency
The Causal Learning of Retail Delinquency
Yiyan Huang
Cheuk Hang Leung
Xing Yan
Qi Wu
Nanbo Peng
DongDong Wang
Zhixiang Huang
CML
22
8
0
17 Dec 2020
CoinDICE: Off-Policy Confidence Interval Estimation
CoinDICE: Off-Policy Confidence Interval Estimation
Bo Dai
Ofir Nachum
Yinlam Chow
Lihong Li
Csaba Szepesvári
Dale Schuurmans
OffRL
29
84
0
22 Oct 2020
Unbiased Learning for the Causal Effect of Recommendation
Unbiased Learning for the Causal Effect of Recommendation
Masahiro Sato
S. Takemori
Janmajay Singh
Tomoko Ohkuma
CML
OffRL
17
69
0
11 Aug 2020
Efficient Contextual Bandits with Continuous Actions
Efficient Contextual Bandits with Continuous Actions
Maryam Majzoubi
Chicheng Zhang
Rajan Chari
A. Krishnamurthy
John Langford
Aleksandrs Slivkins
OffRL
29
32
0
10 Jun 2020
Off-policy Learning for Remote Electrical Tilt Optimization
Off-policy Learning for Remote Electrical Tilt Optimization
Filippo Vannella
Jaeseong Jeong
Alexandre Proutiere
OffRL
22
14
0
21 May 2020
Combining Offline Causal Inference and Online Bandit Learning for Data
  Driven Decision
Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decision
Li Ye
Yishi Lin
Hong Xie
John C. S. Lui
CML
8
11
0
16 Jan 2020
Kinematic State Abstraction and Provably Efficient Rich-Observation
  Reinforcement Learning
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
33
151
0
13 Nov 2019
Learning from Bandit Feedback: An Overview of the State-of-the-art
Learning from Bandit Feedback: An Overview of the State-of-the-art
Olivier Jeunen
Dmytro Mykhaylov
D. Rohde
Flavian Vasile
Alexandre Gilotte
Martin Bompaire
OffRL
22
10
0
18 Sep 2019
Productization Challenges of Contextual Multi-Armed Bandits
Productization Challenges of Contextual Multi-Armed Bandits
D. Abensur
Ivan Balashov
S. Bar
R. Lempel
Nurit Moscovici
I. Orlov
Danny Rosenstein
Ido Tamir
22
3
0
10 Jul 2019
Distributionally Robust Counterfactual Risk Minimization
Distributionally Robust Counterfactual Risk Minimization
Louis Faury
Ugo Tanielian
Flavian Vasile
E. Smirnova
Elvis Dohmatob
28
45
0
14 Jun 2019
Semi-Parametric Efficient Policy Learning with Continuous Actions
Semi-Parametric Efficient Policy Learning with Continuous Actions
Mert Demirer
Vasilis Syrgkanis
Greg Lewis
Victor Chernozhukov
OffRL
6
51
0
24 May 2019
Classifying Treatment Responders Under Causal Effect Monotonicity
Classifying Treatment Responders Under Causal Effect Monotonicity
Nathan Kallus
CML
28
16
0
14 Feb 2019
Interval Estimation of Individual-Level Causal Effects Under Unobserved
  Confounding
Interval Estimation of Individual-Level Causal Effects Under Unobserved Confounding
Nathan Kallus
Xiaojie Mao
Angela Zhou
CML
22
91
0
05 Oct 2018
Optimization over Continuous and Multi-dimensional Decisions with
  Observational Data
Optimization over Continuous and Multi-dimensional Decisions with Observational Data
Dimitris Bertsimas
Christopher McCord
34
27
0
11 Jul 2018
Counterfactual Mean Embeddings
Counterfactual Mean Embeddings
Krikamol Muandet
Motonobu Kanagawa
Sorawit Saengkyongam
S. Marukatat
CML
OffRL
26
39
0
22 May 2018
Confounding-Robust Policy Improvement
Confounding-Robust Policy Improvement
Nathan Kallus
Angela Zhou
CML
OffRL
40
152
0
22 May 2018
Learning Optimal Policies from Observational Data
Learning Optimal Policies from Observational Data
Onur Atan
W. Zame
M. Schaar
CML
OOD
OffRL
24
18
0
23 Feb 2018
Learning Weighted Representations for Generalization Across Designs
Learning Weighted Representations for Generalization Across Designs
Fredrik D. Johansson
Nathan Kallus
Uri Shalit
David Sontag
OOD
39
87
0
23 Feb 2018
Policy Evaluation and Optimization with Continuous Treatments
Policy Evaluation and Optimization with Continuous Treatments
Nathan Kallus
Angela Zhou
OffRL
11
132
0
16 Feb 2018
Predicting Counterfactuals from Large Historical Data and Small
  Randomized Trials
Predicting Counterfactuals from Large Historical Data and Small Randomized Trials
Nir Rosenfeld
Yishay Mansour
E. Yom-Tov
CML
61
25
0
24 Oct 2016
1