ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1209.2355
  4. Cited By
Counterfactual Reasoning and Learning Systems

Counterfactual Reasoning and Learning Systems

11 September 2012
Léon Bottou
J. Peters
J. Q. Candela
Denis Xavier Charles
D. M. Chickering
Elon Portugaly
Dipankar Ray
Patrice Y. Simard
Edward Snelson
    CML
    OffRL
ArXivPDFHTML

Papers citing "Counterfactual Reasoning and Learning Systems"

50 / 363 papers shown
Title
Adaptive Robot-Assisted Feeding: An Online Learning Framework for
  Acquiring Previously Unseen Food Items
Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items
E. Gordon
Xiang Meng
Matt Barnes
Tapomayukh Bhattacharjee
S. Srinivasa
OffRL
OnRL
13
45
0
19 Aug 2019
Policy Evaluation with Latent Confounders via Optimal Balance
Policy Evaluation with Latent Confounders via Optimal Balance
Andrew Bennett
Nathan Kallus
CML
22
18
0
06 Aug 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation
Deep Reinforcement Learning for Personalized Search Story Recommendation
Jason Zhang
Zhang
Junming Yin
Dongwon Lee
Linhong Zhu
33
2
0
26 Jul 2019
On the Value of Bandit Feedback for Offline Recommender System
  Evaluation
On the Value of Bandit Feedback for Offline Recommender System Evaluation
Olivier Jeunen
D. Rohde
Flavian Vasile
OffRL
14
10
0
26 Jul 2019
Off-policy Learning for Multiple Loggers
Off-policy Learning for Multiple Loggers
Li He
Long Xia
Wei Zeng
Zhi-Ming Ma
Yue Zhao
Dawei Yin
OffRL
15
10
0
23 Jul 2019
Doubly robust off-policy evaluation with shrinkage
Doubly robust off-policy evaluation with shrinkage
Yi-Hsun Su
Maria Dimakopoulou
A. Krishnamurthy
Miroslav Dudík
OffRL
19
104
0
22 Jul 2019
The Dangers of Post-hoc Interpretability: Unjustified Counterfactual
  Explanations
The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations
Thibault Laugel
Marie-Jeanne Lesot
Christophe Marsala
X. Renard
Marcin Detyniecki
22
194
0
22 Jul 2019
Addressing Delayed Feedback for Continuous Training with Neural Networks
  in CTR prediction
Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction
S. Ktena
Alykhan Tejani
Lucas Theis
Pranay K. Myana
D. Dilipkumar
Ferenc Huszár
Steven Yoo
Wenzhe Shi
NoLa
19
52
0
15 Jul 2019
Exploration by Optimisation in Partial Monitoring
Exploration by Optimisation in Partial Monitoring
Tor Lattimore
Csaba Szepesvári
33
38
0
12 Jul 2019
An Optimistic Perspective on Offline Reinforcement Learning
An Optimistic Perspective on Offline Reinforcement Learning
Rishabh Agarwal
Dale Schuurmans
Mohammad Norouzi
OffRL
OnRL
38
69
0
10 Jul 2019
On Open-Universe Causal Reasoning
On Open-Universe Causal Reasoning
D. Ibeling
Thomas Icard
LRM
AI4CE
23
8
0
04 Jul 2019
Interpretable and Personalized Apprenticeship Scheduling: Learning
  Interpretable Scheduling Policies from Heterogeneous User Demonstrations
Interpretable and Personalized Apprenticeship Scheduling: Learning Interpretable Scheduling Policies from Heterogeneous User Demonstrations
Rohan R. Paleja
Andrew Silva
Letian Chen
Matthew C. Gombolay
30
31
0
14 Jun 2019
Distributionally Robust Counterfactual Risk Minimization
Distributionally Robust Counterfactual Risk Minimization
Louis Faury
Ugo Tanielian
Flavian Vasile
E. Smirnova
Elvis Dohmatob
28
45
0
14 Jun 2019
Issues with post-hoc counterfactual explanations: a discussion
Issues with post-hoc counterfactual explanations: a discussion
Thibault Laugel
Marie-Jeanne Lesot
Christophe Marsala
Marcin Detyniecki
CML
107
44
0
11 Jun 2019
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with
  Marginalized Importance Sampling
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Tengyang Xie
Yifei Ma
Yu Wang
OffRL
46
178
0
08 Jun 2019
Empirical Likelihood for Contextual Bandits
Empirical Likelihood for Contextual Bandits
Nikos Karampatziakis
John Langford
Paul Mineiro
OffRL
23
9
0
07 Jun 2019
The Label Complexity of Active Learning from Observational Data
The Label Complexity of Active Learning from Observational Data
Songbai Yan
Kamalika Chaudhuri
T. Javidi
14
9
0
29 May 2019
Privacy-Preserving Causal Inference via Inverse Probability Weighting
Privacy-Preserving Causal Inference via Inverse Probability Weighting
Si Kai Lee
Luigi Gresele
Mijung Park
Krikamol Muandet
17
10
0
29 May 2019
Miss Tools and Mr Fruit: Emergent communication in agents learning about
  object affordances
Miss Tools and Mr Fruit: Emergent communication in agents learning about object affordances
Diane Bouchacourt
Marco Baroni
21
22
0
28 May 2019
Spatial Positioning Token (SPToken) for Smart Mobility
Spatial Positioning Token (SPToken) for Smart Mobility
R. Overko
Rodrigo H. Ordóñez-Hurtado
Sergiy Zhuk
P. Ferraro
Andrew Cullen
Robert Shorten
25
15
0
16 May 2019
Active Learning for Decision-Making from Imbalanced Observational Data
Active Learning for Decision-Making from Imbalanced Observational Data
Iiris Sundin
Peter F. Schulam
E. Siivola
Aki Vehtari
Suchi Saria
Samuel Kaski
OOD
CML
12
29
0
10 Apr 2019
Robust Multi-agent Counterfactual Prediction
Robust Multi-agent Counterfactual Prediction
A. Peysakhovich
Christian Kroer
Adam Lerer
27
11
0
03 Apr 2019
Bayesian Optimization for Policy Search via Online-Offline
  Experimentation
Bayesian Optimization for Policy Search via Online-Offline Experimentation
Benjamin Letham
E. Bakshy
OffRL
27
56
0
01 Apr 2019
Classifying Treatment Responders Under Causal Effect Monotonicity
Classifying Treatment Responders Under Causal Effect Monotonicity
Nathan Kallus
CML
28
16
0
14 Feb 2019
Fair Decisions Despite Imperfect Predictions
Fair Decisions Despite Imperfect Predictions
Niki Kilbertus
Manuel Gomez Rodriguez
Bernhard Schölkopf
Krikamol Muandet
Isabel Valera
FaML
OffRL
28
5
0
08 Feb 2019
Computing large market equilibria using abstractions
Computing large market equilibria using abstractions
Christian Kroer
A. Peysakhovich
Eric Sodomka
Nicolas Stier-Moses
28
31
0
18 Jan 2019
Imitation-Regularized Offline Learning
Imitation-Regularized Offline Learning
Yifei Ma
Yu Wang
Balakrishnan
Balakrishnan Narayanaswamy
OffRL
14
22
0
15 Jan 2019
Gradient Regularized Budgeted Boosting
Gradient Regularized Budgeted Boosting
Z. Xu
Matt J. Kusner
Kilian Q. Weinberger
A. Zheng
19
2
0
13 Jan 2019
Off-Policy Evaluation of Probabilistic Identity Data in Lookalike
  Modeling
Off-Policy Evaluation of Probabilistic Identity Data in Lookalike Modeling
R. Cotta
Mingyang Hu
Dan Jiang
Peizhou Liao
OffRL
11
6
0
04 Jan 2019
The Music Streaming Sessions Dataset
The Music Streaming Sessions Dataset
B. Brost
Rishabh Mehrotra
T. Jehan
21
74
0
31 Dec 2018
Top-K Off-Policy Correction for a REINFORCE Recommender System
Top-K Off-Policy Correction for a REINFORCE Recommender System
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
33
474
0
06 Dec 2018
A Survey on Semantic Parsing
A Survey on Semantic Parsing
Aishwarya Kamath
Rajarshi Das
22
117
0
03 Dec 2018
Counterfactual Learning from Human Proofreading Feedback for Semantic
  Parsing
Counterfactual Learning from Human Proofreading Feedback for Semantic Parsing
Carolin (Haas) Lawrence
Stefan Riezler
OffRL
15
7
0
29 Nov 2018
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
CML
37
135
0
15 Nov 2018
Explaining Deep Learning Models using Causal Inference
Explaining Deep Learning Models using Causal Inference
Tanmayee Narendra
A. Sankaran
Deepak Vijaykeerthy
Senthil Mani
CML
26
52
0
11 Nov 2018
CAB: Continuous Adaptive Blending Estimator for Policy Evaluation and
  Learning
CAB: Continuous Adaptive Blending Estimator for Policy Evaluation and Learning
Yi-Hsun Su
Lequn Wang
Michele Santacatterina
Mohsen Guizani
CML
OffRL
8
6
0
06 Nov 2018
contextual: Evaluating Contextual Multi-Armed Bandit Problems in R
contextual: Evaluating Contextual Multi-Armed Bandit Problems in R
Sayed Chhattan Shah
M. Kaptein
CML
24
8
0
06 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
30
139
0
01 Nov 2018
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu
Lihong Li
Ziyang Tang
Dengyong Zhou
OffRL
18
350
0
29 Oct 2018
Software Engineering Challenges of Deep Learning
Software Engineering Challenges of Deep Learning
Anders Arpteg
B. Brinne
L. Crnkovic-Friis
J. Bosch
30
174
0
29 Oct 2018
A Survey of Learning Causality with Data: Problems and Methods
A Survey of Learning Causality with Data: Problems and Methods
Ruocheng Guo
Lu Cheng
Jundong Li
P. R. Hahn
Huan Liu
CML
37
168
0
25 Sep 2018
LensKit for Python: Next-Generation Software for Recommender System
  Experiments
LensKit for Python: Next-Generation Software for Recommender System Experiments
Michael D. Ekstrand
VLM
14
79
0
10 Sep 2018
Efficient Counterfactual Learning from Bandit Feedback
Efficient Counterfactual Learning from Bandit Feedback
Yusuke Narita
Shota Yasui
Kohei Yata
OffRL
16
47
0
10 Sep 2018
Robust Counterfactual Inferences using Feature Learning and their
  Applications
Robust Counterfactual Inferences using Feature Learning and their Applications
A. Mitra
Kannan Achan
Sushant Kumar
CML
OffRL
13
0
0
22 Aug 2018
Genie: An Open Box Counterfactual Policy Estimator for Optimizing
  Sponsored Search Marketplace
Genie: An Open Box Counterfactual Policy Estimator for Optimizing Sponsored Search Marketplace
Murat Ali Bayir
Mingsen Xu
Yaojia Zhu
Yifan Shi
OffRL
13
10
0
22 Aug 2018
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error
  Reduction via Surrogate Policy
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy
Yuan Xie
Boyi Liu
Qiang Liu
Zhaoran Wang
Yuanshuo Zhou
Jian-wei Peng
OffRL
9
19
0
01 Aug 2018
Troubling Trends in Machine Learning Scholarship
Troubling Trends in Machine Learning Scholarship
Zachary Chase Lipton
Jacob Steinhardt
29
288
0
09 Jul 2018
Cause-Effect Deep Information Bottleneck For Systematically Missing
  Covariates
Cause-Effect Deep Information Bottleneck For Systematically Missing Covariates
S. Parbhoo
Mario Wieser
Aleksander Wieczorek
Volker Roth
CML
20
5
0
06 Jul 2018
Bayesian Counterfactual Risk Minimization
Bayesian Counterfactual Risk Minimization
Ben London
Ted Sandler
OffRL
11
30
0
29 Jun 2018
A Tour of Reinforcement Learning: The View from Continuous Control
A Tour of Reinforcement Learning: The View from Continuous Control
Benjamin Recht
24
620
0
25 Jun 2018
Previous
12345678
Next