Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
arXiv:1905.05824

14 May 2019
Michael Oberst
David Sontag
    CML
    OffRL

Papers citing "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models"

36 / 36 papers shown
When Counterfactual Reasoning Fails: Chaos and Real-World Complexity
Yahya Aalaila
Gerrit Großmann
Sumantrak Mukherjee
Jonas Wahl
Sebastian Vollmer
CML
LRM
64
0
0
31 Mar 2025
Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation
Joo Seung Lee
Malini Mahendra
Anil Aswani
OffRL
61
1
0
10 Jan 2025
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou
A. Sukovic
Yasaman Zolfimoselo
Goran Radanović
CML
37
0
0
16 Oct 2024
Counterfactual Token Generation in Large Language Models
Ivi Chatzi
N. C. Benz
Eleni Straitouri
Stratis Tsirtsis
Manuel Gomez Rodriguez
LRM
34
3
0
25 Sep 2024
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
69
1
0
26 Jun 2024
Counterfactual Influence in Markov Decision Processes
M. Kazemi
Jessica Lally
Ekaterina Tishchenko
Hana Chockler
Nicola Paoletti
23
1
0
13 Feb 2024
Finding Counterfactually Optimal Action Sequences in Continuous State Spaces
Stratis Tsirtsis
Manuel Gomez Rodriguez
CML
OffRL
27
9
0
06 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
16
6
0
01 Jun 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
53
37
0
02 May 2023
Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs
Stelios Triantafyllou
Goran Radanović
13
5
0
24 Feb 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
29
8
0
18 Feb 2023
Counterfactuals for the Future
Lucius E.J. Bynum
Joshua R. Loftus
Julia Stoyanovich
33
10
0
07 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
23
8
0
29 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
45
5
0
18 Nov 2022
Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues
Jiao Ou
Jinchao Zhang
Yang Feng
Jie Zhou
36
13
0
30 Oct 2022
MoCoDA: Model-based Counterfactual Data Augmentation
Silviu Pitis
Elliot Creager
Ajay Mandlekar
Animesh Garg
OffRL
48
33
0
20 Oct 2022
Is More Data All You Need? A Causal Exploration
Athanasios Vlontzos
Hadrien Reynaud
Bernhard Kainz
CML
29
2
0
06 Jun 2022
D'ARTAGNAN: Counterfactual Video Generation
Hadrien Reynaud
Athanasios Vlontzos
Mischa Dombrowski
Ciarán M. Gilligan-Lee
A. Beqiri
Paul Leeson
Bernhard Kainz
VGen
CML
MedIm
30
19
0
03 Jun 2022
Counterfactual Analysis in Dynamic Latent State Models
Martin Haugh
Raghav Singal
CML
29
6
0
27 May 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
54
22
0
26 May 2022
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
David Bruns-Smith
CML
ELM
OffRL
24
12
0
02 Apr 2022
Counterfactual Temporal Point Processes
Kimia Noorbakhsh
Manuel Gomez Rodriguez
22
22
0
15 Nov 2021
A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning
Iris A. M. Huijben
W. Kool
Max B. Paulus
Ruud J. G. van Sloun
26
93
0
04 Oct 2021
A Spectral Approach to Off-Policy Evaluation for POMDPs
Yash Nair
Nan Jiang
OffRL
26
17
0
22 Sep 2021
Estimating Categorical Counterfactuals via Deep Twin Networks
Athanasios Vlontzos
Bernhard Kainz
Ciarán M. Gilligan-Lee
OOD
CML
BDL
26
16
0
04 Sep 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
50
26
0
08 Aug 2021
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings
Shengpu Tang
Jenna Wiens
OffRL
26
78
0
23 Jul 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
OffRL
18
34
0
19 Feb 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Anton van den Hengel
Bernhard Schölkopf
CML
OOD
OffRL
15
56
0
16 Dec 2020
Deep Structural Causal Models for Tractable Counterfactual Inference
Nick Pawlowski
Daniel Coelho De Castro
Ben Glocker
CML
MedIm
33
229
0
11 Jun 2020
Counterfactual VQA: A Cause-Effect Look at Language Bias
Yulei Niu
Kaihua Tang
Hanwang Zhang
Zhiwu Lu
Xiansheng Hua
Ji-Rong Wen
CML
41
394
0
08 Jun 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
OffRL
8
63
0
12 Mar 2020
Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning
Nathan Kallus
Angela Zhou
OffRL
35
58
0
11 Feb 2020
POPCORN: Partially Observed Prediction COnstrained ReiNforcement Learning
Joseph D. Futoma
M. C. Hughes
Finale Doshi-Velez
OffRL
18
49
0
13 Jan 2020
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
30
152
0
15 Nov 2019
Learning Representations for Counterfactual Inference
Fredrik D. Johansson
Uri Shalit
David Sontag
CML
OOD
BDL
232
719
0
12 May 2016