ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.15877
  4. Cited By
Exponential Smoothing for Off-Policy Learning
v1v2 (latest)

Exponential Smoothing for Off-Policy Learning

25 May 2023
Imad Aouali
Victor-Emmanuel Brunel
D. Rohde
Anna Korba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Exponential Smoothing for Off-Policy Learning"

16 / 16 papers shown
Title
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
148
5
0
22 Feb 2024
PAC-Bayesian Offline Contextual Bandits With Guarantees
PAC-Bayesian Offline Contextual Bandits With Guarantees
Otmane Sakhi
Pierre Alquier
Nicolas Chopin
OffRL
128
14
0
24 Oct 2022
Probabilistic Rank and Reward: A Scalable Model for Slate Recommendation
Probabilistic Rank and Reward: A Scalable Model for Slate Recommendation
Imad Aouali
Achraf Ait Sidi Hammou
Otmane Sakhi
D. Rohde
Flavian Vasile
OffRL
33
7
0
10 Aug 2022
Generalization Bounds for Meta-Learning via PAC-Bayes and Uniform
  Stability
Generalization Bounds for Meta-Learning via PAC-Bayes and Uniform Stability
Alec Farid
Anirudha Majumdar
70
36
0
12 Feb 2021
Optimal Off-Policy Evaluation from Multiple Logging Policies
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
58
40
0
21 Oct 2020
BLOB : A Probabilistic Model for Recommendation that Combines Organic
  and Bandit Signals
BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Otmane Sakhi
Stephen Bonner
D. Rohde
Flavian Vasile
65
35
0
28 Aug 2020
Efron-Stein PAC-Bayesian Inequalities
Efron-Stein PAC-Bayesian Inequalities
Ilja Kuzborskij
Csaba Szepesvári
62
22
0
04 Sep 2019
Distributionally Robust Counterfactual Risk Minimization
Distributionally Robust Counterfactual Risk Minimization
Louis Faury
Ugo Tanielian
Flavian Vasile
E. Smirnova
Elvis Dohmatob
60
45
0
14 Jun 2019
A Primer on PAC-Bayesian Learning
A Primer on PAC-Bayesian Learning
Benjamin Guedj
159
223
0
16 Jan 2019
Offline A/B testing for Recommender Systems
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
78
222
0
22 Jan 2018
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning
  Algorithms
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
Han Xiao
Kashif Rasul
Roland Vollgraf
283
8,904
0
25 Aug 2017
Variational Dropout and the Local Reparameterization Trick
Variational Dropout and the Local Reparameterization Trick
Diederik P. Kingma
Tim Salimans
Max Welling
BDL
226
1,514
0
08 Jun 2015
Doubly Robust Policy Evaluation and Optimization
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
182
286
0
10 Mar 2015
Rényi Divergence and Kullback-Leibler Divergence
Rényi Divergence and Kullback-Leibler Divergence
T. Erven
P. Harremoes
84
1,340
0
12 Jun 2012
Empirical Bernstein Bounds and Sample Variance Penalization
Empirical Bernstein Bounds and Sample Variance Penalization
Andreas Maurer
Massimiliano Pontil
397
545
0
21 Jul 2009
Exponential inequalities for self-normalized martingales with
  applications
Exponential inequalities for self-normalized martingales with applications
Bernard Bercu
A. Touati
141
124
0
25 Jul 2007
1