ResearchTrend.AI
Improving Offline Contextual Bandits with Distributional Robustness

13 November 2020
Otmane Sakhi
Louis Faury
Flavian Vasile
    OffRL
arXiv: 2011.06835

Papers citing "Improving Offline Contextual Bandits with Distributional Robustness"

4 / 4 papers shown
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
43
1
0
23 May 2024
Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Ramnath Kumar
Kushal Majmundar
Dheeraj M. Nagaraj
A. Suggala
ODL
32
6
0
15 Jun 2023
PAC-Bayesian Offline Contextual Bandits With Guarantees
Otmane Sakhi
Pierre Alquier
Nicolas Chopin
OffRL
29
12
0
24 Oct 2022
Fast Offline Policy Optimization for Large Scale Recommendation
Otmane Sakhi
D. Rohde
Alexandre Gilotte
OffRL
45
3
0
08 Aug 2022