Distributionally Robust Batch Contextual Bandits

10 June 2020
Nian Si
Fan Zhang
Zhengyuan Zhou
Jose H. Blanchet
    OffRL

Papers citing "Distributionally Robust Batch Contextual Bandits"

10 / 10 papers shown
Best Arm Identification with Possibly Biased Offline Data
Le Yang
Vincent Y. F. Tan
Wang Chi Cheung
49
0
0
29 May 2025
Towards Optimal Offline Reinforcement Learning
Mengmeng Li
Daniel Kuhn
Tobias Sutter
OffRL
145
0
0
15 Mar 2025
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang
Hao Wang
Hao Wang
220
0
0
13 Jun 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
107
10
0
04 Apr 2024
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
Jose H. Blanchet
Miao Lu
Tong Zhang
Han Zhong
OffRL
124
32
0
16 May 2023
Policy Learning under Biased Sample Selection
Lihua Lei
Roshni Sahoo
Stefan Wager
179
13
0
23 Apr 2023
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Kaiwen Wang
Nathan Kallus
Wen Sun
161
20
0
07 Feb 2023
Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing
Jingwei Ji
Renyuan Xu
Ruihao Zhu
65
0
0
04 Aug 2022
Towards Robust Off-policy Learning for Runtime Uncertainty
Da Xu
Yuting Ye
Chuanwei Ruan
Bo Yang
OffRL
55
5
0
27 Feb 2022
Robust Bandit Learning with Imperfect Context
Jianyi Yang
Shaolei Ren
79
8
0
09 Feb 2021