Model selection for contextual bandits

3 June 2019

Papers citing "Model selection for contextual bandits"

Title
Budgeted Online Model Selection and Fine-Tuning via Federated Learning P. M. Ghari Yanning Shen FedML 46 1 0 19 Jan 2024
Anytime Model Selection in Linear Bandits Parnian Kassraie N. Emmenegger Andreas Krause Aldo Pacchiano 46 2 0 24 Jul 2023
High-dimensional Contextual Bandit Problem without Sparsity Junpei Komiyama Masaaki Imaizumi 29 0 0 19 Jun 2023
Active Policy Improvement from Multiple Black-box Oracles Xuefeng Liu Takuma Yoneda Chaoqi Wang Matthew R. Walter Yuxin Chen 33 8 0 17 Jun 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting Qiyang Li Aviral Kumar Ilya Kostrikov Sergey Levine OffRL 24 31 0 20 Apr 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits Jonathan Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill 25 0 0 19 Feb 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 22 3 0 16 Feb 2023
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees Andrea Tirinzoni Matteo Papini Ahmed Touati A. Lazaric Matteo Pirotta 28 4 0 24 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference Debangshu Banerjee Avishek Ghosh Sayak Ray Chowdhury Aditya Gopalan 22 9 0 23 Jul 2022
Best of Both Worlds Model Selection Aldo Pacchiano Christoph Dann Claudio Gentile 26 10 0 29 Jun 2022
Breaking the $\sqrt{T}$ Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits Avishek Ghosh Abishek Sankararaman 27 3 0 19 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Aldo G. Carranza Sanath Kumar Krishnamurthy Susan Athey 16 1 0 30 Mar 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits Haipeng Luo Mengxiao Zhang Peng Zhao Zhi-Hua Zhou 28 17 0 12 Feb 2022
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits Yinglun Zhu Julian Katz-Samuels Robert D. Nowak 30 6 0 10 Sep 2021
Leveraging Good Representations in Linear Contextual Bandits Matteo Papini Andrea Tirinzoni Marcello Restelli A. Lazaric Matteo Pirotta 21 26 0 08 Apr 2021
Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition Liyu Chen Haipeng Luo Chen-Yu Wei 21 32 0 07 Dec 2020
Regret Balancing for Bandit and RL Model Selection Yasin Abbasi-Yadkori Aldo Pacchiano My Phan 16 26 0 09 Jun 2020
Rate-adaptive model selection over a collection of black-box contextual bandit algorithms Aurélien F. Bibaut Antoine Chambaz Mark van der Laan 24 6 0 05 Jun 2020
Learning to Optimize under Non-Stationarity Wang Chi Cheung D. Simchi-Levi Ruihao Zhu 20 132 0 06 Oct 2018
Estimating Learnability in the Sublinear Data Regime Weihao Kong Gregory Valiant 43 29 0 04 May 2018