Model Selection in Contextual Stochastic Bandit Problems

3 March 2020

Papers citing "Model Selection in Contextual Stochastic Bandit Problems"

20 / 20 papers shown

Title
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals Ziyi Liu Idan Attias Daniel M. Roy CML 29 0 0 01 Jul 2024
Budgeted Online Model Selection and Fine-Tuning via Federated Learning P. M. Ghari Yanning Shen FedML 46 1 0 19 Jan 2024
Anytime Model Selection in Linear Bandits Parnian Kassraie N. Emmenegger Andreas Krause Aldo Pacchiano 46 2 0 24 Jul 2023
Active Policy Improvement from Multiple Black-box Oracles Xuefeng Liu Takuma Yoneda Chaoqi Wang Matthew R. Walter Yuxin Chen 33 8 0 17 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions Yue Kang Cho-Jui Hsieh T. C. Lee AAML 30 8 0 29 May 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits Jonathan Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill 28 0 0 19 Feb 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 22 3 0 16 Feb 2023
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms Osama A. Hanna Lin F. Yang Christina Fragouli 27 11 0 08 Nov 2022
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees Andrea Tirinzoni Matteo Papini Ahmed Touati A. Lazaric Matteo Pirotta 28 4 0 24 Oct 2022
Best of Both Worlds Model Selection Aldo Pacchiano Christoph Dann Claudio Gentile 26 10 0 29 Jun 2022
Adversarial Bandits against Arbitrary Strategies Jung-hun Kim Se-Young Yun 49 0 0 30 May 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits Haipeng Luo Mengxiao Zhang Peng Zhao Zhi-Hua Zhou 31 17 0 12 Feb 2022
Misspecified Gaussian Process Bandit Optimization Ilija Bogunovic Andreas Krause 55 42 0 09 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification Clémence Réda Andrea Tirinzoni Rémy Degenne 25 9 0 02 Nov 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits Yinglun Zhu Julian Katz-Samuels Robert D. Nowak 32 6 0 10 Sep 2021
Neural Active Learning with Performance Guarantees Pranjal Awasthi Christoph Dann Claudio Gentile Ayush Sekhari Zhilei Wang 26 22 0 06 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits Matteo Papini Andrea Tirinzoni Marcello Restelli A. Lazaric Matteo Pirotta 24 26 0 08 Apr 2021
Policy Optimization as Online Learning with Mediator Feedback Alberto Maria Metelli Matteo Papini P. DÓro Marcello Restelli OffRL 27 10 0 15 Dec 2020
Regret Balancing for Bandit and RL Model Selection Yasin Abbasi-Yadkori Aldo Pacchiano My Phan 18 26 0 09 Jun 2020
Rate-adaptive model selection over a collection of black-box contextual bandit algorithms Aurélien F. Bibaut Antoine Chambaz Mark van der Laan 24 6 0 05 Jun 2020