Jointly Efficient and Optimal Algorithms for Logistic Bandits

6 January 2022

Clément Calauzènes

Papers citing "Jointly Efficient and Optimal Algorithms for Logistic Bandits"

17 / 17 papers shown

Title
Neural Logistic Bandits; Seoungbin Bae, Dabeen Lee; 139 0 0; 04 May 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation; Long-Fei Li, Yu-Jie Zhang, Peng Zhao, Zhi-Hua Zhou; 101 4 0; 17 Jan 2025
Near Optimal Pure Exploration in Logistic Bandits; Eduardo Ochoa Rivera, Ambuj Tewari; 25 0 0; 28 Oct 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits; Junghyun Lee, Se-Young Yun, Kwang-Sung Jun; 33 4 0; 19 Jul 2024
Open Problem: Tight Bounds for Kernelized Multi-Armed Bandits with Bernoulli Rewards; Marco Mussi, Simone Drago, Alberto Maria Metelli; 24 1 0; 08 Jul 2024
Bandits with Preference Feedback: A Stackelberg Game Perspective; Barna Pásztor, Parnian Kassraie, Andreas Krause; 40 2 0; 24 Jun 2024
Nearly Minimax Optimal Regret for Multinomial Logistic Bandit; Joongkyu Lee, Min-hwan Oh; 44 6 0; 16 May 2024
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback; Qiwei Di, Jiafan He, Quanquan Gu; 29 1 0; 16 Apr 2024
Generalized Linear Bandits with Limited Adaptivity; Ayush Sawarni, Nirjhar Das, Siddharth Barman, Gaurav Sinha; 40 3 0; 10 Apr 2024
Active Preference Optimization for Sample Efficient RLHF; Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury; 27 13 0; 16 Feb 2024
Exploration via linearly perturbed loss minimisation; David Janz, Shuai Liu, Alex Ayoub, Csaba Szepesvári; 16 6 0; 13 Nov 2023
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion; Junghyun Lee, Se-Young Yun, Kwang-Sung Jun; 41 12 0; 28 Oct 2023
VITS: Variational Inference Thompson Sampling for contextual bandits; Pierre Clavier, Tom Huix, Alain Durmus; 27 3 0; 19 Jul 2023
Overcoming Prior Misspecification in Online Learning to Rank; Javad Azizi, Ofer Meshi, M. Zoghi, Maryam Karimzadehgan; 17 1 0; 25 Jan 2023
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits; Gergely Neu, Julia Olkhovskaya, Matteo Papini, Ludovic Schwartz; 33 16 0; 27 May 2022
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification; James A. Grant, David S. Leslie; 44 3 0; 29 Sep 2021
Instance-Wise Minimax-Optimal Algorithms for Logistic Bandits; Marc Abeille, Louis Faury, Clément Calauzènes; 96 37 0; 23 Oct 2020