Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.01985
Cited By
Jointly Efficient and Optimal Algorithms for Logistic Bandits
6 January 2022
Louis Faury
Marc Abeille
Kwang-Sung Jun
Clément Calauzènes
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Jointly Efficient and Optimal Algorithms for Logistic Bandits"
17 / 17 papers shown
Title
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
139
0
0
04 May 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Long-Fei Li
Yu-Jie Zhang
Peng Zhao
Zhi-Hua Zhou
101
4
0
17 Jan 2025
Near Optimal Pure Exploration in Logistic Bandits
Eduardo Ochoa Rivera
Ambuj Tewari
25
0
0
28 Oct 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
33
4
0
19 Jul 2024
Open Problem: Tight Bounds for Kernelized Multi-Armed Bandits with Bernoulli Rewards
Marco Mussi
Simone Drago
Alberto Maria Metelli
24
1
0
08 Jul 2024
Bandits with Preference Feedback: A Stackelberg Game Perspective
Barna Pásztor
Parnian Kassraie
Andreas Krause
40
2
0
24 Jun 2024
Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Joongkyu Lee
Min-hwan Oh
44
6
0
16 May 2024
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Qiwei Di
Jiafan He
Quanquan Gu
29
1
0
16 Apr 2024
Generalized Linear Bandits with Limited Adaptivity
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
40
3
0
10 Apr 2024
Active Preference Optimization for Sample Efficient RLHF
Nirjhar Das
Souradip Chakraborty
Aldo Pacchiano
Sayak Ray Chowdhury
27
13
0
16 Feb 2024
Exploration via linearly perturbed loss minimisation
David Janz
Shuai Liu
Alex Ayoub
Csaba Szepesvári
16
6
0
13 Nov 2023
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
41
12
0
28 Oct 2023
VITS : Variational Inference Thompson Sampling for contextual bandits
Pierre Clavier
Tom Huix
Alain Durmus
27
3
0
19 Jul 2023
Overcoming Prior Misspecification in Online Learning to Rank
Javad Azizi
Ofer Meshi
M. Zoghi
Maryam Karimzadehgan
17
1
0
25 Jan 2023
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits
Gergely Neu
Julia Olkhovskaya
Matteo Papini
Ludovic Schwartz
33
16
0
27 May 2022
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification
James A. Grant
David S. Leslie
44
3
0
29 Sep 2021
Instance-Wise Minimax-Optimal Algorithms for Logistic Bandits
Marc Abeille
Louis Faury
Clément Calauzènes
96
37
0
23 Oct 2020
1