Bandits with heavy tail

8 September 2012

Gábor Lugosi

Papers citing "Bandits with heavy tail"

50 / 54 papers shown

Title
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards Chenlu Ye Yujia Jin Alekh Agarwal Tong Zhang 112 0 0 04 Feb 2025
From Gradient Clipping to Normalization for Heavy Tailed SGD Florian Hübler Ilyas Fatkhullin Niao He 45 5 0 17 Oct 2024
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs Yu Chen Jiatai Huang Yan Dai Longbo Huang 34 0 0 04 Oct 2024
On Lai's Upper Confidence Bound in Multi-Armed Bandits Huachen Ren Cun-Hui Zhang 34 1 0 03 Oct 2024
Reinforcement Learning and Regret Bounds for Admission Control Lucas Weber A. Busic Jiamin Zhu 35 0 0 07 Jun 2024
Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance Wenqi Wei Ling Liu 36 16 0 02 Feb 2024
Zero-Inflated Bandits Haoyu Wei Runzhe Wan Lei Shi Rui Song 46 0 0 25 Dec 2023
Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+α$ Moments Trung Dang Jasper C. H. Lee Maoyuan Song Paul Valiant 27 1 0 21 Nov 2023
Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards Semih Cayci A. Eryilmaz 31 2 0 20 Jun 2023
On Private and Robust Bandits Yulian Wu Xingyu Zhou Youming Tao Di Wang 28 5 0 06 Feb 2023
On deviation probabilities in non-parametric regression Anna Ben-Hamou A. Guyader 37 1 0 25 Jan 2023
Quantum Heavy-tailed Bandits Yulian Wu Chaowen Guan Vaneet Aggarwal Di Wang 20 5 0 23 Jan 2023
Materials Discovery using Max K-Armed Bandit N. Kikkawa H. Ohno 30 4 0 16 Dec 2022
On Medians of (Randomized) Pairwise Means Pierre Laforgue Stéphan Clémençon Patrice Bertail 29 12 0 01 Nov 2022
Private and Byzantine-Proof Cooperative Decision-Making Abhimanyu Dubey Alex Pentland 19 24 0 27 May 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes Xuefeng Gao X. Zhou 39 8 0 23 May 2022
Federated Multi-Armed Bandits Under Byzantine Attacks Artun Saday Ilker Demirel Yiğit Yıldırım Cem Tekin AAML 37 13 0 09 May 2022
Efficient Algorithms for Extreme Bandits Dorian Baudry Yoan Russac E. Kaufmann 19 3 0 21 Mar 2022
Approximate Function Evaluation via Multi-Armed Bandits Tavor Z. Baharav Gary Cheng Mert Pilanci David Tse 27 6 0 18 Mar 2022
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits Jiatai Huang Yan Dai Longbo Huang 27 14 0 28 Jan 2022
Robust parameter estimation of regression model under weakened moment assumptions Kangqiang Li Songqiao Tang Lixin Zhang 31 0 0 08 Dec 2021
Uniform Concentration Bounds toward a Unified Framework for Robust Clustering Debolina Paul Saptarshi Chakraborty Swagatam Das Jason Xu 22 16 0 27 Oct 2021
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs Han Zhong Jiayi Huang Lin F. Yang Liwei Wang 27 7 0 26 Oct 2021
Extreme Bandits using Robust Statistics Sujay Bhatt Ping Li G. Samorodnitsky 30 7 0 09 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits Sattar Vakili N. Bouziani Sepehr Jalali A. Bernacchia Da-Shan Shiu 39 51 0 20 Aug 2021
Fast Federated Learning in the Presence of Arbitrary Device Unavailability Xinran Gu Kaixuan Huang Jingzhao Zhang Longbo Huang FedML 35 96 0 08 Jun 2021
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling N. Hamidi Mohsen Bayati 22 1 0 16 Feb 2021
Local Differential Privacy for Bayesian Optimization Xingyu Zhou Jian Tan 22 24 0 13 Oct 2020
$A generalized Catoni's ${\rm M}$-estimator under finite {$α$-th moment assumption} with $α\in (1,2)$$ A generalized Catoni's ${\rm M}$ -estimator under finite { $α$ -th moment assumption} with $α\in (1,2)$ Peng Chen Xinghu Jin Xiang Li Lihu Xu 24 24 0 10 Oct 2020
Generalization Bounds in the Presence of Outliers: a Median-of-Means Study Pierre Laforgue Guillaume Staerman Stéphan Clémençon 16 3 0 09 Jun 2020
Online Learning and Optimization for Revenue Management Problems with Add-on Discounts D. Simchi-Levi Rui Sun Huanan Zhang 16 11 0 02 May 2020
Robust subgaussian estimation with VC-dimension Jules Depersin 27 12 0 24 Apr 2020
Budget-Constrained Bandits over General Cost and Reward Distributions Semih Cayci A. Eryilmaz R. Srikant 6 31 0 29 Feb 2020
Robust Stochastic Bandit Algorithms under Probabilistic Unbounded Adversarial Attack Ziwei Guan Kaiyi Ji Donald J. Bucci Timothy Y. Hu J. Palombo Michael J. Liston Yingbin Liang AAML 29 27 0 17 Feb 2020
Multi-Armed Bandits with Correlated Arms Samarth Gupta Shreyas Chaudhari Gauri Joshi Osman Yağan 22 50 0 06 Nov 2019
Restless dependent bandits with fading memory O. Zadorozhnyi Gilles Blanchard Alexandra Carpentier 16 0 0 25 Jun 2019
Robust subgaussian estimation of a mean vector in nearly linear time Jules Depersin Guillaume Lecué 21 92 0 07 Jun 2019
Feedback graph regret bounds for Thompson Sampling and UCB Thodoris Lykouris Éva Tardos Drishti Wali 16 29 0 23 May 2019
Robust Inference via Multiplier Bootstrap Xi Chen Wen-Xin Zhou 27 31 0 18 Mar 2019
Better Algorithms for Stochastic Bandits with Adversarial Corruptions Anupam Gupta Tomer Koren Kunal Talwar AAML 8 151 0 22 Feb 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching Tom Zahavy Shie Mannor HAI 36 30 0 24 Jan 2019
Stochastic Gradient Descent on a Tree: an Adaptive and Robust Approach to Stochastic Convex Optimization Sattar Vakili Sudeep Salgia Qing Zhao 25 7 0 17 Jan 2019
Solvable Integration Problems and Optimal Sample Size Selection R. Kunsch E. Novak Daniel Rudolf 25 15 0 22 May 2018
Stochastic bandits robust to adversarial corruptions Thodoris Lykouris Vahab Mirrokni R. Leme AAML 19 202 0 25 Mar 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator Stephen Tu Benjamin Recht OffRL 37 130 0 22 Dec 2017
Max K-armed bandit: On the ExtremeHunter algorithm and beyond Mastane Achab Stéphan Clémençon Aurélien Garivier Anne Sabourin Claire Vernade 25 60 0 27 Jul 2017
Convergence rates of least squares regression estimators with heavy-tailed errors Q. Han J. Wellner 19 45 0 07 Jun 2017
Combinatorial Multi-Armed Bandits with Filtered Feedback James A. Grant David S. Leslie K. Glazebrook R. Szechtman 40 1 0 26 May 2017
On the estimation of the mean of a random vector Émilien Joly Gábor Lugosi R. Oliveira 22 39 0 19 Jul 2016
Risk-Averse Multi-Armed Bandit Problems under Mean-Variance Measure Sattar Vakili Qing Zhao 21 88 0 18 Apr 2016