1209.1727
Cited By
Bandits with heavy tail
8 September 2012
Sébastien Bubeck
Nicolò Cesa-Bianchi
Gábor Lugosi
Papers citing "Bandits with heavy tail"
50 / 54 papers shown
Title
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards
Chenlu Ye
Yujia Jin
Alekh Agarwal
Tong Zhang
112
0
0
04 Feb 2025
From Gradient Clipping to Normalization for Heavy Tailed SGD
Florian Hübler
Ilyas Fatkhullin
Niao He
45
5
0
17 Oct 2024
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
Yu Chen
Jiatai Huang
Yan Dai
Longbo Huang
34
0
0
04 Oct 2024
On Lai's Upper Confidence Bound in Multi-Armed Bandits
Huachen Ren
Cun-Hui Zhang
34
1
0
03 Oct 2024
Reinforcement Learning and Regret Bounds for Admission Control
Lucas Weber
A. Busic
Jiamin Zhu
35
0
0
07 Jun 2024
Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance
Wenqi Wei
Ling Liu
36
16
0
02 Feb 2024
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
46
0
0
25 Dec 2023
Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+\alpha$ Moments
Trung Dang
Jasper C. H. Lee
Maoyuan Song
Paul Valiant
27
1
0
21 Nov 2023
Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards
Semih Cayci
A. Eryilmaz
31
2
0
20 Jun 2023
On Private and Robust Bandits
Yulian Wu
Xingyu Zhou
Youming Tao
Di Wang
28
5
0
06 Feb 2023
On deviation probabilities in non-parametric regression
Anna Ben-Hamou
A. Guyader
37
1
0
25 Jan 2023
Quantum Heavy-tailed Bandits
Yulian Wu
Chaowen Guan
Vaneet Aggarwal
Di Wang
20
5
0
23 Jan 2023
Materials Discovery using Max K-Armed Bandit
N. Kikkawa
H. Ohno
30
4
0
16 Dec 2022
On Medians of (Randomized) Pairwise Means
Pierre Laforgue
Stéphan Clémençon
Patrice Bertail
29
12
0
01 Nov 2022
Private and Byzantine-Proof Cooperative Decision-Making
Abhimanyu Dubey
Alex Pentland
19
24
0
27 May 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
Xuefeng Gao
X. Zhou
39
8
0
23 May 2022
Federated Multi-Armed Bandits Under Byzantine Attacks
Artun Saday
Ilker Demirel
Yiğit Yıldırım
Cem Tekin
AAML
37
13
0
09 May 2022
Efficient Algorithms for Extreme Bandits
Dorian Baudry
Yoan Russac
E. Kaufmann
19
3
0
21 Mar 2022
Approximate Function Evaluation via Multi-Armed Bandits
Tavor Z. Baharav
Gary Cheng
Mert Pilanci
David Tse
27
6
0
18 Mar 2022
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits
Jiatai Huang
Yan Dai
Longbo Huang
27
14
0
28 Jan 2022
Robust parameter estimation of regression model under weakened moment assumptions
Kangqiang Li
Songqiao Tang
Lixin Zhang
31
0
0
08 Dec 2021
Uniform Concentration Bounds toward a Unified Framework for Robust Clustering
Debolina Paul
Saptarshi Chakraborty
Swagatam Das
Jason Xu
22
16
0
27 Oct 2021
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs
Han Zhong
Jiayi Huang
Lin F. Yang
Liwei Wang
27
7
0
26 Oct 2021
Extreme Bandits using Robust Statistics
Sujay Bhatt
Ping Li
G. Samorodnitsky
30
7
0
09 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-Shan Shiu
39
51
0
20 Aug 2021
Fast Federated Learning in the Presence of Arbitrary Device Unavailability
Xinran Gu
Kaixuan Huang
Jingzhao Zhang
Longbo Huang
FedML
35
96
0
08 Jun 2021
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling
N. Hamidi
Mohsen Bayati
22
1
0
16 Feb 2021
Local Differential Privacy for Bayesian Optimization
Xingyu Zhou
Jian Tan
22
24
0
13 Oct 2020
A generalized Catoni's M-estimator under finite $\alpha$-th moment assumption with $\alpha \in (1,2)$
Peng Chen
Xinghu Jin
Xiang Li
Lihu Xu
24
24
0
10 Oct 2020
Generalization Bounds in the Presence of Outliers: a Median-of-Means Study
Pierre Laforgue
Guillaume Staerman
Stéphan Clémençon
16
3
0
09 Jun 2020
Online Learning and Optimization for Revenue Management Problems with Add-on Discounts
D. Simchi-Levi
Rui Sun
Huanan Zhang
16
11
0
02 May 2020
Robust subgaussian estimation with VC-dimension
Jules Depersin
27
12
0
24 Apr 2020
Budget-Constrained Bandits over General Cost and Reward Distributions
Semih Cayci
A. Eryilmaz
R. Srikant
6
31
0
29 Feb 2020
Robust Stochastic Bandit Algorithms under Probabilistic Unbounded Adversarial Attack
Ziwei Guan
Kaiyi Ji
Donald J. Bucci
Timothy Y. Hu
J. Palombo
Michael J. Liston
Yingbin Liang
AAML
29
27
0
17 Feb 2020
Multi-Armed Bandits with Correlated Arms
Samarth Gupta
Shreyas Chaudhari
Gauri Joshi
Osman Yağan
22
50
0
06 Nov 2019
Restless dependent bandits with fading memory
O. Zadorozhnyi
Gilles Blanchard
Alexandra Carpentier
16
0
0
25 Jun 2019
Robust subgaussian estimation of a mean vector in nearly linear time
Jules Depersin
Guillaume Lecué
21
92
0
07 Jun 2019
Feedback graph regret bounds for Thompson Sampling and UCB
Thodoris Lykouris
Éva Tardos
Drishti Wali
16
29
0
23 May 2019
Robust Inference via Multiplier Bootstrap
Xi Chen
Wen-Xin Zhou
27
31
0
18 Mar 2019
Better Algorithms for Stochastic Bandits with Adversarial Corruptions
Anupam Gupta
Tomer Koren
Kunal Talwar
AAML
8
151
0
22 Feb 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Tom Zahavy
Shie Mannor
HAI
36
30
0
24 Jan 2019
Stochastic Gradient Descent on a Tree: an Adaptive and Robust Approach to Stochastic Convex Optimization
Sattar Vakili
Sudeep Salgia
Qing Zhao
25
7
0
17 Jan 2019
Solvable Integration Problems and Optimal Sample Size Selection
R. Kunsch
E. Novak
Daniel Rudolf
25
15
0
22 May 2018
Stochastic bandits robust to adversarial corruptions
Thodoris Lykouris
Vahab Mirrokni
R. Leme
AAML
19
202
0
25 Mar 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
37
130
0
22 Dec 2017
Max K-armed bandit: On the ExtremeHunter algorithm and beyond
Mastane Achab
Stéphan Clémençon
Aurélien Garivier
Anne Sabourin
Claire Vernade
25
60
0
27 Jul 2017
Convergence rates of least squares regression estimators with heavy-tailed errors
Q. Han
J. Wellner
19
45
0
07 Jun 2017
Combinatorial Multi-Armed Bandits with Filtered Feedback
James A. Grant
David S. Leslie
K. Glazebrook
R. Szechtman
40
1
0
26 May 2017
On the estimation of the mean of a random vector
Émilien Joly
Gábor Lugosi
R. Oliveira
22
39
0
19 Jul 2016
Risk-Averse Multi-Armed Bandit Problems under Mean-Variance Measure
Sattar Vakili
Qing Zhao
21
88
0
18 Apr 2016