Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09514
Cited By
Best Arm Identification for Contaminated Bandits
26 February 2018
Jason M. Altschuler
Victor-Emmanuel Brunel
Alan Malek
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Best Arm Identification for Contaminated Bandits"
15 / 15 papers shown
Title
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
Shubhada Agrawal
Timothée Mathieu
D. Basu
Odalric-Ambrym Maillard
30
2
0
28 Sep 2023
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
44
0
0
27 Aug 2023
Statistically Optimal Robust Mean and Covariance Estimation for Anisotropic Gaussians
A. Minasyan
Nikita Zhivotovskiy
29
9
0
21 Jan 2023
Reward Delay Attacks on Deep Reinforcement Learning
Anindya Sarkar
Jiarui Feng
Yevgeniy Vorobeychik
Christopher Gill
Ning Zhang
AAML
13
6
0
08 Sep 2022
Enforcing Delayed-Impact Fairness Guarantees
Aline Weber
Blossom Metevier
Yuriy Brun
Philip S. Thomas
Bruno Castro da Silva
FaML
30
9
0
24 Aug 2022
Private and Byzantine-Proof Cooperative Decision-Making
Abhimanyu Dubey
Alex Pentland
19
24
0
27 May 2022
Federated Multi-Armed Bandits Under Byzantine Attacks
Artun Saday
Ilker Demirel
Yiğit Yıldırım
Cem Tekin
AAML
37
13
0
09 May 2022
One More Step Towards Reality: Cooperative Bandits with Imperfect Communication
Udari Madhushani
Abhimanyu Dubey
Naomi Ehrich Leonard
Alex Pentland
28
25
0
24 Nov 2021
Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination
Arpan Mukherjee
A. Tajer
Pin-Yu Chen
Payel Das
AAML
FedML
31
9
0
14 Nov 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
36
52
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
27
36
0
18 Apr 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAML
OffRL
44
37
0
16 Feb 2021
Adaptive Reward-Poisoning Attacks against Reinforcement Learning
Xuezhou Zhang
Yuzhe Ma
Adish Singla
Xiaojin Zhu
AAML
29
124
0
27 Mar 2020
Robust Stochastic Bandit Algorithms under Probabilistic Unbounded Adversarial Attack
Ziwei Guan
Kaiyi Ji
Donald J. Bucci
Timothy Y. Hu
J. Palombo
Michael J. Liston
Yingbin Liang
AAML
29
27
0
17 Feb 2020
Better Algorithms for Stochastic Bandits with Adversarial Corruptions
Anupam Gupta
Tomer Koren
Kunal Talwar
AAML
8
151
0
22 Feb 2019
1