Best Arm Identification for Contaminated Bandits

26 February 2018

Papers citing "Best Arm Identification for Contaminated Bandits"

15 / 15 papers shown

Title
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption Shubhada Agrawal Timothée Mathieu D. Basu Odalric-Ambrym Maillard 30 2 0 28 Sep 2023
Distributional Off-Policy Evaluation for Slate Recommendations Shreyas Chaudhari David Arbour Georgios Theocharous N. Vlassis OffRL 44 0 0 27 Aug 2023
Statistically Optimal Robust Mean and Covariance Estimation for Anisotropic Gaussians A. Minasyan Nikita Zhivotovskiy 29 9 0 21 Jan 2023
Reward Delay Attacks on Deep Reinforcement Learning Anindya Sarkar Jiarui Feng Yevgeniy Vorobeychik Christopher Gill Ning Zhang AAML 13 6 0 08 Sep 2022
Enforcing Delayed-Impact Fairness Guarantees Aline Weber Blossom Metevier Yuriy Brun Philip S. Thomas Bruno Castro da Silva FaML 30 9 0 24 Aug 2022
Private and Byzantine-Proof Cooperative Decision-Making Abhimanyu Dubey Alex Pentland 19 24 0 27 May 2022
Federated Multi-Armed Bandits Under Byzantine Attacks Artun Saday Ilker Demirel Yiğit Yıldırım Cem Tekin AAML 37 13 0 09 May 2022
One More Step Towards Reality: Cooperative Bandits with Imperfect Communication Udari Madhushani Abhimanyu Dubey Naomi Ehrich Leonard Alex Pentland 28 25 0 24 Nov 2021
Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination Arpan Mukherjee A. Tajer Pin-Yu Chen Payel Das AAML FedML 31 9 0 14 Nov 2021
Universal Off-Policy Evaluation Yash Chandak S. Niekum Bruno C. da Silva Erik Learned-Miller Emma Brunskill Philip S. Thomas OffRL ELM 36 52 0 26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits Audrey Huang Liu Leqi Zachary Chase Lipton Kamyar Azizzadenesheli OffRL 27 36 0 18 Apr 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments Amin Rakhsha Xuezhou Zhang Xiaojin Zhu Adish Singla AAML OffRL 44 37 0 16 Feb 2021
Adaptive Reward-Poisoning Attacks against Reinforcement Learning Xuezhou Zhang Yuzhe Ma Adish Singla Xiaojin Zhu AAML 29 124 0 27 Mar 2020
Robust Stochastic Bandit Algorithms under Probabilistic Unbounded Adversarial Attack Ziwei Guan Kaiyi Ji Donald J. Bucci Timothy Y. Hu J. Palombo Michael J. Liston Yingbin Liang AAML 29 27 0 17 Feb 2020
Better Algorithms for Stochastic Bandits with Adversarial Corruptions Anupam Gupta Tomer Koren Kunal Talwar AAML 8 151 0 22 Feb 2019