Introduction to Multi-Armed Bandits

15 April 2019

Papers citing "Introduction to Multi-Armed Bandits"

50 / 164 papers shown

A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits Liu Leqi Giulio Zhou Fatma Kılınç-Karzan Zachary Chase Lipton A. Montgomery 27 2 0 16 Apr 2023
Mixing predictions for online metric algorithms A. Antoniadis Christian Coester Marek Eliáš Adam Polak Bertrand Simon 26 12 0 04 Apr 2023
Thompson Sampling for Linear Bandit Problems with Normal-Gamma Priors Björn Lindenberg Karl-Olof Lindahl 30 0 0 06 Mar 2023
Design-Based Inference for Multi-arm Bandits D. Ham Iavor Bojinov Michael Lindon M. Tingley 34 1 0 27 Feb 2023
Bandit Social Learning: Exploration under Myopic Behavior Kiarash Banihashem Mohammadtaghi Hajiaghayi Suho Shin Aleksandrs Slivkins 21 4 0 15 Feb 2023
Learning in quantum games Kyriakos Lotidis P. Mertikopoulos Nicholas Bambos 24 7 0 05 Feb 2023
Learning with Exposure Constraints in Recommendation Systems Omer Ben-Porat Rotem Torkan 29 12 0 02 Feb 2023
A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback G. Nie Yididiya Y. Nadew Yanhui Zhu Vaneet Aggarwal Christopher J. Quinn OffRL 19 13 0 30 Jan 2023
Smooth Non-Stationary Bandits S. Jia Qian Xie Nathan Kallus P. Frazier 106 9 0 29 Jan 2023
Complexity Analysis of a Countable-armed Bandit Problem Anand Kalvit A. Zeevi 21 3 0 18 Jan 2023
Contextual Bandits and Optimistically Universal Learning Moise Blanchard Steve Hanneke Patrick Jaillet OffRL 28 1 0 31 Dec 2022
Invariant Lipschitz Bandits: A Side Observation Approach Nam-Phuong Tran Long Tran-Thanh 51 1 0 14 Dec 2022
A survey on multi-player bandits Etienne Boursier Vianney Perchet 32 13 0 29 Nov 2022
Incorporating Multi-armed Bandit with Local Search for MaxSAT Jiongzhi Zheng Kun He Jianrong Zhou Yan Jin ChuMin Li F. Manyà 19 1 0 29 Nov 2022
Eluder-based Regret for Stochastic Contextual MDPs Orin Levy Asaf B. Cassel Alon Cohen Yishay Mansour 35 5 0 27 Nov 2022
Distributed Resource Allocation for URLLC in IIoT Scenarios: A Multi-Armed Bandit Approach Francesco Pase M. Giordani Giampaolo Cuozzo Sara Cavallero J. Eichinger Roberto Verdone M. Zorzi 34 9 0 22 Nov 2022
Bandit Algorithms for Prophet Inequality and Pandora's Box Khashayar Gatmiry Thomas Kesselheim Sahil Singla Yuran Wang 31 9 0 16 Nov 2022
Adaptive Data Depth via Multi-Armed Bandits Tavor Z. Baharav T. Lai 23 1 0 08 Nov 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook Baihan Lin OffRL AI4TS 37 27 0 24 Oct 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles Yuxuan Han Jialin Zeng Yang Wang Yangzhen Xiang Jiheng Zhang 59 9 0 21 Oct 2022
Vertical Federated Linear Contextual Bandits Zeyu Cao Zhipeng Liang Shu Zhen Zhang Hang Li Ouyang Wen Yu Rong P. Zhao Bing Wu FedML 32 0 0 20 Oct 2022
Product Ranking for Revenue Maximization with Multiple Purchases Renzhe Xu Xingxuan Zhang Yangqiu Song Yafeng Zhang Xiaolong Chen Peng Cui 11 2 0 15 Oct 2022
Neuro-symbolic Explainable Artificial Intelligence Twin for Zero-touch IoE in Wireless Network M. S. Munir Ki Tae Kim Apurba Adhikary Walid Saad Sachin Shetty Seong-Bae Park Choong Seon Hong 30 20 0 13 Oct 2022
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits Siddhartha Banerjee Sean R. Sinclair Milind Tambe Lily Xu Chao Yu AI4TS 33 6 0 30 Sep 2022
Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem Raunak Kumar Robert D. Kleinberg 23 13 0 24 Sep 2022
Multi-armed Bandit Learning on a Graph Tianpeng Zhang Kasper Johansson Na Li 33 6 0 20 Sep 2022
$MC^2$: Rigorous and Efficient Directed Greybox Fuzzing Abhishek Shah Dongdong She Samanway Sadhu Krishma Singal Peter Coffman Suman Jana 28 23 0 30 Aug 2022
Learning in Stackelberg Games with Non-myopic Agents Nika Haghtalab Thodoris Lykouris Sloan Nietert Alexander Wei 28 29 0 19 Aug 2022
Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits Bo Li C. Yeung 24 0 0 11 Aug 2022
Online Prediction in Sub-linear Space Binghui Peng Fred Zhang 25 16 0 16 Jul 2022
Hindsight Learning for MDPs with Exogenous Inputs Sean R. Sinclair Felipe Vieira Frujeri Ching-An Cheng Luke Marshall Hugo Barbalho Jingling Li Jennifer Neville Ishai Menache Adith Swaminathan 18 23 0 13 Jul 2022
Differentially Private Linear Bandits with Partial Distributed Feedback Fengjiao Li Xingyu Zhou Bo Ji FedML 36 13 0 12 Jul 2022
Autonomous Drug Design with Multi-Armed Bandits Hampus Gummesson Svensson E. Bjerrum C. Tyrchan Ola Engkvist M. Chehreghani 37 5 0 04 Jul 2022
Online Resource Allocation under Horizon Uncertainty S. Balseiro Christian Kroer Rachitesh Kumar 29 15 0 27 Jun 2022
Differentially Private Federated Combinatorial Bandits with Constraints Sambhav Solanki Samhita Kanaparthy Sankarshan Damle Sujit Gujar FedML 31 4 0 27 Jun 2022
Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms Xuchuang Wang Hong Xie John C. S. Lui 30 6 0 17 Jun 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization Quan-Wu Xiao Qing Ling Tianyi Chen 46 0 0 14 Jun 2022
Decentralized, Communication- and Coordination-free Learning in Structured Matching Markets C. Maheshwari Eric Mazumdar S. Shankar Sastry 21 11 0 06 Jun 2022
Indirect Active Learning Shashank Singh 21 0 0 03 Jun 2022
Contextual Bandits with Knapsacks for a Conversion Model Zerui Li Gilles Stoltz 68 3 0 01 Jun 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets Avishek Ghosh Abishek Sankararaman Kannan Ramchandran T. Javidi A. Mazumdar 33 4 0 31 May 2022
Survey on Fair Reinforcement Learning: Theory and Practice Pratik Gajane A. Saxena M. Tavakol George Fletcher Mykola Pechenizkiy FaML OffRL 40 13 0 20 May 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps Jinglin Chen Nan Jiang OffRL 23 34 0 25 Mar 2022
Modeling Attrition in Recommender Systems with Departing Bandits Omer Ben-Porat Lee Cohen Liu Leqi Zachary Chase Lipton Yishay Mansour 21 11 0 25 Mar 2022
Universal Regression with Adversarial Responses Moise Blanchard Patrick Jaillet 29 6 0 09 Mar 2022
Safe Exploration for Efficient Policy Evaluation and Comparison Runzhe Wan Branislav Kveton Rui Song OffRL 36 10 0 26 Feb 2022
Communicating Robot Conventions through Shared Autonomy Ananth Jonnavittula Dylan P. Losey 28 7 0 22 Feb 2022
No-Regret Learning in Partially-Informed Auctions Wenshuo Guo Michael I. Jordan Ellen Vitercik 18 9 0 22 Feb 2022
Learning Revenue Maximization using Posted Prices for Stochastic Strategic Patient Buyers Eitan-Hai Mashiah Idan Attias Yishay Mansour 17 1 0 12 Feb 2022
Online V2X Scheduling for Raw-Level Cooperative Perception Yukuan Jia Ruiqing Mao Yuxuan Sun Sheng Zhou Z. Niu 30 8 0 12 Feb 2022