Introduction to Multi-Armed Bandits

15 April 2019

Papers citing "Introduction to Multi-Armed Bandits"

50 / 163 papers shown

Title
Robust Online Learning with Private Information Kyohei Okumura 51 0 0 08 May 2025
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy Aditya Vatsavai Ganesh Narasimha Yongtao Liu Jan-Chi Yang Hiroshu Funakubo M. Ziatdinov Rama K Vasudevan 19 0 0 28 Apr 2025
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents Raghav Thind Youran Sun Ling Liang Haizhao Yang LLMAG 40 0 0 23 Apr 2025
Evolution of Optimization Algorithms for Global Placement via Large Language Models Xufeng Yao Jiaxi Jiang Yuxuan Zhao Peiyu Liao Yibo Lin Bei Yu 68 0 0 18 Apr 2025
A New Benchmark for Online Learning with Budget-Balancing Constraints M. Braverman Jingyi Liu Jieming Mao Jon Schneider Eric Xue 60 0 0 19 Mar 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 86 1 0 06 Mar 2025
A Theoretical Model for Grit in Pursuing Ambitious Ends Avrim Blum Emily Diana Kavya Ravichandran A. Tolbert 42 0 0 04 Mar 2025
Functional multi-armed bandit and the best function identification problems Yuriy Dorn Aleksandr Katrutsa Ilgam Latypov Anastasiia Soboleva 32 0 0 01 Mar 2025
Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness Piyushi Manupriya Himanshu S. Jagarlapudi Ganesh Ghalme FaML 54 0 0 24 Feb 2025
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context Jianyu Xu Qiuzhuang Sun Yang Yang Huadong Mo Daoyi Dong 83 0 0 24 Feb 2025
AI-Assisted Decision Making with Human Learning Gali Noti Kate Donahue Jon M. Kleinberg Sigal Oren 138 0 0 18 Feb 2025
Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners David Easley Yoav Kolumbus Éva Tardos 92 0 0 12 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits Joe Suk Jung-hun Kim 60 0 0 31 Jan 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization Zishun Yu Tengyu Xu Di Jin Karthik Abinav Sankararaman Yun He ... Eryk Helenowski Chen Zhu Sinong Wang Hao Ma Han Fang LRM 56 5 0 29 Jan 2025
Fuzzing at Scale: The Untold Story of the Scheduler Ivica Nikolić Racchit Jain 74 0 0 28 Jan 2025
Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization Xu Yang Rui Wang Kaiwen Li Ling Wang 56 0 0 22 Jan 2025
Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration Zuyuan Zhang Vaneet Aggarwal Tian-Shing Lan DiffM 42 0 0 10 Jan 2025
Online Joint Assortment-Inventory Optimization under MNL Choices Yong Liang Xiaojie Mao Shiyuan Wang 53 0 0 03 Jan 2025
HR-Bandit: Human-AI Collaborated Linear Recourse Bandit Junyu Cao Ruijiang Gao Esmaeil Keyvanshokooh 42 1 0 18 Oct 2024
AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments Till Raphael Saenger Musashi Hinck Justin Grimmer Brandon M Stewart 38 2 0 11 Oct 2024
Accurate and Regret-aware Numerical Problem Solver for Tabular Question Answering Yuxiang Wang Jianzhong Qi Junhao Gan LMTD 53 2 0 10 Oct 2024
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs Yu Chen Jiatai Huang Yan Dai Longbo Huang 34 0 0 04 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Arun Verma Zhongxiang Dai Xiaoqiang Lin Patrick Jaillet K. H. Low 39 5 0 24 Jul 2024
Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality Antoine Scheid Aymeric Capitaine Etienne Boursier Eric Moulines Michael I. Jordan Alain Durmus 47 2 0 28 Jun 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond Xutong Liu Siwei Wang Jinhang Zuo Han Zhong Xuchuang Wang Zhiyong Wang Shuai Li Mohammad Hajiesmaili J. C. Lui Wei Chen 85 1 0 03 Jun 2024
Paying to Do Better: Games with Payments between Learning Agents Y. Kolumbus Joe Halpern Éva Tardos 32 1 0 31 May 2024
Federated Combinatorial Multi-Agent Multi-Armed Bandits Fares Fourati Mohamed-Slim Alouini Vaneet Aggarwal FedML 33 5 0 09 May 2024
Batched Stochastic Bandit for Nondegenerate Functions Yu Liu Yunlu Shu Tianyu Wang 52 0 0 09 May 2024
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning Minh Nguyen Chandrajit Bajaj 25 0 0 24 Apr 2024
Adaptive Memory Replay for Continual Learning James Seale Smith Lazar Valkov Shaunak Halbe V. Gutta Rogerio Feris Z. Kira Leonid Karlinsky 49 6 0 18 Apr 2024
Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections Zeng Peng Xiao Zhou Lei Zheng Yubin Wang Jun Ma 71 4 0 20 Mar 2024
Misalignment, Learning, and Ranking: Harnessing Users Limited Attention Arpit Agarwal Rad Niazadeh Prathamesh Patil 34 0 0 21 Feb 2024
Incentivized Exploration via Filtered Posterior Sampling Anand Kalvit Aleksandrs Slivkins Yonatan Gur 29 1 0 20 Feb 2024
Trust Regions for Explanations via Black-Box Probabilistic Certification Amit Dhurandhar Swagatam Haldar Dennis L. Wei Karthikeyan N. Ramamurthy FAtt 42 2 0 17 Feb 2024
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis Phevos Paschalidis Runyu Zhang Na Li 33 0 0 18 Jan 2024
Multi-Agent Join Vahid Ghadakchi Mian Xie Arash Termehchy Bakhtiyar Doskenov Bharghav Srikhakollu Summit Haque Huazheng Wang 20 0 0 21 Dec 2023
Active teacher selection for reinforcement learning from human feedback Rachel Freedman Justin Svegliato K. H. Wray Stuart J. Russell 31 6 0 23 Oct 2023
Adaptive maximization of social welfare Nicolò Cesa-Bianchi Roberto Colomboni Maximilian Kasy 30 0 0 14 Oct 2023
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Arman Rahbar Niklas Åkerblom M. Chehreghani 28 0 0 21 Aug 2023
Consensus-based Participatory Budgeting for Legitimacy: Decision Support via Multi-agent Reinforcement Learning Srijoni Majumdar Evangelos Pournaras 16 3 0 24 Jul 2023
Approximate information for efficient exploration-exploitation strategies A. Barbier–Chebbah Christian L. Vestergaard Jean-Baptiste Masson 29 2 0 04 Jul 2023
Allocating Divisible Resources on Arms with Unknown and Random Rewards Ningyuan Chen Wenhao Li 24 0 0 28 Jun 2023
Trading-off price for data quality to achieve fair online allocation M. Molina Nicolas Gast P. Loiseau Vianney Perchet 32 4 0 23 Jun 2023
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards Mengfan Xu Diego Klabjan 37 6 0 08 Jun 2023
Incentivizing Exploration with Linear Contexts and Combinatorial Actions Mark Sellke 29 3 0 03 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions Yue Kang Cho-Jui Hsieh T. C. Lee AAML 30 8 0 29 May 2023
Green Runner: A tool for efficient model selection from model repositories Jai Kannan Scott Barnett Anj Simmons Taylan Selvi Luís Cruz 30 1 0 26 May 2023
Theoretically Principled Federated Learning for Balancing Privacy and Utility Xiaojin Zhang Wenjie Li Kai Chen Shutao Xia Qian Yang FedML 30 9 0 24 May 2023
Optimal Activation of Halting Multi-Armed Bandit Models Wesley Cowan M. Katehakis S. Ross 15 1 0 20 Apr 2023
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits Liu Leqi Giulio Zhou Fatma Kilincc-Karzan Zachary Chase Lipton A. Montgomery 27 2 0 16 Apr 2023