ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.07272
  4. Cited By
Introduction to Multi-Armed Bandits

Introduction to Multi-Armed Bandits

15 April 2019
Aleksandrs Slivkins
ArXivPDFHTML

Papers citing "Introduction to Multi-Armed Bandits"

50 / 163 papers shown
Title
Robust Online Learning with Private Information
Robust Online Learning with Private Information
Kyohei Okumura
51
0
0
08 May 2025
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy
Aditya Vatsavai
Ganesh Narasimha
Yongtao Liu
Jan-Chi Yang
Hiroshu Funakubo
M. Ziatdinov
Rama K Vasudevan
19
0
0
28 Apr 2025
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents
Raghav Thind
Youran Sun
Ling Liang
Haizhao Yang
LLMAG
40
0
0
23 Apr 2025
Evolution of Optimization Algorithms for Global Placement via Large Language Models
Evolution of Optimization Algorithms for Global Placement via Large Language Models
Xufeng Yao
Jiaxi Jiang
Yuxuan Zhao
Peiyu Liao
Yibo Lin
Bei Yu
68
0
0
18 Apr 2025
A New Benchmark for Online Learning with Budget-Balancing Constraints
A New Benchmark for Online Learning with Budget-Balancing Constraints
M. Braverman
Jingyi Liu
Jieming Mao
Jon Schneider
Eric Xue
60
0
0
19 Mar 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
86
1
0
06 Mar 2025
A Theoretical Model for Grit in Pursuing Ambitious Ends
A Theoretical Model for Grit in Pursuing Ambitious Ends
Avrim Blum
Emily Diana
Kavya Ravichandran
A. Tolbert
42
0
0
04 Mar 2025
Functional multi-armed bandit and the best function identification problems
Yuriy Dorn
Aleksandr Katrutsa
Ilgam Latypov
Anastasiia Soboleva
32
0
0
01 Mar 2025
Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness
Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness
Piyushi Manupriya
Himanshu
S. Jagarlapudi
Ganesh Ghalme
FaML
54
0
0
24 Feb 2025
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
Jianyu Xu
Qiuzhuang Sun
Yang Yang
Huadong Mo
Daoyi Dong
83
0
0
24 Feb 2025
AI-Assisted Decision Making with Human Learning
AI-Assisted Decision Making with Human Learning
Gali Noti
Kate Donahue
Jon M. Kleinberg
Sigal Oren
138
0
0
18 Feb 2025
Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners
Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners
David Easley
Yoav Kolumbus
Éva Tardos
92
0
0
12 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
60
0
0
31 Jan 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
56
5
0
29 Jan 2025
Fuzzing at Scale: The Untold Story of the Scheduler
Fuzzing at Scale: The Untold Story of the Scheduler
Ivica Nikolić
Racchit Jain
74
0
0
28 Jan 2025
Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization
Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization
Xu Yang
Rui Wang
Kaiwen Li
Ling Wang
56
0
0
22 Jan 2025
Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration
Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration
Zuyuan Zhang
Vaneet Aggarwal
Tian-Shing Lan
DiffM
42
0
0
10 Jan 2025
Online Joint Assortment-Inventory Optimization under MNL Choices
Online Joint Assortment-Inventory Optimization under MNL Choices
Yong Liang
Xiaojie Mao
Shiyuan Wang
53
0
0
03 Jan 2025
HR-Bandit: Human-AI Collaborated Linear Recourse Bandit
HR-Bandit: Human-AI Collaborated Linear Recourse Bandit
Junyu Cao
Ruijiang Gao
Esmaeil Keyvanshokooh
42
1
0
18 Oct 2024
AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments
AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments
Till Raphael Saenger
Musashi Hinck
Justin Grimmer
Brandon M Stewart
38
2
0
11 Oct 2024
Accurate and Regret-aware Numerical Problem Solver for Tabular Question Answering
Accurate and Regret-aware Numerical Problem Solver for Tabular Question Answering
Yuxiang Wang
Jianzhong Qi
Junhao Gan
LMTD
53
2
0
10 Oct 2024
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
Yu Chen
Jiatai Huang
Yan Dai
Longbo Huang
34
0
0
04 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
39
5
0
24 Jul 2024
Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality
Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality
Antoine Scheid
Aymeric Capitaine
Etienne Boursier
Eric Moulines
Michael I. Jordan
Alain Durmus
47
2
0
28 Jun 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Xutong Liu
Siwei Wang
Jinhang Zuo
Han Zhong
Xuchuang Wang
Zhiyong Wang
Shuai Li
Mohammad Hajiesmaili
J. C. Lui
Wei Chen
85
1
0
03 Jun 2024
Paying to Do Better: Games with Payments between Learning Agents
Paying to Do Better: Games with Payments between Learning Agents
Y. Kolumbus
Joe Halpern
Éva Tardos
32
1
0
31 May 2024
Federated Combinatorial Multi-Agent Multi-Armed Bandits
Federated Combinatorial Multi-Agent Multi-Armed Bandits
Fares Fourati
Mohamed-Slim Alouini
Vaneet Aggarwal
FedML
33
5
0
09 May 2024
Batched Stochastic Bandit for Nondegenerate Functions
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
52
0
0
09 May 2024
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning
Minh Nguyen
Chandrajit Bajaj
25
0
0
24 Apr 2024
Adaptive Memory Replay for Continual Learning
Adaptive Memory Replay for Continual Learning
James Seale Smith
Lazar Valkov
Shaunak Halbe
V. Gutta
Rogerio Feris
Z. Kira
Leonid Karlinsky
49
6
0
18 Apr 2024
Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections
Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections
Zeng Peng
Xiao Zhou
Lei Zheng
Yubin Wang
Jun Ma
71
4
0
20 Mar 2024
Misalignment, Learning, and Ranking: Harnessing Users Limited Attention
Misalignment, Learning, and Ranking: Harnessing Users Limited Attention
Arpit Agarwal
Rad Niazadeh
Prathamesh Patil
34
0
0
21 Feb 2024
Incentivized Exploration via Filtered Posterior Sampling
Incentivized Exploration via Filtered Posterior Sampling
Anand Kalvit
Aleksandrs Slivkins
Yonatan Gur
29
1
0
20 Feb 2024
Trust Regions for Explanations via Black-Box Probabilistic Certification
Trust Regions for Explanations via Black-Box Probabilistic Certification
Amit Dhurandhar
Swagatam Haldar
Dennis L. Wei
Karthikeyan N. Ramamurthy
FAtt
42
2
0
17 Feb 2024
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis
Phevos Paschalidis
Runyu Zhang
Na Li
33
0
0
18 Jan 2024
Multi-Agent Join
Multi-Agent Join
Vahid Ghadakchi
Mian Xie
Arash Termehchy
Bakhtiyar Doskenov
Bharghav Srikhakollu
Summit Haque
Huazheng Wang
20
0
0
21 Dec 2023
Active teacher selection for reinforcement learning from human feedback
Active teacher selection for reinforcement learning from human feedback
Rachel Freedman
Justin Svegliato
K. H. Wray
Stuart J. Russell
31
6
0
23 Oct 2023
Adaptive maximization of social welfare
Adaptive maximization of social welfare
Nicolò Cesa-Bianchi
Roberto Colomboni
Maximilian Kasy
30
0
0
14 Oct 2023
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach
Arman Rahbar
Niklas Åkerblom
M. Chehreghani
28
0
0
21 Aug 2023
Consensus-based Participatory Budgeting for Legitimacy: Decision Support
  via Multi-agent Reinforcement Learning
Consensus-based Participatory Budgeting for Legitimacy: Decision Support via Multi-agent Reinforcement Learning
Srijoni Majumdar
Evangelos Pournaras
16
3
0
24 Jul 2023
Approximate information for efficient exploration-exploitation
  strategies
Approximate information for efficient exploration-exploitation strategies
A. Barbier–Chebbah
Christian L. Vestergaard
Jean-Baptiste Masson
29
2
0
04 Jul 2023
Allocating Divisible Resources on Arms with Unknown and Random Rewards
Allocating Divisible Resources on Arms with Unknown and Random Rewards
Ningyuan Chen
Wenhao Li
24
0
0
28 Jun 2023
Trading-off price for data quality to achieve fair online allocation
Trading-off price for data quality to achieve fair online allocation
M. Molina
Nicolas Gast
P. Loiseau
Vianney Perchet
32
4
0
23 Jun 2023
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with
  Heterogeneous Rewards
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards
Mengfan Xu
Diego Klabjan
37
6
0
08 Jun 2023
Incentivizing Exploration with Linear Contexts and Combinatorial Actions
Incentivizing Exploration with Linear Contexts and Combinatorial Actions
Mark Sellke
29
3
0
03 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions
Robust Lipschitz Bandits to Adversarial Corruptions
Yue Kang
Cho-Jui Hsieh
T. C. Lee
AAML
30
8
0
29 May 2023
Green Runner: A tool for efficient model selection from model
  repositories
Green Runner: A tool for efficient model selection from model repositories
Jai Kannan
Scott Barnett
Anj Simmons
Taylan Selvi
Luís Cruz
30
1
0
26 May 2023
Theoretically Principled Federated Learning for Balancing Privacy and
  Utility
Theoretically Principled Federated Learning for Balancing Privacy and Utility
Xiaojin Zhang
Wenjie Li
Kai Chen
Shutao Xia
Qian Yang
FedML
30
9
0
24 May 2023
Optimal Activation of Halting Multi-Armed Bandit Models
Optimal Activation of Halting Multi-Armed Bandit Models
Wesley Cowan
M. Katehakis
S. Ross
15
1
0
20 Apr 2023
A Field Test of Bandit Algorithms for Recommendations: Understanding the
  Validity of Assumptions on Human Preferences in Multi-armed Bandits
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits
Liu Leqi
Giulio Zhou
Fatma Kilincc-Karzan
Zachary Chase Lipton
A. Montgomery
27
2
0
16 Apr 2023
1234
Next