ResearchTrend.AI
arXiv: 1904.07272
Introduction to Multi-Armed Bandits

15 April 2019
Aleksandrs Slivkins

Papers citing "Introduction to Multi-Armed Bandits"

50 / 164 papers shown
Title
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits
Liu Leqi
Giulio Zhou
Fatma Kılınç-Karzan
Zachary Chase Lipton
A. Montgomery
27
2
0
16 Apr 2023
Mixing predictions for online metric algorithms
A. Antoniadis
Christian Coester
Marek Eliáš
Adam Polak
Bertrand Simon
26
12
0
04 Apr 2023
Thompson Sampling for Linear Bandit Problems with Normal-Gamma Priors
Björn Lindenberg
Karl-Olof Lindahl
30
0
0
06 Mar 2023
Design-Based Inference for Multi-arm Bandits
D. Ham
Iavor Bojinov
Michael Lindon
M. Tingley
34
1
0
27 Feb 2023
Bandit Social Learning: Exploration under Myopic Behavior
Kiarash Banihashem
Mohammadtaghi Hajiaghayi
Suho Shin
Aleksandrs Slivkins
21
4
0
15 Feb 2023
Learning in quantum games
Kyriakos Lotidis
P. Mertikopoulos
Nicholas Bambos
24
7
0
05 Feb 2023
Learning with Exposure Constraints in Recommendation Systems
Omer Ben-Porat
Rotem Torkan
29
12
0
02 Feb 2023
A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback
G. Nie
Yididiya Y. Nadew
Yanhui Zhu
Vaneet Aggarwal
Christopher J. Quinn
OffRL
19
13
0
30 Jan 2023
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
106
9
0
29 Jan 2023
Complexity Analysis of a Countable-armed Bandit Problem
Anand Kalvit
A. Zeevi
21
3
0
18 Jan 2023
Contextual Bandits and Optimistically Universal Learning
Moise Blanchard
Steve Hanneke
Patrick Jaillet
OffRL
28
1
0
31 Dec 2022
Invariant Lipschitz Bandits: A Side Observation Approach
Nam-Phuong Tran
Long Tran-Thanh
51
1
0
14 Dec 2022
A survey on multi-player bandits
Etienne Boursier
Vianney Perchet
32
13
0
29 Nov 2022
Incorporating Multi-armed Bandit with Local Search for MaxSAT
Jiongzhi Zheng
Kun He
Jianrong Zhou
Yan Jin
ChuMin Li
F. Manyà
19
1
0
29 Nov 2022
Eluder-based Regret for Stochastic Contextual MDPs
Orin Levy
Asaf B. Cassel
Alon Cohen
Yishay Mansour
35
5
0
27 Nov 2022
Distributed Resource Allocation for URLLC in IIoT Scenarios: A Multi-Armed Bandit Approach
Francesco Pase
M. Giordani
Giampaolo Cuozzo
Sara Cavallero
J. Eichinger
Roberto Verdone
M. Zorzi
34
9
0
22 Nov 2022
Bandit Algorithms for Prophet Inequality and Pandora's Box
Khashayar Gatmiry
Thomas Kesselheim
Sahil Singla
Yuran Wang
31
9
0
16 Nov 2022
Adaptive Data Depth via Multi-Armed Bandits
Tavor Z. Baharav
T. Lai
23
1
0
08 Nov 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
37
27
0
24 Oct 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
59
9
0
21 Oct 2022
Vertical Federated Linear Contextual Bandits
Zeyu Cao
Zhipeng Liang
Shu Zhen Zhang
Hang Li
Ouyang Wen
Yu Rong
P. Zhao
Bing Wu
FedML
32
0
0
20 Oct 2022
Product Ranking for Revenue Maximization with Multiple Purchases
Renzhe Xu
Xingxuan Zhang
Yangqiu Song
Yafeng Zhang
Xiaolong Chen
Peng Cui
11
2
0
15 Oct 2022
Neuro-symbolic Explainable Artificial Intelligence Twin for Zero-touch IoE in Wireless Network
M. S. Munir
Ki Tae Kim
Apurba Adhikary
Walid Saad
Sachin Shetty
Seong-Bae Park
Choong Seon Hong
30
20
0
13 Oct 2022
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Siddhartha Banerjee
Sean R. Sinclair
Milind Tambe
Lily Xu
Chao Yu
AI4TS
33
6
0
30 Sep 2022
Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem
Raunak Kumar
Robert D. Kleinberg
23
13
0
24 Sep 2022
Multi-armed Bandit Learning on a Graph
Tianpeng Zhang
Kasper Johansson
Na Li
33
6
0
20 Sep 2022
$MC^2$: Rigorous and Efficient Directed Greybox Fuzzing
Abhishek Shah
Dongdong She
Samanway Sadhu
Krishma Singal
Peter Coffman
Suman Jana
28
23
0
30 Aug 2022
Learning in Stackelberg Games with Non-myopic Agents
Nika Haghtalab
Thodoris Lykouris
Sloan Nietert
Alexander Wei
28
29
0
19 Aug 2022
Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits
Bo Li
C. Yeung
24
0
0
11 Aug 2022
Online Prediction in Sub-linear Space
Binghui Peng
Fred Zhang
25
16
0
16 Jul 2022
Hindsight Learning for MDPs with Exogenous Inputs
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
18
23
0
13 Jul 2022
Differentially Private Linear Bandits with Partial Distributed Feedback
Fengjiao Li
Xingyu Zhou
Bo Ji
FedML
36
13
0
12 Jul 2022
Autonomous Drug Design with Multi-Armed Bandits
Hampus Gummesson Svensson
E. Bjerrum
C. Tyrchan
Ola Engkvist
M. Chehreghani
37
5
0
04 Jul 2022
Online Resource Allocation under Horizon Uncertainty
S. Balseiro
Christian Kroer
Rachitesh Kumar
29
15
0
27 Jun 2022
Differentially Private Federated Combinatorial Bandits with Constraints
Sambhav Solanki
Samhita Kanaparthy
Sankarshan Damle
Sujit Gujar
FedML
31
4
0
27 Jun 2022
Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms
Xuchuang Wang
Hong Xie
John C. S. Lui
30
6
0
17 Jun 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization
Quan-Wu Xiao
Qing Ling
Tianyi Chen
46
0
0
14 Jun 2022
Decentralized, Communication- and Coordination-free Learning in Structured Matching Markets
C. Maheshwari
Eric Mazumdar
S. Shankar Sastry
21
11
0
06 Jun 2022
Indirect Active Learning
Shashank Singh
21
0
0
03 Jun 2022
Contextual Bandits with Knapsacks for a Conversion Model
Zerui Li
Gilles Stoltz
68
3
0
01 Jun 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
T. Javidi
A. Mazumdar
33
4
0
31 May 2022
Survey on Fair Reinforcement Learning: Theory and Practice
Pratik Gajane
A. Saxena
M. Tavakol
George Fletcher
Mykola Pechenizkiy
FaML
OffRL
40
13
0
20 May 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps
Jinglin Chen
Nan Jiang
OffRL
23
34
0
25 Mar 2022
Modeling Attrition in Recommender Systems with Departing Bandits
Omer Ben-Porat
Lee Cohen
Liu Leqi
Zachary Chase Lipton
Yishay Mansour
21
11
0
25 Mar 2022
Universal Regression with Adversarial Responses
Moise Blanchard
Patrick Jaillet
29
6
0
09 Mar 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
Branislav Kveton
Rui Song
OffRL
36
10
0
26 Feb 2022
Communicating Robot Conventions through Shared Autonomy
Ananth Jonnavittula
Dylan P. Losey
28
7
0
22 Feb 2022
No-Regret Learning in Partially-Informed Auctions
Wenshuo Guo
Michael I. Jordan
Ellen Vitercik
18
9
0
22 Feb 2022
Learning Revenue Maximization using Posted Prices for Stochastic Strategic Patient Buyers
Eitan-Hai Mashiah
Idan Attias
Yishay Mansour
17
1
0
12 Feb 2022
Online V2X Scheduling for Raw-Level Cooperative Perception
Yukuan Jia
Ruiqing Mao
Yuxuan Sun
Sheng Zhou
Z. Niu
30
8
0
12 Feb 2022