1905.12950
Equipping Experts/Bandits with Long-term Memory
30 May 2019
Kai Zheng
Haipeng Luo
Ilias Diakonikolas
Liwei Wang
OffRL
Papers citing "Equipping Experts/Bandits with Long-term Memory"
7 / 7 papers shown
Title
Refined Regret for Adversarial MDPs with Linear Function Approximation
Yan Dai
Haipeng Luo
Chen-Yu Wei
Julian Zimmert
31
12
0
30 Jan 2023
Online Bilevel Optimization: Regret Analysis of Online Alternating Gradient Methods
Davoud Ataee Tarzanagh
Parvin Nazari
Bojian Hou
Li Shen
Laura Balzano
53
10
0
06 Jul 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
A Simple Approach for Non-stationary Linear Bandits
Peng Zhao
Lijun Zhang
Yuan Jiang
Zhi-Hua Zhou
36
81
0
09 Mar 2021
Non-stationary Online Learning with Memory and Non-stochastic Control
Peng Zhao
Yu-Hu Yan
Yu-Xiang Wang
Zhi-Hua Zhou
40
47
0
07 Feb 2021
Active Online Learning with Hidden Shifting Domains
Yining Chen
Haipeng Luo
Tengyu Ma
Chicheng Zhang
31
5
0
25 Jun 2020
A Closer Look at Small-loss Bounds for Bandits with Graph Feedback
Chung-Wei Lee
Haipeng Luo
Mengxiao Zhang
17
23
0
02 Feb 2020