arXiv:2102.01046
Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications
1 February 2021
Liyu Chen
Haipeng Luo
Chen-Yu Wei
Papers citing "Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications"
15 / 15 papers shown
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
34
4
0
26 May 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
42
2
0
31 Jan 2024
Unconstrained Online Learning with Unbounded Losses
Andrew Jacobsen
Ashok Cutkosky
32
16
0
08 Jun 2023
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback
Yang Cai
Haipeng Luo
Chen-Yu Wei
Weiqiang Zheng
29
17
0
05 Mar 2023
Repeated Bilateral Trade Against a Smoothed Adversary
Nicolò Cesa-Bianchi
Tommaso Cesari
Roberto Colomboni
Federico Fusco
S. Leonardi
36
16
0
21 Feb 2023
Unconstrained Dynamic Regret via Sparse Coding
Zhiyu Zhang
Ashok Cutkosky
I. Paschalidis
34
7
0
31 Jan 2023
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
27
17
0
26 Aug 2022
Parameter-free Mirror Descent
Andrew Jacobsen
Ashok Cutkosky
20
32
0
26 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Policy Optimization for Stochastic Shortest Path
Liyu Chen
Haipeng Luo
Aviv A. Rosenberg
19
12
0
07 Feb 2022
Parameter-free Online Linear Optimization with Side Information via Universal Coin Betting
Jeonghun Ryu
Alankrita Bhatt
Young-Han Kim
26
1
0
04 Feb 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
57
25
0
31 Jan 2022
No-Regret Learning in Time-Varying Zero-Sum Games
Mengxiao Zhang
Peng Zhao
Haipeng Luo
Zhi-Hua Zhou
33
38
0
30 Jan 2022
Isotuning With Applications To Scale-Free Online Learning
Laurent Orseau
Marcus Hutter
13
5
0
29 Dec 2021
Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition
Liyu Chen
Haipeng Luo
Chen-Yu Wei
21
32
0
07 Dec 2020