ResearchTrend.AI

Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications

1 February 2021
Liyu Chen
Haipeng Luo
Chen-Yu Wei
arXiv: 2102.01046

Papers citing "Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications"

15 / 15 papers shown
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
34
4
0
26 May 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
42
2
0
31 Jan 2024
Unconstrained Online Learning with Unbounded Losses
Andrew Jacobsen
Ashok Cutkosky
32
16
0
08 Jun 2023
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback
Yang Cai
Haipeng Luo
Chen-Yu Wei
Weiqiang Zheng
29
17
0
05 Mar 2023
Repeated Bilateral Trade Against a Smoothed Adversary
Nicolò Cesa-Bianchi
Tommaso Cesari
Roberto Colomboni
Federico Fusco
S. Leonardi
36
16
0
21 Feb 2023
Unconstrained Dynamic Regret via Sparse Coding
Zhiyu Zhang
Ashok Cutkosky
I. Paschalidis
34
7
0
31 Jan 2023
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
27
17
0
26 Aug 2022
Parameter-free Mirror Descent
Andrew Jacobsen
Ashok Cutkosky
20
32
0
26 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Policy Optimization for Stochastic Shortest Path
Liyu Chen
Haipeng Luo
Aviv A. Rosenberg
19
12
0
07 Feb 2022
Parameter-free Online Linear Optimization with Side Information via Universal Coin Betting
Jeonghun Ryu
Alankrita Bhatt
Young-Han Kim
26
1
0
04 Feb 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
57
25
0
31 Jan 2022
No-Regret Learning in Time-Varying Zero-Sum Games
Mengxiao Zhang
Peng Zhao
Haipeng Luo
Zhi-Hua Zhou
33
38
0
30 Jan 2022
Isotuning With Applications To Scale-Free Online Learning
Laurent Orseau
Marcus Hutter
13
5
0
29 Dec 2021
Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition
Liyu Chen
Haipeng Luo
Chen-Yu Wei
21
32
0
07 Dec 2020