Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm

30 October 2024
Sattar Vakili
Julia Olkhovskaya

Papers citing "Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm"

16 / 16 papers shown
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Joongkyu Lee
Min-hwan Oh
53
2
0
31 Oct 2024
Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning
Sattar Vakili
OffRL
42
1
0
21 Jun 2024
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
Sattar Vakili
Julia Olkhovskaya
47
9
0
13 Jun 2023
Sample Complexity of Kernel-Based Q-Learning
Sing-Yuan Yeh
Fu-Chieh Chang
Chang-Wei Yueh
Pei-Yuan Wu
A. Bernacchia
Sattar Vakili
OffRL
66
4
0
01 Feb 2023
Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times
Daniele Calandriello
Luigi Carratino
A. Lazaric
Michal Valko
Lorenzo Rosasco
48
14
0
30 Jan 2022
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-shan Shiu
73
54
0
20 Aug 2021
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Yue Wu
Dongruo Zhou
Quanquan Gu
41
21
0
15 Feb 2021
A Domain-Shrinking based Bayesian Optimization Algorithm with Order-Optimal Regret Performance
Sudeep Salgia
Sattar Vakili
Qing Zhao
68
34
0
27 Oct 2020
On Information Gain and Regret Bounds in Gaussian Process Bandits
Sattar Vakili
Kia Khezeli
Victor Picheny
GP
52
133
0
15 Sep 2020
Variational Policy Gradient Method for Reinforcement Learning with General Utilities
Junyu Zhang
Alec Koppel
Amrit Singh Bedi
Csaba Szepesvári
Mengdi Wang
59
139
0
04 Jul 2020
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
Dongruo Zhou
Jiafan He
Quanquan Gu
58
135
0
23 Jun 2020
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
168
136
0
09 Dec 2019
On Batch Bayesian Optimization
Sayak Ray Chowdhury
Aditya Gopalan
39
10
0
04 Nov 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
121
105
0
15 Oct 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
86
557
0
11 Jul 2019
Finite-Time Analysis of Kernelised Contextual Bandits
Michal Valko
N. Korda
Rémi Munos
I. Flaounas
N. Cristianini
180
273
0
26 Sep 2013