ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.00954
  4. Cited By
Problem Dependent Reinforcement Learning Bounds Which Can Identify
  Bandit Structure in MDPs

Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs

3 November 2019
Andrea Zanette
Emma Brunskill
ArXivPDFHTML

Papers citing "Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs"

1 / 1 papers shown
Title
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement
  Learning
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
60
307
0
22 Mar 2017
1