ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.09704
  4. Cited By
Unknown mixing times in apprenticeship and reinforcement learning

Unknown mixing times in apprenticeship and reinforcement learning

23 May 2019
Tom Zahavy
Alon Cohen
Haim Kaplan
Yishay Mansour
    OffRL
ArXivPDFHTML

Papers citing "Unknown mixing times in apprenticeship and reinforcement learning"

9 / 9 papers shown
Title
Inverse Reinforcement Learning with the Average Reward Criterion
Inverse Reinforcement Learning with the Average Reward Criterion
Feiyang Wu
Jingyang Ke
Anqi Wu
37
9
0
24 May 2023
Concentration Phenomenon for Random Dynamical Systems: An Operator
  Theoretic Approach
Concentration Phenomenon for Random Dynamical Systems: An Operator Theoretic Approach
Muhammad Naeem
Miroslav Pajic
39
1
0
07 Dec 2022
Transportation-Inequalities, Lyapunov Stability and Sampling for
  Dynamical Systems on Continuous State Space
Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space
Muhammad Naeem
Miroslav Pajic
24
3
0
25 May 2022
Discovering Diverse Nearly Optimal Policies with Successor Features
Discovering Diverse Nearly Optimal Policies with Successor Features
Tom Zahavy
Brendan O'Donoghue
André Barreto
Volodymyr Mnih
Sebastian Flennerhag
Satinder Singh
28
20
0
01 Jun 2021
Reward is enough for convex MDPs
Reward is enough for convex MDPs
Tom Zahavy
Brendan O'Donoghue
Guillaume Desjardins
Satinder Singh
77
73
0
01 Jun 2021
Discovering a set of policies for the worst case reward
Discovering a set of policies for the worst case reward
Tom Zahavy
André Barreto
D. Mankowitz
Shaobo Hou
Brendan O'Donoghue
Iurii Kemaev
Satinder Singh
OffRL
27
23
0
08 Feb 2021
Learning Expected Reward for Switched Linear Control Systems: A
  Non-Asymptotic View
Learning Expected Reward for Switched Linear Control Systems: A Non-Asymptotic View
Muhammad Naeem
Miroslav Pajic
16
1
0
15 Jun 2020
Apprenticeship Learning via Frank-Wolfe
Apprenticeship Learning via Frank-Wolfe
Tom Zahavy
Alon Cohen
Haim Kaplan
Yishay Mansour
29
18
0
05 Nov 2019
Inverse Reinforcement Learning in Contextual MDPs
Inverse Reinforcement Learning in Contextual MDPs
Stav Belogolovsky
Philip Korsunsky
Shie Mannor
Chen Tessler
Tom Zahavy
OffRL
BDL
31
18
0
23 May 2019
1