Stochastic approximation with cone-contractive operators: Sharp
  $\ell_\infty$-bounds for $Q$-learning
v1v2 (latest)

Stochastic approximation with cone-contractive operators: Sharp \ell_\infty-bounds for QQ-learning

Papers citing "Stochastic approximation with cone-contractive operators: Sharp $\ell_\infty$-bounds for $Q$-learning"