1912.03517
Cited By
No-Regret Exploration in Goal-Oriented Reinforcement Learning
7 December 2019
Jean Tarbouriech
Evrard Garcelon
Michal Valko
Matteo Pirotta
A. Lazaric
Papers citing
"No-Regret Exploration in Goal-Oriented Reinforcement Learning"
15 / 15 papers shown
Title
Layered State Discovery for Incremental Autonomous Exploration
Liyu Chen
Andrea Tirinzoni
A. Lazaric
Matteo Pirotta
39
0
0
07 Feb 2023
Multi-Agent Congestion Cost Minimization With Linear Function Approximations
Prashant Trivedi
N. Hemachandra
40
0
0
26 Jan 2023
A Unified Algorithm for Stochastic Path Problems
Christoph Dann
Chen-Yu Wei
Julian Zimmert
37
0
0
17 Oct 2022
Convex duality for stochastic shortest path problems in known and unknown environments
Kelli Francis-Staite
29
0
0
31 Jul 2022
Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality
Ming Yin
Wenjing Chen
Mengdi Wang
Yu-Xiang Wang
OffRL
32
4
0
10 Jun 2022
Policy Optimization for Stochastic Shortest Path
Liyu Chen
Haipeng Luo
Aviv A. Rosenberg
21
12
0
07 Feb 2022
Learning Stochastic Shortest Path with Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
44
30
0
25 Oct 2021
Deep Reinforcement Learning for L3 Slice Localization in Sarcopenia Assessment
Othmane Laousy
G. Chassagnon
Edouard Oyallon
Nikos Paragios
M. Revel
Maria Vakalopoulou
22
2
0
27 Jul 2021
Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret
Jean Tarbouriech
Runlong Zhou
S. Du
Matteo Pirotta
M. Valko
A. Lazaric
65
35
0
22 Apr 2021
Confidence-Budget Matching for Sequential Budgeted Learning
Yonathan Efroni
Nadav Merlis
Aadirupa Saha
Shie Mannor
26
10
0
05 Feb 2021
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
38
30
0
06 Jan 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback
Tal Lancewicki
Aviv A. Rosenberg
Yishay Mansour
43
32
0
29 Dec 2020
Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition
Liyu Chen
Haipeng Luo
Chen-Yu Wei
29
32
0
07 Dec 2020
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
25
16
0
13 Jul 2020
Adaptive Reward-Free Exploration
E. Kaufmann
Pierre Ménard
O. D. Domingues
Anders Jonsson
Edouard Leurent
Michal Valko
30
80
0
11 Jun 2020