Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.07710
Cited By
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
22 March 2017
Christoph Dann
Tor Lattimore
Emma Brunskill
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning"
44 / 94 papers shown
Title
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
61
38
0
05 Oct 2021
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
35
12
0
11 Aug 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRL
OnRL
44
162
0
09 Jun 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
44
78
0
04 Jun 2021
Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls
Xin Guo
Anran Hu
Yufei Zhang
24
24
0
19 Apr 2021
Nearly Horizon-Free Offline Reinforcement Learning
Tongzheng Ren
Jialian Li
Bo Dai
S. Du
Sujay Sanghavi
OffRL
32
49
0
25 Mar 2021
UCB Momentum Q-learning: Correcting the bias without forgetting
Pierre Menard
O. D. Domingues
Xuedong Shang
Michal Valko
79
41
0
01 Mar 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAML
OffRL
44
37
0
16 Feb 2021
Online Apprenticeship Learning
Lior Shani
Tom Zahavy
Shie Mannor
OffRL
31
25
0
13 Feb 2021
Robust Policy Gradient against Strong Data Corruption
Xuezhou Zhang
Yiding Chen
Xiaojin Zhu
Wen Sun
AAML
45
37
0
11 Feb 2021
Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker
Max Simchowitz
Kevin G. Jamieson
32
18
0
10 Feb 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback
Tal Lancewicki
Aviv A. Rosenberg
Yishay Mansour
43
32
0
29 Dec 2020
On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces
Zhuoran Yang
Chi Jin
Zhaoran Wang
Mengdi Wang
Michael I. Jordan
44
18
0
09 Nov 2020
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration
Priyank Agrawal
Jinglin Chen
Nan Jiang
33
19
0
23 Oct 2020
CoinDICE: Off-Policy Confidence Interval Estimation
Bo Dai
Ofir Nachum
Yinlam Chow
Lihong Li
Csaba Szepesvári
Dale Schuurmans
OffRL
29
84
0
22 Oct 2020
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
Qinghua Liu
Tiancheng Yu
Yu Bai
Chi Jin
34
121
0
04 Oct 2020
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon
Zihan Zhang
Xiangyang Ji
S. Du
OffRL
39
104
0
28 Sep 2020
Private Reinforcement Learning with PAC and Regret Guarantees
G. Vietri
Borja Balle
A. Krishnamurthy
Zhiwei Steven Wu
26
60
0
18 Sep 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
32
39
0
01 Aug 2020
Near-Optimal Reinforcement Learning with Self-Play
Yunru Bai
Chi Jin
Tiancheng Yu
24
130
0
22 Jun 2020
Q
Q
Q
-learning with Logarithmic Regret
Kunhe Yang
Lin F. Yang
S. Du
48
59
0
16 Jun 2020
Adaptive Reward-Free Exploration
E. Kaufmann
Pierre Ménard
O. D. Domingues
Anders Jonsson
Edouard Leurent
Michal Valko
30
80
0
11 Jun 2020
Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach
Peter Glynn
Ramesh Johari
M. Rasouli
19
29
0
10 Jun 2020
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
27
34
0
02 Jun 2020
Model-Based Reinforcement Learning with Value-Targeted Regression
Alex Ayoub
Zeyu Jia
Csaba Szepesvári
Mengdi Wang
Lin F. Yang
OffRL
59
299
0
01 Jun 2020
Tightening Exploration in Upper Confidence Reinforcement Learning
Hippolyte Bourel
Odalric-Ambrym Maillard
M. S. Talebi
30
31
0
20 Apr 2020
Exploration-Exploitation in Constrained MDPs
Yonathan Efroni
Shie Mannor
Matteo Pirotta
33
171
0
04 Mar 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
41
124
0
17 Feb 2020
Provable Self-Play Algorithms for Competitive Reinforcement Learning
Yu Bai
Chi Jin
SSL
22
148
0
10 Feb 2020
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
116
194
0
07 Feb 2020
Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning
Ming Yin
Yu Wang
OffRL
29
80
0
29 Jan 2020
Lipschitz Lifelong Reinforcement Learning
Erwan Lecarpentier
David Abel
Kavosh Asadi
Yuu Jinnai
Emmanuel Rachelson
Michael L. Littman
OffRL
CLL
24
39
0
15 Jan 2020
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
137
135
0
09 Dec 2019
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
36
151
0
13 Nov 2019
Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Mahsa Asadi
M. S. Talebi
Hippolyte Bourel
Odalric-Ambrym Maillard
OffRL
6
9
0
09 Oct 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
52
543
0
11 Jul 2019
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutiere
Anders Rantzer
Stephen Tu
27
88
0
27 Jun 2019
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies
Yonathan Efroni
Nadav Merlis
Mohammad Ghavamzadeh
Shie Mannor
OffRL
24
68
0
27 May 2019
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP
Kefan Dong
Yuanhao Wang
Xiaoyu Chen
Liwei Wang
OffRL
19
95
0
27 Jan 2019
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
Andrea Zanette
Emma Brunskill
OffRL
56
273
0
01 Jan 2019
Policy Certificates: Towards Accountable Reinforcement Learning
Christoph Dann
Ashutosh Adhikari
Wei Wei
Jimmy J. Lin
OffRL
25
141
0
07 Nov 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
52
670
0
21 Sep 2018
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
29
7
0
01 Jun 2018
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
16
762
0
16 Mar 2017
Previous
1
2