Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.15286
Cited By
A Doubly Robust Approach to Sparse Reinforcement Learning
23 October 2023
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Doubly Robust Approach to Sparse Reinforcement Learning"
16 / 16 papers shown
Title
Linear Bandits with Partially Observable Features
Wonyoung Hedge Kim
Sungwoo Park
G. Iyengar
A. Zeevi
Min Hwan Oh
162
1
0
10 Feb 2025
PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits
Kyoungseok Jang
Chicheng Zhang
Kwang-Sung Jun
78
13
0
25 Oct 2022
Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation
Pihe Hu
Yu Chen
Longbo Huang
48
35
0
23 Jun 2022
Representation Learning for Online and Offline RL in Low-rank MDPs
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
102
129
0
09 Oct 2021
Doubly robust Thompson sampling for linear payoffs
Wonyoung Hedge Kim
Gi-Soo Kim
M. Paik
18
25
0
01 Feb 2021
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Dongruo Zhou
Quanquan Gu
Csaba Szepesvári
66
207
0
15 Dec 2020
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
OffRL
151
226
0
18 Jun 2020
Model-Based Reinforcement Learning with Value-Targeted Regression
Alex Ayoub
Zeyu Jia
Csaba Szepesvári
Mengdi Wang
Lin F. Yang
OffRL
83
304
0
01 Jun 2020
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
51
280
0
12 Dec 2019
Doubly-Robust Lasso Bandit
Gi-Soo Kim
M. Paik
53
62
0
26 Jul 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
86
556
0
11 Jul 2019
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
72
308
0
22 Mar 2017
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
270
1,331
0
05 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
191
8,833
0
04 Feb 2016
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
117
12,223
0
19 Dec 2013
On the conditions used to prove oracle results for the Lasso
Sara van de Geer
Peter Buhlmann
257
731
0
05 Oct 2009
1