A Doubly Robust Approach to Sparse Reinforcement Learning

23 October 2023

Papers citing "A Doubly Robust Approach to Sparse Reinforcement Learning"

| Title | Authors | Tag | | | | Date |
|---|---|---|---|---|---|---|
| Linear Bandits with Partially Observable Features | Wonyoung Hedge Kim, Sungwoo Park, G. Iyengar, A. Zeevi, Min Hwan Oh | | 162 | 1 | 0 | 10 Feb 2025 |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Kyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun | | 78 | 13 | 0 | 25 Oct 2022 |
| Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation | Pihe Hu, Yu Chen, Longbo Huang | | 48 | 35 | 0 | 23 Jun 2022 |
| Representation Learning for Online and Offline RL in Low-rank MDPs | Masatoshi Uehara, Xuezhou Zhang, Wen Sun | OffRL | 102 | 129 | 0 | 09 Oct 2021 |
| Doubly robust Thompson sampling for linear payoffs | Wonyoung Hedge Kim, Gi-Soo Kim, M. Paik | | 18 | 25 | 0 | 01 Feb 2021 |
| Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes | Dongruo Zhou, Quanquan Gu, Csaba Szepesvári | | 66 | 207 | 0 | 15 Dec 2020 |
| FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs | Alekh Agarwal, Sham Kakade, A. Krishnamurthy, Wen Sun | OffRL | 151 | 226 | 0 | 18 Jun 2020 |
| Model-Based Reinforcement Learning with Value-Targeted Regression | Alex Ayoub, Zeyu Jia, Csaba Szepesvári, Mengdi Wang, Lin F. Yang | OffRL | 83 | 304 | 0 | 01 Jun 2020 |
| Provably Efficient Exploration in Policy Optimization | Qi Cai, Zhuoran Yang, Chi Jin, Zhaoran Wang | | 51 | 280 | 0 | 12 Dec 2019 |
| Doubly-Robust Lasso Bandit | Gi-Soo Kim, M. Paik | | 53 | 62 | 0 | 26 Jul 2019 |
| Provably Efficient Reinforcement Learning with Linear Function Approximation | Chi Jin, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan | | 86 | 556 | 0 | 11 Jul 2019 |
| Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning | Christoph Dann, Tor Lattimore, Emma Brunskill | | 72 | 308 | 0 | 22 Mar 2017 |
| Deep Reinforcement Learning for Dialogue Generation | Jiwei Li, Will Monroe, Alan Ritter, Michel Galley, Jianfeng Gao, Dan Jurafsky | | 270 | 1,331 | 0 | 05 Jun 2016 |
| Asynchronous Methods for Deep Reinforcement Learning | Volodymyr Mnih, Adria Puigdomenech Badia, M. Berk Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu | | 191 | 8,833 | 0 | 04 Feb 2016 |
| Playing Atari with Deep Reinforcement Learning | Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller | | 117 | 12,223 | 0 | 19 Dec 2013 |
| On the conditions used to prove oracle results for the Lasso | Sara van de Geer, Peter Buhlmann | | 257 | 731 | 0 | 05 Oct 2009 |