v1v2 (latest)

Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms

29 September 2022

Papers citing "Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms"

38 / 38 papers shown

Title
PAC Reinforcement Learning for Predictive State Representations Wenhao Zhan Masatoshi Uehara Wen Sun Jason D. Lee 56 39 0 12 Jul 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings Masatoshi Uehara Ayush Sekhari Jason D. Lee Nathan Kallus Wen Sun 83 6 0 24 Jun 2022
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems Masatoshi Uehara Ayush Sekhari Jason D. Lee Nathan Kallus Wen Sun OffRL 89 36 0 24 Jun 2022
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity Alekh Agarwal Tong Zhang 84 23 0 15 Jun 2022
Learning in Observable POMDPs, without Computationally Intractable Oracles Noah Golowich Ankur Moitra Dhruv Rohatgi 71 27 0 07 Jun 2022
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games Qinghua Liu Csaba Szepesvári Chi Jin 98 21 0 02 Jun 2022
Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent Yu Bai Chi Jin Song Mei Ziang Song Tiancheng Yu OffRL 79 19 0 30 May 2022
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency Lingxiao Wang Qi Cai Zhuoran Yang Zhaoran Wang 85 18 0 26 May 2022
Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games Ziang Song Song Mei Yu Bai 85 10 0 15 May 2022
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency Qi Cai Zhuoran Yang Zhaoran Wang 66 14 0 20 Apr 2022
When Is Partially Observable Reinforcement Learning Not Scary? Qinghua Liu Alan Chung Csaba Szepesvári Chi Jin 50 98 0 19 Apr 2022
Provable Reinforcement Learning with a Short-Term Memory Yonathan Efroni Chi Jin A. Krishnamurthy Sobhan Miryoosefi OffRL 46 38 0 08 Feb 2022
Near-Optimal Learning of Extensive-Form Games with Imperfect Information Yunru Bai Chi Jin Song Mei Tiancheng Yu 74 26 0 03 Feb 2022
Planning in Observable POMDPs in Quasipolynomial Time Noah Golowich Ankur Moitra Dhruv Rohatgi 46 27 0 12 Jan 2022
The Statistical Complexity of Interactive Decision Making Dylan J. Foster Sham Kakade Jian Qian Alexander Rakhlin 370 183 0 27 Dec 2021
Sublinear Regret for Learning POMDPs Yi Xiong Ningyuan Chen Xuefeng Gao Xiang Zhou 68 25 0 08 Jul 2021
Bilinear Classes: A Structural Framework for Provable Generalization in RL S. Du Sham Kakade Jason D. Lee Shachar Lovett G. Mahajan Wen Sun Ruosong Wang OffRL 171 191 0 19 Mar 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games Gabriele Farina Robin Schmucker Tuomas Sandholm 161 21 0 08 Mar 2021
Online Learning for Unknown Partially Observable MDPs Mehdi Jafarnia-Jahromi Rahul Jain A. Nayyar 78 20 0 25 Feb 2021
RL for Latent MDPs: Regret Guarantees and a Lower Bound Jeongyeol Kwon Yonathan Efroni Constantine Caramanis Shie Mannor 67 80 0 09 Feb 2021
Bellman Eluder Dimension: New Rich Classes of RL Problems, and Sample-Efficient Algorithms Chi Jin Qinghua Liu Sobhan Miryoosefi OffRL 91 219 0 01 Feb 2021
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs Chi Jin Sham Kakade A. Krishnamurthy Qinghua Liu 84 66 0 22 Jun 2020
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs Alekh Agarwal Sham Kakade A. Krishnamurthy Wen Sun OffRL 165 227 0 18 Jun 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies Stephan Zheng Alexander R. Trott Sunil Srinivasa Nikhil Naik Melvin Gruesbeck David C. Parkes R. Socher 56 135 0 28 Apr 2020
Reward-Free Exploration for Reinforcement Learning Chi Jin A. Krishnamurthy Max Simchowitz Tiancheng Yu OffRL 169 197 0 07 Feb 2020
Dota 2 with Large Scale Deep Reinforcement Learning OpenAI OpenAI : Christopher Berner Greg Brockman Brooke Chan ... Szymon Sidor Ilya Sutskever Jie Tang Filip Wolski Susan Zhang GNN VLM CLL AI4CE LRM 169 1,835 0 13 Dec 2019
Solving Rubik's Cube with a Robot Hand OpenAI Ilge Akkaya Marcin Andrychowicz Maciek Chociej Ma-teusz Litwin ... Peter Welinder Lilian Weng Qiming Yuan Wojciech Zaremba Lei Zhang ODL 121 1,232 0 16 Oct 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation Chi Jin Zhuoran Yang Zhaoran Wang Michael I. Jordan 98 559 0 11 Jul 2019
Provably efficient RL with Rich Observations via Latent State Decoding S. Du A. Krishnamurthy Nan Jiang Alekh Agarwal Miroslav Dudík John Langford OffRL 74 230 0 25 Jan 2019
Minimax Regret Bounds for Reinforcement Learning M. G. Azar Ian Osband Rémi Munos 90 778 0 16 Mar 2017
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable Nan Jiang A. Krishnamurthy Alekh Agarwal John Langford Robert Schapire 149 420 0 29 Oct 2016
A PAC RL Algorithm for Episodic POMDPs Z. Guo Shayan Doroudi Emma Brunskill 77 56 0 25 May 2016
Reinforcement Learning of POMDPs using Spectral Methods Kamyar Azizzadenesheli A. Lazaric Anima Anandkumar 44 128 0 25 Feb 2016
Efficient Learning and Planning with Compressed Predictive States William L. Hamilton M. M. Fard Joelle Pineau 68 41 0 01 Dec 2013
Hilbert Space Embeddings of Predictive State Representations Byron Boots Geoffrey J. Gordon Arthur Gretton 103 95 0 26 Sep 2013
Predictive State Representations: A New Theory for Modeling Dynamical Systems Satinder Singh Michael R. James Matthew R. Rudary AI4TS AI4CE 91 289 0 11 Jul 2012
Nonapproximability Results for Partially Observable Markov Decision Processes J. Goldsmith Christopher Lusena M. Mundhenk 90 110 0 01 Jun 2011
Closing the Learning-Planning Loop with Predictive State Representations Byron Boots S. Siddiqi Geoffrey J. Gordon 241 265 0 12 Dec 2009