Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.10814
Cited By
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
18 June 2020
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs"
50 / 188 papers shown
Title
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Canzhe Zhao
Ruofeng Yang
Baoxiang Wang
Xuezhou Zhang
Shuai Li
22
2
0
14 Nov 2023
Low-Rank MDPs with Continuous Action Spaces
Andrew Bennett
Nathan Kallus
M. Oprescu
33
2
0
06 Nov 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
25
3
0
23 Oct 2023
Prospective Side Information for Latent MDPs
Jeongyeol Kwon
Yonathan Efroni
Shie Mannor
C. Caramanis
23
5
0
11 Oct 2023
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning
Stefan Stojanovic
Yassir Jedra
Alexandre Proutière
23
5
0
10 Oct 2023
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Zihan Zhou
Honghao Wei
Lei Ying
OffRL
35
1
0
27 Sep 2023
Representation Learning in Low-rank Slate-based Recommender Systems
Yijia Dai
Wen Sun
OffRL
20
0
0
10 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan-Chia Cheng
J. Yang
Yitao Liang
OOD
36
1
0
10 Aug 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Byeongchang Kim
Min Hwan Oh
OffRL
17
12
0
21 Jul 2023
Efficient Model-Free Exploration in Low-Rank MDPs
Zakaria Mhammedi
Adam Block
Dylan J. Foster
Alexander Rakhlin
OffRL
19
13
0
08 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
24
5
0
01 Jul 2023
TD Convergence: An Optimization Perspective
Kavosh Asadi
Shoham Sabach
Yao Liu
Omer Gottesman
Rasool Fakoor
MU
12
8
0
30 Jun 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Tianyi Zhou
Meng Fang
S. Maghsudi
32
4
0
29 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
35
74
0
26 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li
Peizhong Ju
Ness B. Shroff
26
0
0
22 Jun 2023
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
32
5
0
21 Jun 2023
On the Model-Misspecification in Reinforcement Learning
Yunfan Li
Lin F. Yang
36
5
0
19 Jun 2023
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization
Mudit Gaur
Amrit Singh Bedi
Di-di Wang
Vaneet Aggarwal
35
3
0
18 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
28
3
0
17 Jun 2023
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
Yufeng Zhang
Fengzhuo Zhang
Zhuoran Yang
Zhaoran Wang
BDL
36
62
0
30 May 2023
Provable Reward-Agnostic Preference-Based Reinforcement Learning
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
19
7
0
29 May 2023
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Zhuoran Yang
Mengdi Wang
OffRL
29
54
0
29 May 2023
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks
Honghao Wei
Xin Liu
Weina Wang
Lei Ying
24
10
0
25 May 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
26
17
0
25 May 2023
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure
Xumei Xi
C. Yu
Yudong Chen
OffRL
22
0
0
24 May 2023
On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation
Jiawei Huang
Batuhan Yardim
Niao He
34
10
0
18 May 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRL
OnRL
73
12
0
17 May 2023
Goal-oriented inference of environment from redundant observations
Kazuki Takahashi
T. Fukai
Y. Sakai
T. Takekawa
9
0
0
08 May 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
13
2
0
01 May 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
Dingwen Kong
Lin F. Yang
29
9
0
18 Apr 2023
Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL
Zakaria Mhammedi
Dylan J. Foster
Alexander Rakhlin
63
18
0
12 Apr 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?
Jialin Dong
Lin F. Yang
19
1
0
29 Mar 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan-Chia Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
37
8
0
20 Mar 2023
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games
Anna Winnicki
R. Srikant
34
1
0
17 Mar 2023
On the Provable Advantage of Unsupervised Pretraining
Jiawei Ge
Shange Tang
Jianqing Fan
Chi Jin
SSL
33
16
0
02 Mar 2023
Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation
Pedro Cisneros-Velarde
Oluwasanmi Koyejo
18
1
0
01 Mar 2023
Exponential Hardness of Reinforcement Learning with Linear Function Approximation
Daniel M. Kane
Sihan Liu
Shachar Lovett
G. Mahajan
Csaba Szepesvári
Gellert Weisz
46
3
0
25 Feb 2023
Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs
Dan Qiao
Ming Yin
Yu-Xiang Wang
30
6
0
24 Feb 2023
Adversarial Model for Offline Reinforcement Learning
M. Bhardwaj
Tengyang Xie
Byron Boots
Nan Jiang
Ching-An Cheng
AAML
OffRL
27
25
0
21 Feb 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
33
13
0
19 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation
Qiwen Cui
K. Zhang
S. Du
28
23
0
07 Feb 2023
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Kaiwen Wang
Nathan Kallus
Wen Sun
101
18
0
07 Feb 2023
Reinforcement Learning in Low-Rank MDPs with Density Features
Audrey Huang
Jinglin Chen
Nan Jiang
OffRL
13
14
0
04 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
26
19
0
31 Jan 2023
Theoretical Analysis of Offline Imitation With Supplementary Dataset
Ziniu Li
Tian Xu
Y. Yu
Zhixun Luo
OffRL
27
2
0
27 Jan 2023
Exploration in Model-based Reinforcement Learning with Randomized Reward
Lingxiao Wang
Ping Li
11
0
0
09 Jan 2023
Latent Variable Representation for Reinforcement Learning
Tongzheng Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
16
10
0
17 Dec 2022
Provably Efficient Model-free RL in Leader-Follower MDP with Linear Function Approximation
A. Ghosh
15
1
0
28 Nov 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
28
1
0
28 Nov 2022
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data
Tengyang Xie
M. Bhardwaj
Nan Jiang
Ching-An Cheng
OffRL
20
9
0
08 Nov 2022
Previous
1
2
3
4
Next