ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.10814
  4. Cited By
FLAMBE: Structural Complexity and Representation Learning of Low Rank
  MDPs

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

18 June 2020
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
    OffRL
ArXivPDFHTML

Papers citing "FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs"

50 / 188 papers shown
Title
Learning Adversarial Low-rank Markov Decision Processes with Unknown
  Transition and Full-information Feedback
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Canzhe Zhao
Ruofeng Yang
Baoxiang Wang
Xuezhou Zhang
Shuai Li
22
2
0
14 Nov 2023
Low-Rank MDPs with Continuous Action Spaces
Low-Rank MDPs with Continuous Action Spaces
Andrew Bennett
Nathan Kallus
M. Oprescu
33
2
0
06 Nov 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
A Doubly Robust Approach to Sparse Reinforcement Learning
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
25
3
0
23 Oct 2023
Prospective Side Information for Latent MDPs
Prospective Side Information for Latent MDPs
Jeongyeol Kwon
Yonathan Efroni
Shie Mannor
C. Caramanis
23
5
0
11 Oct 2023
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement
  Learning
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning
Stefan Stojanovic
Yassir Jedra
Alexandre Proutière
23
5
0
10 Oct 2023
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Zihan Zhou
Honghao Wei
Lei Ying
OffRL
35
1
0
27 Sep 2023
Representation Learning in Low-rank Slate-based Recommender Systems
Representation Learning in Low-rank Slate-based Recommender Systems
Yijia Dai
Wen Sun
OffRL
20
0
0
10 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan-Chia Cheng
J. Yang
Yitao Liang
OOD
36
1
0
10 Aug 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Model-based Offline Reinforcement Learning with Count-based Conservatism
Byeongchang Kim
Min Hwan Oh
OffRL
17
12
0
21 Jul 2023
Efficient Model-Free Exploration in Low-Rank MDPs
Efficient Model-Free Exploration in Low-Rank MDPs
Zakaria Mhammedi
Adam Block
Dylan J. Foster
Alexander Rakhlin
OffRL
19
13
0
08 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State
  Representations
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
24
5
0
01 Jul 2023
TD Convergence: An Optimization Perspective
TD Convergence: An Optimization Perspective
Kavosh Asadi
Shoham Sabach
Yao Liu
Omer Gottesman
Rasool Fakoor
MU
12
8
0
30 Jun 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value
  Approximation in Reinforcement Learning
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Tianyi Zhou
Meng Fang
S. Maghsudi
32
4
0
29 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
35
74
0
26 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by
  Action Space Reduction via Grouping
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li
Peizhong Ju
Ness B. Shroff
26
0
0
22 Jun 2023
Provably Efficient Representation Learning with Tractable Planning in
  Low-Rank POMDP
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
32
5
0
21 Jun 2023
On the Model-Misspecification in Reinforcement Learning
On the Model-Misspecification in Reinforcement Learning
Yunfan Li
Lin F. Yang
36
5
0
19 Jun 2023
On the Global Convergence of Natural Actor-Critic with Two-layer Neural
  Network Parametrization
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization
Mudit Gaur
Amrit Singh Bedi
Di-di Wang
Vaneet Aggarwal
35
3
0
18 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High
  Dimensions
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
28
3
0
17 Jun 2023
What and How does In-Context Learning Learn? Bayesian Model Averaging,
  Parameterization, and Generalization
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
Yufeng Zhang
Fengzhuo Zhang
Zhuoran Yang
Zhaoran Wang
BDL
36
62
0
30 May 2023
Provable Reward-Agnostic Preference-Based Reinforcement Learning
Provable Reward-Agnostic Preference-Based Reinforcement Learning
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
19
7
0
29 May 2023
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via
  Pessimism
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Zhuoran Yang
Mengdi Wang
OffRL
29
54
0
29 May 2023
Sample Efficient Reinforcement Learning in Mixed Systems through
  Augmented Samples and Its Applications to Queueing Networks
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks
Honghao Wei
Xin Liu
Weina Wang
Lei Ying
24
10
0
25 May 2023
The Benefits of Being Distributional: Small-Loss Bounds for
  Reinforcement Learning
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
26
17
0
25 May 2023
Matrix Estimation for Offline Reinforcement Learning with Low-Rank
  Structure
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure
Xumei Xi
C. Yu
Yudong Chen
OffRL
22
0
0
24 May 2023
On the Statistical Efficiency of Mean Field Reinforcement Learning with
  General Function Approximation
On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation
Jiawei Huang
Batuhan Yardim
Niao He
34
10
0
18 May 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid
  Reinforcement Learning
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRL
OnRL
73
12
0
17 May 2023
Goal-oriented inference of environment from redundant observations
Goal-oriented inference of environment from redundant observations
Kazuki Takahashi
T. Fukai
Y. Sakai
T. Takekawa
9
0
0
08 May 2023
Representations and Exploration for Deep Reinforcement Learning using
  Singular Value Decomposition
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
13
2
0
01 May 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
Dingwen Kong
Lin F. Yang
29
9
0
18 Apr 2023
Representation Learning with Multi-Step Inverse Kinematics: An Efficient
  and Optimal Approach to Rich-Observation RL
Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL
Zakaria Mhammedi
Dylan J. Foster
Alexander Rakhlin
63
18
0
12 Apr 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?
Does Sparsity Help in Learning Misspecified Linear Bandits?
Jialin Dong
Lin F. Yang
19
1
0
29 Mar 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under
  Low-rank MDPs
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan-Chia Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
37
8
0
20 Mar 2023
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum
  Markov Games
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games
Anna Winnicki
R. Srikant
34
1
0
17 Mar 2023
On the Provable Advantage of Unsupervised Pretraining
On the Provable Advantage of Unsupervised Pretraining
Jiawei Ge
Shange Tang
Jianqing Fan
Chi Jin
SSL
33
16
0
02 Mar 2023
Finite-sample Guarantees for Nash Q-learning with Linear Function
  Approximation
Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation
Pedro Cisneros-Velarde
Oluwasanmi Koyejo
18
1
0
01 Mar 2023
Exponential Hardness of Reinforcement Learning with Linear Function
  Approximation
Exponential Hardness of Reinforcement Learning with Linear Function Approximation
Daniel M. Kane
Sihan Liu
Shachar Lovett
G. Mahajan
Csaba Szepesvári
Gellert Weisz
46
3
0
25 Feb 2023
Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs
Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs
Dan Qiao
Ming Yin
Yu-Xiang Wang
30
6
0
24 Feb 2023
Adversarial Model for Offline Reinforcement Learning
Adversarial Model for Offline Reinforcement Learning
M. Bhardwaj
Tengyang Xie
Byron Boots
Nan Jiang
Ching-An Cheng
AAML
OffRL
27
25
0
21 Feb 2023
Distributional Offline Policy Evaluation with Predictive Error
  Guarantees
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
33
13
0
19 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov
  Games with Independent Linear Function Approximation
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation
Qiwen Cui
K. Zhang
S. Du
28
23
0
07 Feb 2023
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Kaiwen Wang
Nathan Kallus
Wen Sun
101
18
0
07 Feb 2023
Reinforcement Learning in Low-Rank MDPs with Density Features
Reinforcement Learning in Low-Rank MDPs with Density Features
Audrey Huang
Jinglin Chen
Nan Jiang
OffRL
13
14
0
04 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
26
19
0
31 Jan 2023
Theoretical Analysis of Offline Imitation With Supplementary Dataset
Theoretical Analysis of Offline Imitation With Supplementary Dataset
Ziniu Li
Tian Xu
Y. Yu
Zhixun Luo
OffRL
27
2
0
27 Jan 2023
Exploration in Model-based Reinforcement Learning with Randomized Reward
Exploration in Model-based Reinforcement Learning with Randomized Reward
Lingxiao Wang
Ping Li
11
0
0
09 Jan 2023
Latent Variable Representation for Reinforcement Learning
Latent Variable Representation for Reinforcement Learning
Tongzheng Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
16
10
0
17 Dec 2022
Provably Efficient Model-free RL in Leader-Follower MDP with Linear
  Function Approximation
Provably Efficient Model-free RL in Leader-Follower MDP with Linear Function Approximation
A. Ghosh
15
1
0
28 Nov 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
28
1
0
28 Nov 2022
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies
  with Offline Data
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data
Tengyang Xie
M. Bhardwaj
Nan Jiang
Ching-An Cheng
OffRL
20
9
0
08 Nov 2022
Previous
1234
Next