Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.04652
Cited By
Representation Learning for Online and Offline RL in Low-rank MDPs
9 October 2021
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Representation Learning for Online and Offline RL in Low-rank MDPs"
50 / 105 papers shown
Title
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
11
0
0
16 May 2025
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Alexander Levine
Peter Stone
Amy Zhang
OffRL
57
0
0
26 Mar 2025
Accelerating Multi-Task Temporal Difference Learning under Low-Rank Representation
Yitao Bai
Sihan Zeng
Justin Romberg
Thinh T. Doan
OffRL
41
0
0
03 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
52
1
0
01 Mar 2025
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu
Lingfeng Zhao
Shivangi Agarwal
Jinghan Liu
Audrey Huang
P. Amortila
Nan Jiang
OODD
OffRL
101
0
0
11 Feb 2025
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Joongkyu Lee
Min-hwan Oh
38
2
0
31 Oct 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
Stefan Stojanovic
Yassir Jedra
Alexandre Proutiere
33
0
0
30 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
32
0
0
23 Oct 2024
Scalable spectral representations for multi-agent reinforcement learning in network MDPs
Zhaolin Ren
Runyu
Zhang
Bo Dai
17
0
0
22 Oct 2024
Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples
Thomas T. Zhang
Bruce D. Lee
Ingvar M. Ziemann
George J. Pappas
Nikolai Matni
CML
OOD
44
0
0
15 Oct 2024
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Alexander Levine
Peter Stone
Amy Zhang
OffRL
41
0
0
03 Oct 2024
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan
Scott Fujimoto
Zheqing Zhu
Jason D. Lee
Daniel Jiang
Yonathan Efroni
OffRL
29
0
0
01 Oct 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
73
1
0
22 Aug 2024
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang
Boxiang Lyu
Shuang Qiu
Mladen Kolar
Tong Zhang
OffRL
38
0
0
10 Jul 2024
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang
Weitong Zhang
Dongruo Zhou
Q. Gu
54
3
0
24 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
26
3
0
23 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
48
0
0
04 Jun 2024
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
42
3
0
10 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
41
2
0
10 May 2024
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
39
28
0
30 Apr 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
28
0
0
11 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Zhaolin Ren
Bo Dai
Na Li
37
1
0
07 Apr 2024
Towards Principled Representation Learning from Videos for Reinforcement Learning
Dipendra Kumar Misra
Akanksha Saran
Tengyang Xie
Alex Lamb
John Langford
SSL
OffRL
34
5
0
20 Mar 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Ziping Xu
Zifan Xu
Runxuan Jiang
Peter Stone
Ambuj Tewari
45
1
0
03 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
39
3
0
28 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
57
12
0
19 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
42
1
0
16 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
46
5
0
14 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
85
12
0
11 Feb 2024
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL
Jiawei Huang
Niao He
Andreas Krause
32
6
0
08 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restell
36
5
0
06 Feb 2024
Sample Complexity Characterization for Linear Contextual MDPs
Junze Deng
Yuan Cheng
Shaofeng Zou
Yingbin Liang
30
1
0
05 Feb 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
33
3
0
06 Jan 2024
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Provably Efficient CVaR RL in Low-rank MDPs
Yulai Zhao
Wenhao Zhan
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
Wen Sun
Jason D. Lee
29
4
0
20 Nov 2023
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Yifei Zhou
Ayush Sekhari
Yuda Song
Wen Sun
OffRL
OnRL
30
8
0
14 Nov 2023
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Canzhe Zhao
Ruofeng Yang
Baoxiang Wang
Xuezhou Zhang
Shuai Li
27
2
0
14 Nov 2023
Low-Rank MDPs with Continuous Action Spaces
Andrew Bennett
Nathan Kallus
M. Oprescu
33
2
0
06 Nov 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
25
3
0
23 Oct 2023
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning
Stefan Stojanovic
Yassir Jedra
Alexandre Proutière
31
5
0
10 Oct 2023
When is Agnostic Reinforcement Learning Statistically Tractable?
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
27
5
0
09 Oct 2023
Representation Learning in Low-rank Slate-based Recommender Systems
Yijia Dai
Wen Sun
OffRL
25
0
0
10 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan Cheng
J. Yang
Yitao Liang
OOD
36
1
0
10 Aug 2023
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
26
3
0
25 Jul 2023
Efficient Model-Free Exploration in Low-Rank MDPs
Zakaria Mhammedi
Adam Block
Dylan J. Foster
Alexander Rakhlin
OffRL
24
13
0
08 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
24
5
0
01 Jul 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
32
4
0
29 Jun 2023
Context-lumpable stochastic bandits
Chung-Wei Lee
Qinghua Liu
Yasin Abbasi-Yadkori
Chi Jin
Tor Lattimore
Csaba Szepesvári
OffRL
100
2
0
22 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li
Peizhong Ju
Ness B. Shroff
28
0
0
22 Jun 2023
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
32
5
0
21 Jun 2023
1
2
3
Next