ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.05388
  4. Cited By
Provably Efficient Reinforcement Learning with Linear Function
  Approximation
v1v2 (latest)

Provably Efficient Reinforcement Learning with Linear Function Approximation

11 July 2019
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
ArXiv (abs)PDFHTML

Papers citing "Provably Efficient Reinforcement Learning with Linear Function Approximation"

50 / 417 papers shown
Title
Provable Reset-free Reinforcement Learning by No-Regret Reduction
Provable Reset-free Reinforcement Learning by No-Regret Reduction
Hoai-An Nguyen
Ching-An Cheng
OffRL
93
3
0
06 Jan 2023
Model-Based Reinforcement Learning with Multinomial Logistic Function
  Approximation
Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation
Taehyun Hwang
Min Hwan Oh
96
9
0
27 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine
  Interaction with Private Information
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
81
0
0
23 Dec 2022
Near-optimal Policy Identification in Active Reinforcement Learning
Near-optimal Policy Identification in Active Reinforcement Learning
Xiang Li
Viraj Mehta
Johannes Kirschner
I. Char
Willie Neiswanger
J. Schneider
Andreas Krause
Ilija Bogunovic
OffRL
89
6
0
19 Dec 2022
Latent Variable Representation for Reinforcement Learning
Latent Variable Representation for Reinforcement Learning
Zhaolin Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
106
10
0
17 Dec 2022
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision
  Processes
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
Jiafan He
Heyang Zhao
Dongruo Zhou
Quanquan Gu
OffRL
136
55
0
12 Dec 2022
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear
  Contextual Bandits and Markov Decision Processes
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
Chen Ye
Wei Xiong
Quanquan Gu
Tong Zhang
218
31
0
12 Dec 2022
Flow to Control: Offline Reinforcement Learning with Lossless Primitive
  Discovery
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Yiqin Yang
Haotian Hu
Wenzhe Li
Siyuan Li
Jun Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
86
10
0
02 Dec 2022
Causal Deep Reinforcement Learning Using Observational Data
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CMLOffRL
75
5
0
28 Nov 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
102
2
0
28 Nov 2022
Model-Free Reinforcement Learning with the Decision-Estimation
  Coefficient
Model-Free Reinforcement Learning with the Decision-Estimation Coefficient
Dylan J. Foster
Noah Golowich
Jian Qian
Alexander Rakhlin
Ayush Sekhari
OffRL
98
10
0
25 Nov 2022
On Instance-Dependent Bounds for Offline Reinforcement Learning with
  Linear Function Approximation
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Thanh Nguyen-Tang
Ming Yin
Sunil R. Gupta
Svetha Venkatesh
R. Arora
OffRL
95
19
0
23 Nov 2022
Leveraging Offline Data in Online Reinforcement Learning
Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker
Aldo Pacchiano
OffRLOnRL
105
41
0
09 Nov 2022
Confident Approximate Policy Iteration for Efficient Local Planning in
  $q^π$-realizable MDPs
Confident Approximate Policy Iteration for Efficient Local Planning in qπq^πqπ-realizable MDPs
Gellert Weisz
András Gyorgy
Tadashi Kozuno
Csaba Szepesvári
78
7
0
27 Oct 2022
Efficient Global Planning in Large MDPs via Stochastic Primal-Dual
  Optimization
Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization
Gergely Neu
Nneka Okolo
108
7
0
21 Oct 2022
On the Power of Pre-training for Generalization in RL: Provable Benefits
  and Hardness
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
90
7
0
19 Oct 2022
A Reinforcement Learning Approach in Multi-Phase Second-Price Auction
  Design
A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Rui Ai
Boxiang Lyu
Zhaoran Wang
Zhuoran Yang
Michael I. Jordan
78
3
0
19 Oct 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward
  Engineering on Sample Complexity
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
105
67
0
18 Oct 2022
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Yuda Song
Yi Zhou
Ayush Sekhari
J. Andrew Bagnell
A. Krishnamurthy
Wen Sun
OffRLOnRL
97
105
0
13 Oct 2022
Multi-User Reinforcement Learning with Low Rank Rewards
Multi-User Reinforcement Learning with Low Rank Rewards
Naman Agarwal
Prateek Jain
S. Kowshik
Dheeraj M. Nagaraj
Praneeth Netrapalli
OffRL
91
1
0
11 Oct 2022
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with
  Tractable Exploration and Planning
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Reda Ouhamma
D. Basu
Odalric-Ambrym Maillard
OffRL
82
12
0
05 Oct 2022
Offline Reinforcement Learning with Differentiable Function
  Approximation is Provably Efficient
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient
Ming Yin
Mengdi Wang
Yu Wang
OffRL
143
12
0
03 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning
  with Linear Function Approximation
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
Dan Qiao
Yu Wang
OffRL
133
13
0
03 Oct 2022
A General Framework for Sample-Efficient Function Approximation in
  Reinforcement Learning
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Zixiang Chen
C. J. Li
An Yuan
Quanquan Gu
Michael I. Jordan
OffRL
154
27
0
30 Sep 2022
Partially Observable RL with B-Stability: Unified Structural Condition
  and Sharp Sample-Efficient Algorithms
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms
Fan Chen
Yu Bai
Song Mei
100
22
0
29 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based
  Reinforcement Learning
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
Shen Zhang
60
6
0
16 Sep 2022
Understanding Deep Neural Function Approximation in Reinforcement
  Learning via $ε$-Greedy Exploration
Understanding Deep Neural Function Approximation in Reinforcement Learning via εεε-Greedy Exploration
Fanghui Liu
Luca Viano
Volkan Cevher
116
20
0
15 Sep 2022
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Zixuan Dong
Che Wang
George Andriopoulos
93
3
0
07 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
101
17
0
26 Aug 2022
A Provably Efficient Model-Free Posterior Sampling Method for Episodic
  Reinforcement Learning
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
Christoph Dann
M. Mohri
Tong Zhang
Julian Zimmert
OffRL
58
36
0
23 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Spectral Decomposition Representation for Reinforcement Learning
Zhaolin Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
100
29
0
19 Aug 2022
Best Policy Identification in Linear MDPs
Best Policy Identification in Linear MDPs
Jerome Taupin
Yassir Jedra
Alexandre Proutiere
102
4
0
11 Aug 2022
Learning Two-Player Mixture Markov Games: Kernel Function Approximation
  and Correlated Equilibrium
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium
C. J. Li
Dongruo Zhou
Quanquan Gu
Michael I. Jordan
83
2
0
10 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning
  in Online Reinforcement Learning
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Shuang Qiu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSLOffRL
143
32
0
29 Jul 2022
A Few Expert Queries Suffices for Sample-Efficient RL with Resets and
  Linear Value Approximation
A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
Philip Amortila
Nan Jiang
Dhruv Madeka
Dean Phillips Foster
81
5
0
18 Jul 2022
Making Linear MDPs Practical via Contrastive Representation Learning
Making Linear MDPs Practical via Contrastive Representation Learning
Tianjun Zhang
Zhaolin Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
74
44
0
14 Jul 2022
Model Selection in Reinforcement Learning with General Function
  Approximations
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
55
3
0
06 Jul 2022
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via
  Online Experiment Design
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design
Andrew Wagenmaker
Kevin Jamieson
OffRL
95
29
0
06 Jul 2022
Provably Efficient Reinforcement Learning for Online Adaptive Influence
  Maximization
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
Kaixuan Huang
Yuehua Wu
Xuezhou Zhang
Shenyinying Tu
Qingyun Wu
Mengdi Wang
Huazheng Wang
39
1
0
29 Jun 2022
Safe Exploration Incurs Nearly No Additional Sample Complexity for
  Reward-free RL
Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RL
Ruiquan Huang
J. Yang
Yingbin Liang
OffRL
117
9
0
28 Jun 2022
On the Complexity of Adversarial Decision Making
On the Complexity of Adversarial Decision Making
Dylan J. Foster
Alexander Rakhlin
Ayush Sekhari
Karthik Sridharan
AAML
79
29
0
27 Jun 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and
  Conditional Embeddings
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
95
6
0
24 Jun 2022
Provably Efficient Reinforcement Learning in Partially Observable
  Dynamical Systems
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
OffRL
107
36
0
24 Jun 2022
Provably Efficient Model-Free Constrained RL with Linear Function
  Approximation
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
A. Ghosh
Xingyu Zhou
Ness B. Shroff
148
28
0
23 Jun 2022
Nearly Minimax Optimal Reinforcement Learning with Linear Function
  Approximation
Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation
Pihe Hu
Yu Chen
Longbo Huang
86
35
0
23 Jun 2022
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear
  RL
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL
Jinglin Chen
Aditya Modi
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
114
26
0
21 Jun 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function
  Approximation
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
120
53
0
19 Jun 2022
Model-based RL with Optimistic Posterior Sampling: Structural Conditions
  and Sample Complexity
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity
Alekh Agarwal
Tong Zhang
106
23
0
15 Jun 2022
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise
  Reward
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward
Tengyu Xu
Yue Wang
Shaofeng Zou
Yingbin Liang
OffRL
97
13
0
13 Jun 2022
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
Qinbo Bai
Amrit Singh Bedi
Vaneet Aggarwal
100
24
0
12 Jun 2022
Previous
123456789
Next