ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.05388
  4. Cited By
Provably Efficient Reinforcement Learning with Linear Function
  Approximation
v1v2 (latest)

Provably Efficient Reinforcement Learning with Linear Function Approximation

11 July 2019
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
ArXiv (abs)PDFHTML

Papers citing "Provably Efficient Reinforcement Learning with Linear Function Approximation"

50 / 417 papers shown
Title
Understanding Domain Randomization for Sim-to-real Transfer
Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen
Jiachen Hu
Chi Jin
Lihong Li
Liwei Wang
202
118
0
07 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CMLFAttOffRL
68
2
0
06 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement
  Learning
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
Tong Zhang
87
65
0
02 Oct 2021
Dimension-Free Rates for Natural Policy Gradient in Multi-Agent
  Reinforcement Learning
Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning
Carlo Alfano
Patrick Rebeschini
91
5
0
23 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
On the Approximation of Cooperative Heterogeneous Multi-Agent
  Reinforcement Learning (MARL) using Mean Field Control (MFC)
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
Washim Uddin Mondal
Mridul Agarwal
Vaneet Aggarwal
S. Ukkusuri
134
44
0
09 Sep 2021
Adaptive Control of Differentially Private Linear Quadratic Systems
Adaptive Control of Differentially Private Linear Quadratic Systems
Sayak Ray Chowdhury
Xingyu Zhou
Ness B. Shroff
75
10
0
26 Aug 2021
A Boosting Approach to Reinforcement Learning
A Boosting Approach to Reinforcement Learning
Nataly Brukhim
Elad Hazan
Karan Singh
84
14
0
22 Aug 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement
  Learning
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
104
119
0
19 Aug 2021
Towards General Function Approximation in Zero-Sum Markov Games
Towards General Function Approximation in Zero-Sum Markov Games
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
88
47
0
30 Jul 2021
Policy Optimization in Adversarial MDPs: Improved Exploration via
  Dilated Bonuses
Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses
Haipeng Luo
Chen-Yu Wei
Chung-Wei Lee
124
45
0
18 Jul 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial
  Coverage
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
194
150
0
13 Jul 2021
Model Selection for Generic Reinforcement Learning
Model Selection for Generic Reinforcement Learning
Avishek Ghosh
Sayak Ray Chowdhury
Kannan Ramchandran
54
1
0
13 Jul 2021
Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning
Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning
Barna Pásztor
Ilija Bogunovic
Andreas Krause
88
41
0
08 Jul 2021
Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Yifei Min
Tianhao Wang
Dongruo Zhou
Quanquan Gu
OffRL
89
38
0
22 Jun 2021
Provably Efficient Representation Selection in Low-rank Markov Decision
  Processes: From Online to Offline RL
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Weitong Zhang
Jiafan He
Dongruo Zhou
Amy Zhang
Quanquan Gu
OffRL
72
11
0
22 Jun 2021
Uniform-PAC Bounds for Reinforcement Learning with Linear Function
  Approximation
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation
Jiafan He
Dongruo Zhou
Quanquan Gu
55
13
0
22 Jun 2021
Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations
Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
OffRL
57
11
0
22 Jun 2021
On the Power of Multitask Representation Learning in Linear MDP
On the Power of Multitask Representation Learning in Linear MDP
Rui Lu
Gao Huang
S. Du
85
29
0
15 Jun 2021
Randomized Exploration for Reinforcement Learning with General Value
  Function Approximation
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq
Qiwen Cui
V. Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin F. Yang
96
48
0
15 Jun 2021
Sample Efficient Reinforcement Learning In Continuous State Spaces: A
  Perspective Beyond Linearity
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Dhruv Malik
Aldo Pacchiano
Vishwak Srinivasan
Yuanzhi Li
57
6
0
15 Jun 2021
Online Sub-Sampling for Reinforcement Learning with General Function
  Approximation
Online Sub-Sampling for Reinforcement Learning with General Function Approximation
Dingwen Kong
Ruslan Salakhutdinov
Ruosong Wang
Lin F. Yang
OffRL
75
1
0
14 Jun 2021
Bellman-consistent Pessimism for Offline Reinforcement Learning
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRLLRM
240
280
0
13 Jun 2021
Corruption-Robust Offline Reinforcement Learning
Corruption-Robust Offline Reinforcement Learning
Xuezhou Zhang
Yiding Chen
Jerry Zhu
Wen Sun
OffRL
94
43
0
11 Jun 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online
  Reinforcement Learning
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRLOnRL
109
165
0
09 Jun 2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov
  Decision Processes
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes
Honghao Wei
Xin Liu
Lei Ying
112
23
0
03 Jun 2021
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs
  with a Generative Model
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model
Bingyan Wang
Yuling Yan
Jianqing Fan
110
20
0
28 May 2021
Sample-Efficient Reinforcement Learning Is Feasible for Linearly
  Realizable MDPs with Limited Revisiting
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting
Gen Li
Yuxin Chen
Yuejie Chi
Yuantao Gu
Yuting Wei
OffRL
92
30
0
17 May 2021
Online Algorithms and Policies Using Adaptive and Machine Learning
  Approaches
Online Algorithms and Policies Using Adaptive and Machine Learning Approaches
Anuradha M. Annaswamy
A. Guha
Yingnan Cui
Sunbochen Tang
Peter A. Fisher
Joseph E. Gaudio
81
19
0
13 May 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in
  Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Ming Yin
Yu Wang
OffRL
106
19
0
13 May 2021
Towards Theoretical Understandings of Robust Markov Decision Processes:
  Sample Complexity and Asymptotics
Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
81
35
0
09 May 2021
Learning Good State and Action Representations via Tensor Decomposition
Learning Good State and Action Representations via Tensor Decomposition
Chengzhuo Ni
Yaqi Duan
M. Dahleh
Anru R. Zhang
Mengdi Wang
94
7
0
03 May 2021
An $L^2$ Analysis of Reinforcement Learning in High Dimensions with
  Kernel and Neural Network Approximation
An L2L^2L2 Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation
Jihao Long
Jiequn Han null
Weinan E
OffRL
75
15
0
15 Apr 2021
Rule-Based Reinforcement Learning for Efficient Robot Navigation with
  Space Reduction
Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space Reduction
Yuanyang Zhu
Zhi Wang
Chunlin Chen
D. Dong
42
37
0
15 Apr 2021
Leveraging Good Representations in Linear Contextual Bandits
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
73
27
0
08 Apr 2021
Policy Information Capacity: Information-Theoretic Measure for Task
  Complexity in Deep Reinforcement Learning
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Hiroki Furuta
T. Matsushima
Tadashi Kozuno
Y. Matsuo
Sergey Levine
Ofir Nachum
S. Gu
OffRL
58
14
0
23 Mar 2021
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant
  Suboptimality Gap
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Yuanhao Wang
Ruosong Wang
Sham Kakade
OffRL
193
43
0
23 Mar 2021
Provably Correct Optimization and Exploration with Non-linear Policies
Provably Correct Optimization and Exploration with Non-linear Policies
Fei Feng
W. Yin
Alekh Agarwal
Lin F. Yang
156
13
0
22 Mar 2021
Infinite-Horizon Offline Reinforcement Learning with Linear Function
  Approximation: Curse of Dimensionality and Algorithm
Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm
Lin Chen
B. Scherrer
Peter L. Bartlett
OffRL
213
16
0
17 Mar 2021
Sample Complexity of Offline Reinforcement Learning with Deep ReLU
  Networks
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks
Thanh Nguyen-Tang
Sunil R. Gupta
Hung The Tran
Svetha Venkatesh
OffRL
137
7
0
11 Mar 2021
Online Learning for Unknown Partially Observable MDPs
Online Learning for Unknown Partially Observable MDPs
Mehdi Jafarnia-Jahromi
Rahul Jain
A. Nayyar
108
20
0
25 Feb 2021
Near-Optimal Randomized Exploration for Tabular Markov Decision
  Processes
Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
Zhihan Xiong
Ruoqi Shen
Qiwen Cui
Maryam Fazel
S. Du
92
10
0
19 Feb 2021
Near-optimal Policy Optimization Algorithms for Learning Adversarial
  Linear Mixture MDPs
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
Jiafan He
Dongruo Zhou
Quanquan Gu
139
24
0
17 Feb 2021
Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov
  Games
Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
Zixiang Chen
Dongruo Zhou
Quanquan Gu
75
25
0
15 Feb 2021
Nearly Minimax Optimal Regret for Learning Infinite-horizon
  Average-reward MDPs with Linear Function Approximation
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Yue Wu
Dongruo Zhou
Quanquan Gu
62
21
0
15 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous
  Agents via Personalized Simulators
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
76
18
0
13 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An
  Optimal Black-box Approach
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
189
108
0
10 Feb 2021
Simple Agent, Complex Environment: Efficient Reinforcement Learning with
  Agent States
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
Shi Dong
Benjamin Van Roy
Zhengyuan Zhou
110
32
0
10 Feb 2021
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve
  Optimism, Embrace Virtual Curvature
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Kefan Dong
Jiaqi Yang
Tengyu Ma
97
33
0
08 Feb 2021
Near-optimal Representation Learning for Linear Bandits and Linear RL
Near-optimal Representation Learning for Linear Bandits and Linear RL
Jiachen Hu
Xiaoyu Chen
Chi Jin
Lihong Li
Liwei Wang
OffRL
165
53
0
08 Feb 2021
Previous
123456789
Next