ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.05388
  4. Cited By
Provably Efficient Reinforcement Learning with Linear Function
  Approximation
v1v2 (latest)

Provably Efficient Reinforcement Learning with Linear Function Approximation

11 July 2019
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
ArXiv (abs)PDFHTML

Papers citing "Provably Efficient Reinforcement Learning with Linear Function Approximation"

50 / 417 papers shown
Title
Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement
  Learning with Latent Low-Rank Structure
Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure
Tyler Sam
Yudong Chen
Chao Yu
OffRL
135
7
0
07 Jun 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
119
81
0
06 Jun 2022
Sample-Efficient Reinforcement Learning of Partially Observable Markov
  Games
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games
Qinghua Liu
Csaba Szepesvári
Chi Jin
114
21
0
02 Jun 2022
Posterior Coreset Construction with Kernelized Stein Discrepancy for
  Model-Based Reinforcement Learning
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Brian M. Sadler
Furong Huang
Pratap Tokekar
Tianyi Zhou
79
9
0
02 Jun 2022
Offline Reinforcement Learning with Differential Privacy
Offline Reinforcement Learning with Differential Privacy
Dan Qiao
Yu Wang
OffRL
121
23
0
02 Jun 2022
One Policy is Enough: Parallel Exploration with a Single Policy is
  Near-Optimal for Reward-Free Reinforcement Learning
One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning
Pedro Cisneros-Velarde
Boxiang Lyu
Oluwasanmi Koyejo
Mladen Kolar
OffRL
75
3
0
31 May 2022
Provable General Function Class Representation Learning in Multitask
  Bandits and MDPs
Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Rui Lu
Andrew Zhao
S. Du
Gao Huang
OffRL
104
10
0
31 May 2022
Computationally Efficient Horizon-Free Reinforcement Learning for Linear
  Mixture MDPs
Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs
Dongruo Zhou
Quanquan Gu
122
45
0
23 May 2022
Human-in-the-loop: Provably Efficient Preference-based Reinforcement
  Learning with General Function Approximation
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
Xiaoyu Chen
Han Zhong
Zhuoran Yang
Zhaoran Wang
Liwei Wang
192
70
0
23 May 2022
Spiking Neural Operators for Scientific Machine Learning
Spiking Neural Operators for Scientific Machine Learning
Adar Kahana
Qian Zhang
Leonard Gleyzer
George Karniadakis
65
9
0
17 May 2022
Provably Efficient Kernelized Q-Learning
Provably Efficient Kernelized Q-Learning
Shuang Liu
H. Su
MLT
100
4
0
21 Apr 2022
Reinforcement Learning from Partial Observation: Linear Function
  Approximation with Provable Sample Efficiency
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
Qi Cai
Zhuoran Yang
Zhaoran Wang
66
14
0
20 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRLOnRL
110
116
0
05 Apr 2022
Non-Linear Reinforcement Learning in Large Action Spaces: Structural
  Conditions and Sample-efficiency of Posterior Sampling
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling
Alekh Agarwal
Tong Zhang
77
8
0
15 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation:
  Leveraging Variance Information with Pessimism
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Ming Yin
Yaqi Duan
Mengdi Wang
Yu Wang
OffRL
127
67
0
11 Mar 2022
A Complete Characterization of Linear Estimators for Offline Policy
  Evaluation
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
Juan C. Perdomo
A. Krishnamurthy
Peter L. Bartlett
Sham Kakade
OffRL
80
3
0
08 Mar 2022
Learn to Match with No Regret: Reinforcement Learning in Markov Matching
  Markets
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets
Yifei Min
Tianhao Wang
Ruitu Xu
Zhaoran Wang
Michael I. Jordan
Zhuoran Yang
94
23
0
07 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in $Q$-Learning
Target Network and Truncation Overcome The Deadly Triad in QQQ-Learning
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
70
19
0
05 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement
  Learning
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
106
141
0
23 Feb 2022
Sequential Information Design: Markov Persuasion Process and Its
  Efficient Reinforcement Learning
Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning
Jibang Wu
Zixuan Zhang
Zhe Feng
Zhaoran Wang
Zhuoran Yang
Michael I. Jordan
Haifeng Xu
91
37
0
22 Feb 2022
Provably Efficient Causal Model-Based Reinforcement Learning for
  Systematic Generalization
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Mirco Mutti
Ric De Santi
Emanuele Rossi
J. Calderón
Michael M. Bronstein
Marcello Restelli
113
14
0
14 Feb 2022
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and
  Optimality
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
Jiawei Huang
Jinglin Chen
Li Zhao
Tao Qin
Nan Jiang
Tie-Yan Liu
OffRL
106
24
0
14 Feb 2022
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
Dan Qiao
Ming Yin
Ming Min
Yu Wang
91
29
0
13 Feb 2022
Shuffle Private Linear Contextual Bandits
Shuffle Private Linear Contextual Bandits
Sayak Ray Chowdhury
Xingyu Zhou
FedML
98
27
0
11 Feb 2022
Computational-Statistical Gaps in Reinforcement Learning
Computational-Statistical Gaps in Reinforcement Learning
D. Kane
Sihan Liu
Shachar Lovett
G. Mahajan
57
5
0
11 Feb 2022
Off-Policy Fitted Q-Evaluation with Differentiable Function
  Approximators: Z-Estimation and Inference Theory
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang
Xuezhou Zhang
Chengzhuo Ni
Mengdi Wang
OffRL
90
16
0
10 Feb 2022
No-Regret Learning in Dynamic Stackelberg Games
No-Regret Learning in Dynamic Stackelberg Games
Niklas Lauffer
Mahsa Ghasemi
Abolfazl Hashemi
Y. Savas
Ufuk Topcu
90
20
0
10 Feb 2022
Improved Regret for Differentially Private Exploration in Linear MDP
Improved Regret for Differentially Private Exploration in Linear MDP
Dung Daniel Ngo
G. Vietri
Zhiwei Steven Wu
96
8
0
02 Feb 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free
  Representation Learning Approach
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Xuezhou Zhang
Yuda Song
Masatoshi Uehara
Mengdi Wang
Alekh Agarwal
Wen Sun
OffRL
142
58
0
31 Jan 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Tiancheng Jin
Tal Lancewicki
Haipeng Luo
Yishay Mansour
Aviv A. Rosenberg
133
22
0
31 Jan 2022
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with
  Non-stationary Objectives and Constraints
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints
Yuhao Ding
Javad Lavaei
110
12
0
28 Jan 2022
Differentially Private Reinforcement Learning with Linear Function
  Approximation
Differentially Private Reinforcement Learning with Linear Function Approximation
Xingyu Zhou
97
26
0
18 Jan 2022
Exponential Family Model-Based Reinforcement Learning via Score Matching
Exponential Family Model-Based Reinforcement Learning via Score Matching
Gen Li
Junbo Li
Anmol Kabra
Nathan Srebro
Zhaoran Wang
Zhuoran Yang
95
4
0
28 Dec 2021
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of
  Representation Learning in Actor-Critic
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
Yufeng Zhang
Siyu Chen
Zhuoran Yang
Michael I. Jordan
Zhaoran Wang
128
4
0
27 Dec 2021
Differentially Private Regret Minimization in Episodic Markov Decision
  Processes
Differentially Private Regret Minimization in Episodic Markov Decision Processes
Sayak Ray Chowdhury
Xingyu Zhou
85
22
0
20 Dec 2021
First-Order Regret in Reinforcement Learning with Linear Function
  Approximation: A Robust Estimation Approach
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker
Yifang Chen
Max Simchowitz
S. Du
Kevin Jamieson
182
39
0
07 Dec 2021
Misspecified Gaussian Process Bandit Optimization
Misspecified Gaussian Process Bandit Optimization
Ilija Bogunovic
Andreas Krause
98
45
0
09 Nov 2021
Safe Policy Optimization with Local Generalized Linear Function
  Approximations
Safe Policy Optimization with Local Generalized Linear Function Approximations
Akifumi Wachi
Yunyue Wei
Yanan Sui
OffRL
72
10
0
09 Nov 2021
Exponential Bellman Equation and Improved Regret Bounds for
  Risk-Sensitive Reinforcement Learning
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
100
48
0
06 Nov 2021
Perturbational Complexity by Distribution Mismatch: A Systematic
  Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space
Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space
Jihao Long
Jiequn Han
60
7
0
05 Nov 2021
Improved Regret Analysis for Variance-Adaptive Linear Bandits and
  Horizon-Free Linear Mixture MDPs
Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs
Yeoneung Kim
Insoon Yang
Kwang-Sung Jun
102
38
0
05 Nov 2021
Adaptive Discretization in Online Reinforcement Learning
Adaptive Discretization in Online Reinforcement Learning
Sean R. Sinclair
Siddhartha Banerjee
Chao Yu
OffRL
87
17
0
29 Oct 2021
Reinforcement Learning in Linear MDPs: Constant Regret and
  Representation Selection
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Matteo Papini
Andrea Tirinzoni
Aldo Pacchiano
Marcello Restelli
A. Lazaric
Matteo Pirotta
88
20
0
27 Oct 2021
V-Learning -- A Simple, Efficient, Decentralized Algorithm for
  Multiagent RL
V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL
Chi Jin
Qinghua Liu
Yuanhao Wang
Tiancheng Yu
OffRL
96
132
0
27 Oct 2021
Learning Stochastic Shortest Path with Linear Function Approximation
Learning Stochastic Shortest Path with Linear Function Approximation
Steffen Czolbe
Jiafan He
Adrian Dalca
Quanquan Gu
86
30
0
25 Oct 2021
Locally Differentially Private Reinforcement Learning for Linear Mixture
  Markov Decision Processes
Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes
Chonghua Liao
Jiafan He
Quanquan Gu
79
17
0
19 Oct 2021
Online Target Q-learning with Reverse Experience Replay: Efficiently
  finding the Optimal Policy for Linear MDPs
Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs
Naman Agarwal
Syomantak Chaudhuri
Prateek Jain
Dheeraj M. Nagaraj
Praneeth Netrapalli
OffRL
113
21
0
16 Oct 2021
Reward-Free Model-Based Reinforcement Learning with Linear Function
  Approximation
Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
Weitong Zhang
Dongruo Zhou
Quanquan Gu
OffRL
86
28
0
12 Oct 2021
Representation Learning for Online and Offline RL in Low-rank MDPs
Representation Learning for Online and Offline RL in Low-rank MDPs
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
156
129
0
09 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor
  Function Approximation
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation
Junhong Shen
Lin F. Yang
OffRL
51
18
0
09 Oct 2021
Previous
123456789
Next