ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.01107
  4. Cited By
Model-Based Reinforcement Learning with Value-Targeted Regression

Model-Based Reinforcement Learning with Value-Targeted Regression

1 June 2020
Alex Ayoub
Zeyu Jia
Csaba Szepesvári
Mengdi Wang
Lin F. Yang
    OffRL
ArXivPDFHTML

Papers citing "Model-Based Reinforcement Learning with Value-Targeted Regression"

50 / 223 papers shown
Title
Provably Efficient Representation Learning with Tractable Planning in
  Low-Rank POMDP
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
37
5
0
21 Jun 2023
On the Model-Misspecification in Reinforcement Learning
On the Model-Misspecification in Reinforcement Learning
Yunfan Li
Lin F. Yang
44
5
0
19 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High
  Dimensions
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
38
3
0
17 Jun 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial
  Online State Information
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information
Ming Shi
Yingbin Liang
Ness B. Shroff
34
2
0
14 Jun 2023
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function
  Approximation: Minimax Optimal and Instance-Dependent Regret Bounds
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds
Jiayi Huang
Han Zhong
Liwei Wang
Lin F. Yang
24
6
0
12 Jun 2023
Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz
  Dynamic Risk Measures
Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures
Hao Liang
Zhihui Luo
27
4
0
04 Jun 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning,
  and Exploration
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Zhihan Liu
Miao Lu
Wei Xiong
Han Zhong
Haotian Hu
Shenao Zhang
Sirui Zheng
Zhuoran Yang
Zhaoran Wang
OffRL
53
22
0
29 May 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning
  via Langevin Monte Carlo
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
30
20
0
29 May 2023
On the Statistical Efficiency of Mean Field Reinforcement Learning with
  General Function Approximation
On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation
Jiawei Huang
Batuhan Yardim
Niao He
44
10
0
18 May 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in
  Linear Markov Decision Processes
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Han Zhong
Tong Zhang
35
26
0
15 May 2023
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
Kaixuan Ji
Qingyue Zhao
Jiafan He
Weitong Zhang
Q. Gu
55
4
0
15 May 2023
Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension
Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension
Yue Wu
Jiafan He
Quanquan Gu
19
2
0
15 May 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial
  MDP with Delayed Bandit Feedback
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
Tal Lancewicki
Aviv A. Rosenberg
Dmitry Sotnikov
29
3
0
13 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous
  Communication and Linear Function Approximation
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
38
7
0
10 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
34
8
0
05 May 2023
What can online reinforcement learning with function approximation
  benefit from general coverage conditions?
What can online reinforcement learning with function approximation benefit from general coverage conditions?
Fanghui Liu
Luca Viano
V. Cevher
OffRL
34
2
0
25 Apr 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
Dingwen Kong
Lin F. Yang
37
9
0
18 Apr 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
76
12
0
14 Apr 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under
  Low-rank MDPs
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
41
8
0
20 Mar 2023
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
Junkai Zhang
Weitong Zhang
Quanquan Gu
33
3
0
17 Mar 2023
Variance-aware robust reinforcement learning with linear function
  approximation under heavy-tailed rewards
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards
Xiang Li
Qiang Sun
32
8
0
09 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and
  Algorithms
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
38
13
0
01 Mar 2023
Optimistic Planning by Regularized Dynamic Programming
Optimistic Planning by Regularized Dynamic Programming
Antoine Moulin
Gergely Neu
19
4
0
27 Feb 2023
The Provable Benefits of Unsupervised Data Sharing for Offline
  Reinforcement Learning
The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
11
5
0
27 Feb 2023
Exponential Hardness of Reinforcement Learning with Linear Function
  Approximation
Exponential Hardness of Reinforcement Learning with Linear Function Approximation
Daniel M. Kane
Sihan Liu
Shachar Lovett
G. Mahajan
Csaba Szepesvári
Gellert Weisz
46
3
0
25 Feb 2023
Finding Regularized Competitive Equilibria of Heterogeneous Agent
  Macroeconomic Models with Reinforcement Learning
Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
Ruitu Xu
Yifei Min
Tianhao Wang
Zhaoran Wang
Michael I. Jordan
Zhuoran Yang
36
6
0
24 Feb 2023
Provably Efficient Reinforcement Learning via Surprise Bound
Provably Efficient Reinforcement Learning via Surprise Bound
Hanlin Zhu
Ruosong Wang
Jason D. Lee
OffRL
28
5
0
22 Feb 2023
Provably Efficient Exploration in Quantum Reinforcement Learning with
  Logarithmic Worst-Case Regret
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
Han Zhong
Jiachen Hu
Yecheng Xue
Tongyang Li
Liwei Wang
26
5
0
21 Feb 2023
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement
  Learning: Adaptivity and Computational Efficiency
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Heyang Zhao
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
42
27
0
21 Feb 2023
Efficient Planning in Combinatorial Action Spaces with Applications to
  Cooperative Multi-Agent Reinforcement Learning
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Volodymyr Tkachuk
Seyed Alireza Bakhtiari
Johannes Kirschner
Matej Jusup
Ilija Bogunovic
Csaba Szepesvári
32
5
0
08 Feb 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under
  Instantaneous Hard Constraints
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
Ming Shi
Yitao Liang
Ness B. Shroff
48
8
0
08 Feb 2023
Sample Complexity of Kernel-Based Q-Learning
Sample Complexity of Kernel-Based Q-Learning
Sing-Yuan Yeh
Fu-Chieh Chang
Chang-Wei Yueh
Pei-Yuan Wu
A. Bernacchia
Sattar Vakili
OffRL
30
4
0
01 Feb 2023
Improved Regret for Efficient Online Reinforcement Learning with Linear
  Function Approximation
Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation
Uri Sherman
Tomer Koren
Yishay Mansour
32
12
0
30 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based
  Reinforcement Learning
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
24
7
0
28 Jan 2023
Multi-Agent Congestion Cost Minimization With Linear Function
  Approximations
Multi-Agent Congestion Cost Minimization With Linear Function Approximations
Prashant Trivedi
N. Hemachandra
37
0
0
26 Jan 2023
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning
  in Lifelong Reinforcement Learning
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning
Safa Alver
Doina Precup
OffRL
24
5
0
24 Jan 2023
Tight Guarantees for Interactive Decision Making with the
  Decision-Estimation Coefficient
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient
Dylan J. Foster
Noah Golowich
Yanjun Han
OffRL
33
29
0
19 Jan 2023
Exploration in Model-based Reinforcement Learning with Randomized Reward
Exploration in Model-based Reinforcement Learning with Randomized Reward
Lingxiao Wang
Ping Li
19
0
0
09 Jan 2023
Model-Based Reinforcement Learning with Multinomial Logistic Function
  Approximation
Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation
Taehyun Hwang
Min Hwan Oh
52
8
0
27 Dec 2022
Near-optimal Policy Identification in Active Reinforcement Learning
Near-optimal Policy Identification in Active Reinforcement Learning
Xiang Li
Viraj Mehta
Johannes Kirschner
I. Char
Willie Neiswanger
J. Schneider
Andreas Krause
Ilija Bogunovic
OffRL
45
6
0
19 Dec 2022
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision
  Processes
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
Jiafan He
Heyang Zhao
Dongruo Zhou
Quanquan Gu
OffRL
51
55
0
12 Dec 2022
Near-Optimal Differentially Private Reinforcement Learning
Near-Optimal Differentially Private Reinforcement Learning
Dan Qiao
Yu Wang
30
13
0
09 Dec 2022
Eluder-based Regret for Stochastic Contextual MDPs
Eluder-based Regret for Stochastic Contextual MDPs
Orin Levy
Asaf B. Cassel
Alon Cohen
Yishay Mansour
33
5
0
27 Nov 2022
Model-Free Reinforcement Learning with the Decision-Estimation
  Coefficient
Model-Free Reinforcement Learning with the Decision-Estimation Coefficient
Dylan J. Foster
Noah Golowich
Jian Qian
Alexander Rakhlin
Ayush Sekhari
OffRL
38
9
0
25 Nov 2022
Operator Splitting Value Iteration
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
On Instance-Dependent Bounds for Offline Reinforcement Learning with
  Linear Function Approximation
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Thanh Nguyen-Tang
Ming Yin
Sunil R. Gupta
Svetha Venkatesh
R. Arora
OffRL
58
16
0
23 Nov 2022
Linear Reinforcement Learning with Ball Structure Action Space
Linear Reinforcement Learning with Ball Structure Action Space
Zeyu Jia
Randy Jia
Dhruv Madeka
Dean Phillips Foster
31
1
0
14 Nov 2022
Leveraging Offline Data in Online Reinforcement Learning
Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker
Aldo Pacchiano
OffRL
OnRL
35
38
0
09 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement
  Learning
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
31
4
0
30 Oct 2022
Provable Sim-to-real Transfer in Continuous Domain with Partial
  Observations
Provable Sim-to-real Transfer in Continuous Domain with Partial Observations
Jiachen Hu
Han Zhong
Chi Jin
Liwei Wang
27
7
0
27 Oct 2022
Previous
12345
Next