ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.01107
  4. Cited By
Model-Based Reinforcement Learning with Value-Targeted Regression

Model-Based Reinforcement Learning with Value-Targeted Regression

1 June 2020
Alex Ayoub
Zeyu Jia
Csaba Szepesvári
Mengdi Wang
Lin F. Yang
    OffRL
ArXivPDFHTML

Papers citing "Model-Based Reinforcement Learning with Value-Targeted Regression"

50 / 223 papers shown
Title
Towards Optimal Differentially Private Regret Bounds in Linear MDPs
Towards Optimal Differentially Private Regret Bounds in Linear MDPs
Sharan Sahu
60
0
0
12 Apr 2025
Dynamic Assortment Selection and Pricing with Censored Preference Feedback
Dynamic Assortment Selection and Pricing with Censored Preference Feedback
Jung-hun Kim
Min-hwan Oh
38
0
0
03 Apr 2025
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
Tong Yang
Bo Dai
Lin Xiao
Yuejie Chi
OffRL
64
2
0
13 Feb 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Long-Fei Li
Yu-Jie Zhang
Peng Zhao
Zhi-Hua Zhou
103
4
0
17 Jan 2025
Digital Twin Calibration with Model-Based Reinforcement Learning
Hua Zheng
Wei Xie
I. Ryzhov
Keilung Choy
39
0
0
04 Jan 2025
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs
Long-Fei Li
Peng Zhao
Zhi-Hua Zhou
49
0
0
05 Nov 2024
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Joongkyu Lee
Min-hwan Oh
45
2
0
31 Oct 2024
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
Fu-Chieh Chang
Yu-Ting Lee
Hui-Ying Shih
Pei-Yuan Wu
Pei-Yuan Wu
OffRL
LRM
219
0
0
31 Oct 2024
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded
  Span
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span
Woojin Chae
Kihyuk Hong
Yufan Zhang
Ambuj Tewari
Dabeen Lee
39
1
0
19 Oct 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Taehyun Cho
Seung Han
Kyungjae Lee
Seokhun Ju
Dohyeong Kim
Jungwoo Lee
72
0
0
31 Jul 2024
Misspecified $Q$-Learning with Sparse Linear Function Approximation:
  Tight Bounds on Approximation Error
Misspecified QQQ-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
Ally Yalei Du
Lin F. Yang
Ruosong Wang
37
0
0
18 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov
  Decision Processes
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
Asaf B. Cassel
Aviv A. Rosenberg
43
1
0
03 Jul 2024
Operator World Models for Reinforcement Learning
Operator World Models for Reinforcement Learning
P. Novelli
Marco Prattico
Massimiliano Pontil
C. Ciliberto
OffRL
42
0
0
28 Jun 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of
  Dyna-style Planning
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
40
0
0
27 Jun 2024
A New View on Planning in Online Reinforcement Learning
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
28
0
0
03 Jun 2024
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
Kihyuk Hong
Yufan Zhang
Ambuj Tewari
Dabeen Lee
Ambuj Tewari
40
2
0
23 May 2024
Sample-efficient Learning of Infinite-horizon Average-reward MDPs with
  General Function Approximation
Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation
Jianliang He
Han Zhong
Zhuoran Yang
38
6
0
19 Apr 2024
Differentially Private Reinforcement Learning with Self-Play
Differentially Private Reinforcement Learning with Self-Play
Dan Qiao
Yu-Xiang Wang
36
0
0
11 Apr 2024
Distributionally Robust Reinforcement Learning with Interactive Data
  Collection: Fundamental Hardness and Near-Optimal Algorithm
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
79
6
0
04 Apr 2024
Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning
Srinjoy Roy
Swagatam Das
32
0
0
31 Mar 2024
Prior-dependent analysis of posterior sampling reinforcement learning
  with function approximation
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
Yingru Li
Zhi-Quan Luo
27
0
0
17 Mar 2024
Horizon-Free Regret for Linear Markov Decision Processes
Horizon-Free Regret for Linear Markov Decision Processes
Zihan Zhang
Jason D. Lee
Yuxin Chen
Simon S. Du
33
3
0
15 Mar 2024
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit
  Feedback and Unknown Transition
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition
Long-Fei Li
Peng Zhao
Zhi-Hua Zhou
56
4
0
07 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with
  General Function Approximation
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
42
3
0
28 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
57
12
0
19 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for
  Constrained Reinforcement Learning
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
42
1
0
16 Feb 2024
Active Preference Optimization for Sample Efficient RLHF
Active Preference Optimization for Sample Efficient RLHF
Nirjhar Das
Souradip Chakraborty
Aldo Pacchiano
Sayak Ray Chowdhury
27
13
0
16 Feb 2024
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic
  Shortest Path
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Qiwei Di
Jiafan He
Dongruo Zhou
Quanquan Gu
33
2
0
14 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial
  Corruption
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
48
5
0
14 Feb 2024
Model-Based RL for Mean-Field Games is not Statistically Harder than
  Single-Agent RL
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL
Jiawei Huang
Niao He
Andreas Krause
37
6
0
08 Feb 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with
  Uniform PAC Guarantees
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
42
2
0
31 Jan 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement
  Learning via the Lens of Representation Complexity
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng
Han Zhong
OffRL
76
2
0
28 Dec 2023
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
Meshal Alharbi
Mardavij Roozbehani
M. Dahleh
29
0
0
19 Dec 2023
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement
  Learning with General Function Approximation
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation
Jiayi Huang
Han Zhong
Liwei Wang
Lin F. Yang
39
2
0
07 Dec 2023
Learning Adversarial Low-rank Markov Decision Processes with Unknown
  Transition and Full-information Feedback
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Canzhe Zhao
Ruofeng Yang
Baoxiang Wang
Xuezhou Zhang
Shuai Li
30
3
0
14 Nov 2023
Anytime-Competitive Reinforcement Learning with Policy Prior
Anytime-Competitive Reinforcement Learning with Policy Prior
Jianyi Yang
Pengfei Li
Tongxin Li
Adam Wierman
Shaolei Ren
46
2
0
02 Nov 2023
Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement
  Learning
Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning
Ahmadreza Moradipari
M. Pedramfar
Modjtaba Shokrian Zini
Vaneet Aggarwal
32
5
0
30 Oct 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with
  Linear Function Approximation
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu-Xiang Wang
Yian Ma
24
6
0
29 Oct 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
A Doubly Robust Approach to Sparse Reinforcement Learning
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
25
3
0
23 Oct 2023
Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement
  Learning in Discounted Linear MDPs
Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs
Yu-Heng Hung
Ping-Chun Hsieh
Akshay Mete
P. R. Kumar
16
0
0
17 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based
  successor feature reinforcement learning
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
51
1
0
16 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based
  Reinforcement Learning
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
33
7
0
10 Oct 2023
Pessimistic Nonlinear Least-Squares Value Iteration for Offline
  Reinforcement Learning
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
Qiwei Di
Heyang Zhao
Jiafan He
Quanquan Gu
OffRL
61
5
0
02 Oct 2023
Reason for Future, Act for Now: A Principled Framework for Autonomous
  LLM Agents with Provable Sample Efficiency
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Zhihan Liu
Hao Hu
Shenao Zhang
Hongyi Guo
Shuqi Ke
Boyi Liu
Zhaoran Wang
LLMAG
LRM
36
33
0
29 Sep 2023
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman
Alon Cohen
Tomer Koren
Yishay Mansour
41
7
0
28 Aug 2023
Bayesian Inverse Transition Learning for Offline Settings
Bayesian Inverse Transition Learning for Offline Settings
Leo Benac
S. Parbhoo
Finale Doshi-Velez
OffRL
16
0
0
09 Aug 2023
Provably Efficient Iterated CVaR Reinforcement Learning with Function
  Approximation and Human Feedback
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback
Yu Chen
Yihan Du
Pihe Hu
Si-Yi Wang
De-hui Wu
Longbo Huang
24
6
0
06 Jul 2023
$λ$-models: Effective Decision-Aware Reinforcement Learning with
  Latent Models
λλλ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
59
0
0
30 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by
  Action Space Reduction via Grouping
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li
Peizhong Ju
Ness B. Shroff
31
0
0
22 Jun 2023
12345
Next