Model-Based Reinforcement Learning with Value-Targeted Regression

1 June 2020

Papers citing "Model-Based Reinforcement Learning with Value-Targeted Regression"

50 / 223 papers shown

Title
Towards Optimal Differentially Private Regret Bounds in Linear MDPs Sharan Sahu 60 0 0 12 Apr 2025
Dynamic Assortment Selection and Pricing with Censored Preference Feedback Jung-hun Kim Min-hwan Oh 38 0 0 03 Apr 2025
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games Tong Yang Bo Dai Lin Xiao Yuejie Chi OffRL 64 2 0 13 Feb 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation Long-Fei Li Yu-Jie Zhang Peng Zhao Zhi-Hua Zhou 103 4 0 17 Jan 2025
Digital Twin Calibration with Model-Based Reinforcement Learning Hua Zheng Wei Xie I. Ryzhov Keilung Choy 39 0 0 04 Jan 2025
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs Long-Fei Li Peng Zhao Zhi-Hua Zhou 49 0 0 05 Nov 2024
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework Joongkyu Lee Min-hwan Oh 45 2 0 31 Oct 2024
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner Fu-Chieh Chang Yu-Ting Lee Hui-Ying Shih Pei-Yuan Wu Pei-Yuan Wu OffRL LRM 219 0 0 31 Oct 2024
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span Woojin Chae Kihyuk Hong Yufan Zhang Ambuj Tewari Dabeen Lee 39 1 0 19 Oct 2024
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation Taehyun Cho Seung Han Kyungjae Lee Seokhun Ju Dohyeong Kim Jungwoo Lee 72 0 0 31 Jul 2024
Misspecified $Q$ -Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error Ally Yalei Du Lin F. Yang Ruosong Wang 37 0 0 18 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning Srinath Mahankali Zhang-Wei Hong Ayush Sekhari Alexander Rakhlin Pulkit Agrawal 33 3 0 18 Jul 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes Asaf B. Cassel Aviv A. Rosenberg 43 1 0 03 Jul 2024
Operator World Models for Reinforcement Learning P. Novelli Marco Prattico Massimiliano Pontil C. Ciliberto OffRL 42 0 0 28 Jun 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning Bradley Burega John D. Martin Luke Kapeluck Michael Bowling 40 0 0 27 Jun 2024
A New View on Planning in Online Reinforcement Learning Kevin Roice Parham Mohammad Panahi Scott M. Jordan Adam White Martha White OffRL 28 0 0 03 Jun 2024
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs Kihyuk Hong Yufan Zhang Ambuj Tewari Dabeen Lee Ambuj Tewari 40 2 0 23 May 2024
Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation Jianliang He Han Zhong Zhuoran Yang 38 6 0 19 Apr 2024
Differentially Private Reinforcement Learning with Self-Play Dan Qiao Yu-Xiang Wang 36 0 0 11 Apr 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm Miao Lu Han Zhong Tong Zhang Jose H. Blanchet OffRL OOD 79 6 0 04 Apr 2024
Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning Srinjoy Roy Swagatam Das 32 0 0 31 Mar 2024
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation Yingru Li Zhi-Quan Luo 27 0 0 17 Mar 2024
Horizon-Free Regret for Linear Markov Decision Processes Zihan Zhang Jason D. Lee Yuxin Chen Simon S. Du 33 3 0 15 Mar 2024
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition Long-Fei Li Peng Zhao Zhi-Hua Zhou 56 4 0 07 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation Yu Chen Xiangcheng Zhang Siwei Wang Longbo Huang 42 3 0 28 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization Avinandan Bose S. S. Du Maryam Fazel OffRL 57 12 0 19 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning Zihao Li Boyi Liu Zhuoran Yang Zhaoran Wang Mengdi Wang 42 1 0 16 Feb 2024
Active Preference Optimization for Sample Efficient RLHF Nirjhar Das Souradip Chakraborty Aldo Pacchiano Sayak Ray Chowdhury 27 13 0 16 Feb 2024
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path Qiwei Di Jiafan He Dongruo Zhou Quanquan Gu 33 2 0 14 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption Chen Ye Jiafan He Quanquan Gu Tong Zhang 48 5 0 14 Feb 2024
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL Jiawei Huang Niao He Andreas Krause 37 6 0 08 Feb 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees Toshinori Kitamura Tadashi Kozuno Masahiro Kato Yuki Ichihara Soichiro Nishimori Akiyoshi Sannai Sho Sonoda Wataru Kumagai Yutaka Matsuo 42 2 0 31 Jan 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng Han Zhong OffRL 76 2 0 28 Dec 2023
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge Meshal Alharbi Mardavij Roozbehani M. Dahleh 29 0 0 19 Dec 2023
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation Jiayi Huang Han Zhong Liwei Wang Lin F. Yang 39 2 0 07 Dec 2023
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback Canzhe Zhao Ruofeng Yang Baoxiang Wang Xuezhou Zhang Shuai Li 30 3 0 14 Nov 2023
Anytime-Competitive Reinforcement Learning with Policy Prior Jianyi Yang Pengfei Li Tongxin Li Adam Wierman Shaolei Ren 46 2 0 02 Nov 2023
Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning Ahmadreza Moradipari M. Pedramfar Modjtaba Shokrian Zini Vaneet Aggarwal 32 5 0 30 Oct 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation Nikki Lijing Kuang Ming Yin Mengdi Wang Yu-Xiang Wang Yian Ma 24 6 0 29 Oct 2023
A Doubly Robust Approach to Sparse Reinforcement Learning Wonyoung Hedge Kim Garud Iyengar A. Zeevi 25 3 0 23 Oct 2023
Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs Yu-Heng Hung Ping-Chun Hsieh Akshay Mete P. R. Kumar 16 0 0 17 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning Parvin Malekzadeh Ming Hou Konstantinos N. Plataniotis 51 1 0 16 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning Ran Wei Nathan Lambert Anthony D. McDonald Alfredo Garcia Roberto Calandra 33 7 0 10 Oct 2023
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning Qiwei Di Heyang Zhao Jiafan He Quanquan Gu OffRL 61 5 0 02 Oct 2023
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency Zhihan Liu Hao Hu Shenao Zhang Hongyi Guo Shuqi Ke Boyi Liu Zhaoran Wang LLMAG LRM 36 33 0 29 Sep 2023
Rate-Optimal Policy Optimization for Linear Markov Decision Processes Uri Sherman Alon Cohen Tomer Koren Yishay Mansour 41 7 0 28 Aug 2023
Bayesian Inverse Transition Learning for Offline Settings Leo Benac S. Parbhoo Finale Doshi-Velez OffRL 16 0 0 09 Aug 2023
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback Yu Chen Yihan Du Pihe Hu Si-Yi Wang De-hui Wu Longbo Huang 24 6 0 06 Jul 2023
$λ$ -models: Effective Decision-Aware Reinforcement Learning with Latent Models C. Voelcker Arash Ahmadian Romina Abachi Igor Gilitschenski Amir-massoud Farahmand 59 0 0 30 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping Yining Li Peizhong Ju Ness B. Shroff 31 0 0 22 Jun 2023