v1v2 (latest)

Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions

20 July 2023

Adam Wierman

Papers citing "Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions"

34 / 34 papers shown

Title
Improving Online Algorithms via ML Predictions Ravi Kumar Manish Purohit Zoya Svitkina 67 319 0 25 Jul 2024
Scaling Laws for Reward Model Overoptimization Leo Gao John Schulman Jacob Hilton ALM 101 551 0 19 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability Mengdi Xu Zuxin Liu Peide Huang Wenhao Ding Zhepeng Cen Yue Liu Ding Zhao 129 47 0 16 Sep 2022
Dynamic Regret of Online Markov Decision Processes Peng Zhao Longfei Li Zhi Zhou OffRL 80 17 0 26 Aug 2022
Chasing Convex Bodies and Functions with Black-Box Advice Nicolas H. Christianson Tinashe Handina Adam Wierman 58 32 0 23 Jun 2022
Optimal Competitive-Ratio Control Oron Sabag Sahin Lale B. Hassibi 84 12 0 03 Jun 2022
Online Algorithms with Multiple Predictions Keerti Anand Rong Ge Ajay Kumar Debmalya Panigrahi 76 35 0 08 May 2022
Transferred Q-learning Elynn Y. Chen Michael I. Jordan Sai Li OffRL OnRL 73 4 0 09 Feb 2022
Parsimonious Learning-Augmented Caching Sungjin Im Ravi Kumar Aditya Petety Manish Purohit 77 29 0 09 Feb 2022
Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems Yuwei Luo Varun Gupta Mladen Kolar 67 9 0 06 Nov 2021
Can Q-Learning be Improved with Advice? Noah Golowich Ankur Moitra OffRL 122 13 0 25 Oct 2021
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning Lukas Brunke Melissa Greeff Adam W. Hall Zhaocong Yuan Siqi Zhou Jacopo Panerati Angela P. Schoellig OffRL 63 625 0 13 Aug 2021
Competitive Control Gautam Goel B. Hassibi 56 31 0 28 Jul 2021
On the Sample Complexity of Stability Constrained Imitation Learning Stephen Tu Alexander Robey Tingnan Zhang Nikolai Matni 71 39 0 18 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach Chen-Yu Wei Haipeng Luo OffRL 148 107 0 10 Feb 2021
The Primal-Dual method for Learning Augmented Algorithms Étienne Bamas Andreas Maggiori O. Svensson OnRL 59 128 0 22 Oct 2020
Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms Alexander Wei Fred Zhang 77 99 0 22 Oct 2020
Non-Stochastic Control with Bandit Feedback Paula Gradu John Hallman Elad Hazan 44 28 0 12 Aug 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation L. Guan Mudit Verma Sihang Guo Ruohan Zhang Subbarao Kambhampati 89 43 0 26 Jun 2020
Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft Christian Scheller Yanick Schraner Manfred Vogel 60 27 0 12 Mar 2020
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation Xueying Bai Jian Guan Hongning Wang OffRL 54 75 0 10 Nov 2019
Deep Value Model Predictive Control Farbod Farshidian David Hoeller Marco Hutter 48 45 0 08 Oct 2019
Control Regularization for Reduced Variance Reinforcement Learning Richard Cheng Abhinav Verma G. Orosz Swarat Chaudhuri Yisong Yue J. W. Burdick OffRL 75 80 0 14 May 2019
Online Control with Adversarial Disturbances Naman Agarwal Brian Bullins Elad Hazan Sham Kakade Karan Singh 42 237 0 23 Feb 2019
A Theory of Regularized Markov Decision Processes Matthieu Geist B. Scherrer Olivier Pietquin 128 331 0 31 Jan 2019
Provably Efficient Maximum Entropy Exploration Elad Hazan Sham Kakade Karan Singh A. V. Soest 73 299 0 06 Dec 2018
Is Q-learning Provably Efficient? Chi Jin Zeyuan Allen-Zhu Sébastien Bubeck Michael I. Jordan OffRL 70 807 0 10 Jul 2018
Competitive caching with machine learned advice Thodoris Lykouris Sergei Vassilvitskii 56 388 0 15 Feb 2018
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning I. Higgins Arka Pal Andrei A. Rusu Loic Matthey Christopher P. Burgess Alexander Pritzel M. Botvinick Charles Blundell Alexander Lerchner DRL 107 416 0 26 Jul 2017
Deep reinforcement learning from human preferences Paul Christiano Jan Leike Tom B. Brown Miljan Martic Shane Legg Dario Amodei 190 3,318 0 12 Jun 2017
Safe Model-based Reinforcement Learning with Stability Guarantees Felix Berkenkamp M. Turchetta Angela P. Schoellig Andreas Krause 176 852 0 23 May 2017
Minimax Regret Bounds for Reinforcement Learning M. G. Azar Ian Osband Rémi Munos 86 775 0 16 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning Ofir Nachum Mohammad Norouzi Kelvin Xu Dale Schuurmans 161 472 0 28 Feb 2017
Interactive Learning from Policy-Dependent Human Feedback J. MacGlashan Mark K. Ho R. Loftin Bei Peng Guan Wang David L. Roberts Matthew E. Taylor Michael L. Littman 74 305 0 21 Jan 2017