2307.10524
Cited By
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions
20 July 2023
Tongxin Li
Yiheng Lin
Shaolei Ren
Adam Wierman
AAML
OffRL
Papers citing
"Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions"
34 / 34 papers shown
Improving Online Algorithms via ML Predictions
Ravi Kumar
Manish Purohit
Zoya Svitkina
67
319
0
25 Jul 2024
Scaling Laws for Reward Model Overoptimization
Leo Gao
John Schulman
Jacob Hilton
ALM
101
551
0
19 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Yue Liu
Ding Zhao
129
47
0
16 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
80
17
0
26 Aug 2022
Chasing Convex Bodies and Functions with Black-Box Advice
Nicolas H. Christianson
Tinashe Handina
Adam Wierman
58
32
0
23 Jun 2022
Optimal Competitive-Ratio Control
Oron Sabag
Sahin Lale
B. Hassibi
84
12
0
03 Jun 2022
Online Algorithms with Multiple Predictions
Keerti Anand
Rong Ge
Ajay Kumar
Debmalya Panigrahi
76
35
0
08 May 2022
Transferred Q-learning
Elynn Y. Chen
Michael I. Jordan
Sai Li
OffRL
OnRL
73
4
0
09 Feb 2022
Parsimonious Learning-Augmented Caching
Sungjin Im
Ravi Kumar
Aditya Petety
Manish Purohit
77
29
0
09 Feb 2022
Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems
Yuwei Luo
Varun Gupta
Mladen Kolar
67
9
0
06 Nov 2021
Can Q-Learning be Improved with Advice?
Noah Golowich
Ankur Moitra
OffRL
122
13
0
25 Oct 2021
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Lukas Brunke
Melissa Greeff
Adam W. Hall
Zhaocong Yuan
Siqi Zhou
Jacopo Panerati
Angela P. Schoellig
OffRL
63
625
0
13 Aug 2021
Competitive Control
Gautam Goel
B. Hassibi
56
31
0
28 Jul 2021
On the Sample Complexity of Stability Constrained Imitation Learning
Stephen Tu
Alexander Robey
Tingnan Zhang
Nikolai Matni
71
39
0
18 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
148
107
0
10 Feb 2021
The Primal-Dual method for Learning Augmented Algorithms
Étienne Bamas
Andreas Maggiori
O. Svensson
OnRL
59
128
0
22 Oct 2020
Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms
Alexander Wei
Fred Zhang
77
99
0
22 Oct 2020
Non-Stochastic Control with Bandit Feedback
Paula Gradu
John Hallman
Elad Hazan
44
28
0
12 Aug 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation
L. Guan
Mudit Verma
Sihang Guo
Ruohan Zhang
Subbarao Kambhampati
89
43
0
26 Jun 2020
Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft
Christian Scheller
Yanick Schraner
Manfred Vogel
60
27
0
12 Mar 2020
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
54
75
0
10 Nov 2019
Deep Value Model Predictive Control
Farbod Farshidian
David Hoeller
Marco Hutter
48
45
0
08 Oct 2019
Control Regularization for Reduced Variance Reinforcement Learning
Richard Cheng
Abhinav Verma
G. Orosz
Swarat Chaudhuri
Yisong Yue
J. W. Burdick
OffRL
75
80
0
14 May 2019
Online Control with Adversarial Disturbances
Naman Agarwal
Brian Bullins
Elad Hazan
Sham Kakade
Karan Singh
42
237
0
23 Feb 2019
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
128
331
0
31 Jan 2019
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
73
299
0
06 Dec 2018
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
70
807
0
10 Jul 2018
Competitive caching with machine learned advice
Thodoris Lykouris
Sergei Vassilvitskii
56
388
0
15 Feb 2018
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
107
416
0
26 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
190
3,318
0
12 Jun 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
176
852
0
23 May 2017
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
86
775
0
16 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
161
472
0
28 Feb 2017
Interactive Learning from Policy-Dependent Human Feedback
J. MacGlashan
Mark K. Ho
R. Loftin
Bei Peng
Guan Wang
David L. Roberts
Matthew E. Taylor
Michael L. Littman
74
305
0
21 Jan 2017