ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.10524
  4. Cited By
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with
  Q-Value Predictions
v1v2 (latest)

Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions

20 July 2023
Tongxin Li
Yiheng Lin
Shaolei Ren
Adam Wierman
    AAMLOffRL
ArXiv (abs)PDFHTML

Papers citing "Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions"

34 / 34 papers shown
Title
Improving Online Algorithms via ML Predictions
Improving Online Algorithms via ML Predictions
Ravi Kumar
Manish Purohit
Zoya Svitkina
67
319
0
25 Jul 2024
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model Overoptimization
Leo Gao
John Schulman
Jacob Hilton
ALM
101
551
0
19 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Yue Liu
Ding Zhao
129
47
0
16 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
80
17
0
26 Aug 2022
Chasing Convex Bodies and Functions with Black-Box Advice
Chasing Convex Bodies and Functions with Black-Box Advice
Nicolas H. Christianson
Tinashe Handina
Adam Wierman
58
32
0
23 Jun 2022
Optimal Competitive-Ratio Control
Optimal Competitive-Ratio Control
Oron Sabag
Sahin Lale
B. Hassibi
84
12
0
03 Jun 2022
Online Algorithms with Multiple Predictions
Online Algorithms with Multiple Predictions
Keerti Anand
Rong Ge
Ajay Kumar
Debmalya Panigrahi
76
35
0
08 May 2022
Transferred Q-learning
Transferred Q-learning
Elynn Y. Chen
Michael I. Jordan
Sai Li
OffRLOnRL
73
4
0
09 Feb 2022
Parsimonious Learning-Augmented Caching
Parsimonious Learning-Augmented Caching
Sungjin Im
Ravi Kumar
Aditya Petety
Manish Purohit
77
29
0
09 Feb 2022
Dynamic Regret Minimization for Control of Non-stationary Linear
  Dynamical Systems
Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems
Yuwei Luo
Varun Gupta
Mladen Kolar
67
9
0
06 Nov 2021
Can Q-Learning be Improved with Advice?
Can Q-Learning be Improved with Advice?
Noah Golowich
Ankur Moitra
OffRL
122
13
0
25 Oct 2021
Safe Learning in Robotics: From Learning-Based Control to Safe
  Reinforcement Learning
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Lukas Brunke
Melissa Greeff
Adam W. Hall
Zhaocong Yuan
Siqi Zhou
Jacopo Panerati
Angela P. Schoellig
OffRL
63
625
0
13 Aug 2021
Competitive Control
Competitive Control
Gautam Goel
B. Hassibi
56
31
0
28 Jul 2021
On the Sample Complexity of Stability Constrained Imitation Learning
On the Sample Complexity of Stability Constrained Imitation Learning
Stephen Tu
Alexander Robey
Tingnan Zhang
Nikolai Matni
71
39
0
18 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An
  Optimal Black-box Approach
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
148
107
0
10 Feb 2021
The Primal-Dual method for Learning Augmented Algorithms
The Primal-Dual method for Learning Augmented Algorithms
Étienne Bamas
Andreas Maggiori
O. Svensson
OnRL
59
128
0
22 Oct 2020
Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online
  Algorithms
Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms
Alexander Wei
Fred Zhang
77
99
0
22 Oct 2020
Non-Stochastic Control with Bandit Feedback
Non-Stochastic Control with Bandit Feedback
Paula Gradu
John Hallman
Elad Hazan
44
28
0
12 Aug 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with
  Explanation and Context-Aware Data Augmentation
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation
L. Guan
Mudit Verma
Sihang Guo
Ruohan Zhang
Subbarao Kambhampati
89
43
0
26 Jun 2020
Sample Efficient Reinforcement Learning through Learning from
  Demonstrations in Minecraft
Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft
Christian Scheller
Yanick Schraner
Manfred Vogel
60
27
0
12 Mar 2020
Model-Based Reinforcement Learning with Adversarial Training for Online
  Recommendation
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
54
75
0
10 Nov 2019
Deep Value Model Predictive Control
Deep Value Model Predictive Control
Farbod Farshidian
David Hoeller
Marco Hutter
48
45
0
08 Oct 2019
Control Regularization for Reduced Variance Reinforcement Learning
Control Regularization for Reduced Variance Reinforcement Learning
Richard Cheng
Abhinav Verma
G. Orosz
Swarat Chaudhuri
Yisong Yue
J. W. Burdick
OffRL
75
80
0
14 May 2019
Online Control with Adversarial Disturbances
Online Control with Adversarial Disturbances
Naman Agarwal
Brian Bullins
Elad Hazan
Sham Kakade
Karan Singh
42
237
0
23 Feb 2019
A Theory of Regularized Markov Decision Processes
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
128
331
0
31 Jan 2019
Provably Efficient Maximum Entropy Exploration
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
73
299
0
06 Dec 2018
Is Q-learning Provably Efficient?
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
70
807
0
10 Jul 2018
Competitive caching with machine learned advice
Competitive caching with machine learned advice
Thodoris Lykouris
Sergei Vassilvitskii
56
388
0
15 Feb 2018
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
107
416
0
26 Jul 2017
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
190
3,318
0
12 Jun 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
176
852
0
23 May 2017
Minimax Regret Bounds for Reinforcement Learning
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
86
775
0
16 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
161
472
0
28 Feb 2017
Interactive Learning from Policy-Dependent Human Feedback
Interactive Learning from Policy-Dependent Human Feedback
J. MacGlashan
Mark K. Ho
R. Loftin
Bei Peng
Guan Wang
David L. Roberts
Matthew E. Taylor
Michael L. Littman
74
305
0
21 Jan 2017
1