ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.10315
  4. Cited By

Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control

20 August 2021
Dimitri Bertsekas
    AI4CE
ArXivPDFHTML

Papers citing "Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control"

25 / 25 papers shown
Title
Rank-One Modified Value Iteration
Rank-One Modified Value Iteration
A. S. Kolarijani
Tolga Ok
Peyman Mohajerin Esfahani
Mohamad Amin Sharif Kolarijani
22
0
0
03 May 2025
Trust-Region Twisted Policy Improvement
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
30
0
0
08 Apr 2025
MPCritic: A plug-and-play MPC architecture for reinforcement learning
MPCritic: A plug-and-play MPC architecture for reinforcement learning
Nathan P. Lawrence
Thomas Banker
Ali Mesbah
31
0
0
01 Apr 2025
On-line Policy Improvement using Monte-Carlo Search
On-line Policy Improvement using Monte-Carlo Search
Gerald Tesauro
Gregory R. Galperin
86
270
0
09 Jan 2025
Model Predictive Control and Reinforcement Learning: A Unified Framework
  Based on Dynamic Programming
Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming
Dimitri Bertsekas
41
6
0
02 Jun 2024
Learning to Boost the Performance of Stable Nonlinear Systems
Learning to Boost the Performance of Stable Nonlinear Systems
Luca Furieri
C. Galimberti
Giancarlo Ferrari-Trecate
31
9
0
01 May 2024
The Effective Horizon Explains Deep RL Performance in Stochastic
  Environments
The Effective Horizon Explains Deep RL Performance in Stochastic Environments
Cassidy Laidlaw
Banghua Zhu
Stuart J. Russell
Anca Dragan
30
2
0
13 Dec 2023
Surge Routing: Event-informed Multiagent Reinforcement Learning for
  Autonomous Rideshare
Surge Routing: Event-informed Multiagent Reinforcement Learning for Autonomous Rideshare
Daniel Garces
Stephanie Gil
AI4TS
19
2
0
05 Jul 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
Hang Wang
Sen Lin
Junshan Zhang
OffRL
OnRL
26
3
0
20 Jun 2023
What model does MuZero learn?
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
33
4
0
01 Jun 2023
Online augmentation of learned grasp sequence policies for more
  adaptable and data-efficient in-hand manipulation
Online augmentation of learned grasp sequence policies for more adaptable and data-efficient in-hand manipulation
E. Gordon
Rana Soltani-Zarrin
OffRL
24
5
0
04 Apr 2023
Deep networks for system identification: a Survey
Deep networks for system identification: a Survey
G. Pillonetto
Aleksandr Aravkin
Daniel Gedon
L. Ljung
Antônio H. Ribeiro
Thomas B. Schon
OOD
35
35
0
30 Jan 2023
Rollout Algorithms and Approximate Dynamic Programming for Bayesian
  Optimization and Sequential Estimation
Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation
Dimitri Bertsekas
33
3
0
15 Dec 2022
Multiagent Reinforcement Learning for Autonomous Routing and Pickup
  Problem with Adaptation to Variable Demand
Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand
Daniel Garces
Sushmita Bhattacharya
Stephanie Gil
Dimitri Bertsekas
24
10
0
28 Nov 2022
Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control
  Approach
Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Siddhant Bhambri
Amrita Bhattacharjee
Dimitri Bertsekas
11
9
0
15 Nov 2022
Reinforcement Learning with Unbiased Policy Evaluation and Linear
  Function Approximation
Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation
Anna Winnicki
R. Srikant
OffRL
10
6
0
13 Oct 2022
Statistical Hypothesis Testing Based on Machine Learning: Large
  Deviations Analysis
Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis
P. Braca
L. Millefiori
A. Aubry
S. Maranò
A. De Maio
P. Willett
29
12
0
22 Jul 2022
New Auction Algorithms for Path Planning, Network Transport, and
  Reinforcement Learning
New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning
Dimitri Bertsekas
9
2
0
19 Jul 2022
Bayesian Learning Approach to Model Predictive Control
Bayesian Learning Approach to Model Predictive Control
Namhoon Cho
Seokwon Lee
Hyo-Sang Shin
Antonios Tsourdos
10
1
0
05 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems
Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems
Christos K. Verginis
Zhe Xu
Ufuk Topcu
9
3
0
11 Oct 2021
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement
  Learning with Linear Value Function Approximation
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation
Anna Winnicki
Joseph Lubars
Michael Livesay
R. Srikant
17
3
0
28 Sep 2021
Non-Parametric Neuro-Adaptive Control Subject to Task Specifications
Non-Parametric Neuro-Adaptive Control Subject to Task Specifications
Christos K. Verginis
Zhe Xu
Ufuk Topcu
12
4
0
25 Jun 2021
Adaptive Variants of Optimal Feedback Policies
Adaptive Variants of Optimal Feedback Policies
B. Lopez
Jean-Jacques E. Slotine
OffRL
21
4
0
06 Apr 2021
Multiagent Value Iteration Algorithms in Dynamic Programming and
  Reinforcement Learning
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning
Dimitri Bertsekas
38
38
0
04 May 2020
1