Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation

2 March 2023
Orin Levy
Alon Cohen
Asaf B. Cassel
Yishay Mansour

Papers citing "Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation"

1 / 1 papers shown
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees
Yongtao Wu
Luca Viano
Yihang Chen
Zhenyu Zhu
Kimon Antonakopoulos
Quanquan Gu
V. Cevher
18 Feb 2025