Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.01464
Cited By
Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation
2 March 2023
Orin Levy
Alon Cohen
Asaf B. Cassel
Yishay Mansour
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation"
1 / 1 papers shown
Title
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees
Yongtao Wu
Luca Viano
Yihang Chen
Zhenyu Zhu
Kimon Antonakopoulos
Quanquan Gu
V. Cevher
54
0
0
18 Feb 2025
1