ResearchTrend.AI
Online Reinforcement Learning in Stochastic Games

2 December 2017
Chen-Yu Wei
Yi-Te Hong
Chi-Jen Lu
    OffRL
arXiv: 1712.00579

Papers citing "Online Reinforcement Learning in Stochastic Games"

8 / 8 papers shown
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
Yuheng Zhang
Dian Yu
Tao Ge
Linfeng Song
Zhichen Zeng
Haitao Mi
Nan Jiang
Dong Yu
95
4
0
24 Feb 2025
Maximin Action Identification: A New Bandit Framework for Games
Aurélien Garivier
E. Kaufmann
Wouter M. Koolen
37
29
0
15 Feb 2016
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning
Christoph Dann
Emma Brunskill
50
249
0
29 Oct 2015
Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
Yasin Abbasi-Yadkori
Peter L. Bartlett
Csaba Szepesvári
68
86
0
12 Mar 2013
Value Function Approximation in Zero-Sum Markov Games
M. Lagoudakis
Ronald E. Parr
OffRL
46
78
0
12 Dec 2012
REGAL: A Regularization based Algorithm for Reinforcement Learning in Weakly Communicating MDPs
Peter L. Bartlett
Ambuj Tewari
71
280
0
09 May 2012
PAC Bounds for Discounted MDPs
Tor Lattimore
Marcus Hutter
68
188
0
17 Feb 2012
Empirical Bernstein Bounds and Sample Variance Penalization
Andreas Maurer
Massimiliano Pontil
164
540
0
21 Jul 2009