Reward is enough for convex MDPs
v1v2v3v4 (latest)

Reward is enough for convex MDPs

Papers citing "Reward is enough for convex MDPs"

23 / 23 papers shown
Title
Concave Utility Reinforcement Learning: the Mean-Field Game Viewpoint
Concave Utility Reinforcement Learning: the Mean-Field Game Viewpoint
Matthieu Geist
Julien Pérolat
Mathieu Laurière
Romuald Elie
Sarah Perrin
Olivier Bachem
Rémi Munos
Olivier Pietquin
112
65
0
07 Jun 2021