
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods
Papers citing "Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods"
11 / 11 papers shown
Title |
---|
![]() Concave Utility Reinforcement Learning: the Mean-Field Game Viewpoint Matthieu Geist Julien Pérolat Mathieu Laurière Romuald Elie Sarah Perrin Olivier Bachem Rémi Munos Olivier Pietquin |