Discovering a set of policies for the worst case reward

8 February 2021

Papers citing "Discovering a set of policies for the worst case reward"

7 / 7 papers shown

Title
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front Ruohong Liu Yuxin Pan Linjie Xu Lei Song Jiang Bian Pengcheng You Yize Chen 45 1 0 03 Oct 2024
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery Félix Chalumeau Raphael Boige Bryan Lim Valentin Macé Maxime Allard Arthur Flajolet Antoine Cully Thomas Pierrot 26 21 0 06 Oct 2022
Minimum Description Length Control Theodore H. Moskovitz Ta-Chu Kao M. Sahani M. Botvinick 26 1 0 17 Jul 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery Michael Laskin Hao Liu Xue Bin Peng Denis Yarats Aravind Rajeswaran Pieter Abbeel SSL 78 65 0 01 Feb 2022
Learning to Be Cautious Montaser Mohammedalamen Dustin Morrill Alexander Sieusahai Yash Satsangi Michael Bowling 18 3 0 29 Oct 2021
A First-Occupancy Representation for Reinforcement Learning Theodore H. Moskovitz S. Wilson M. Sahani 34 15 0 28 Sep 2021
A Linearly Convergent Conditional Gradient Algorithm with Applications to Online and Stochastic Optimization Dan Garber Elad Hazan 61 94 0 20 Jan 2013