Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.04323
Cited By
Discovering a set of policies for the worst case reward
8 February 2021
Tom Zahavy
André Barreto
D. Mankowitz
Shaobo Hou
Brendan O'Donoghue
Iurii Kemaev
Satinder Singh
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Discovering a set of policies for the worst case reward"
7 / 7 papers shown
Title
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
45
1
0
03 Oct 2024
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Félix Chalumeau
Raphael Boige
Bryan Lim
Valentin Macé
Maxime Allard
Arthur Flajolet
Antoine Cully
Thomas Pierrot
26
21
0
06 Oct 2022
Minimum Description Length Control
Theodore H. Moskovitz
Ta-Chu Kao
M. Sahani
M. Botvinick
26
1
0
17 Jul 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
78
66
0
01 Feb 2022
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
34
15
0
28 Sep 2021
A Linearly Convergent Conditional Gradient Algorithm with Applications to Online and Stochastic Optimization
Dan Garber
Elad Hazan
61
95
0
20 Jan 2013
1