ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.04323
  4. Cited By
Discovering a set of policies for the worst case reward

Discovering a set of policies for the worst case reward

8 February 2021
Tom Zahavy
André Barreto
D. Mankowitz
Shaobo Hou
Brendan O'Donoghue
Iurii Kemaev
Satinder Singh
    OffRL
ArXivPDFHTML

Papers citing "Discovering a set of policies for the worst case reward"

7 / 7 papers shown
Title
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
45
1
0
03 Oct 2024
Neuroevolution is a Competitive Alternative to Reinforcement Learning
  for Skill Discovery
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Félix Chalumeau
Raphael Boige
Bryan Lim
Valentin Macé
Maxime Allard
Arthur Flajolet
Antoine Cully
Thomas Pierrot
26
21
0
06 Oct 2022
Minimum Description Length Control
Minimum Description Length Control
Theodore H. Moskovitz
Ta-Chu Kao
M. Sahani
M. Botvinick
26
1
0
17 Jul 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
78
65
0
01 Feb 2022
Learning to Be Cautious
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
A First-Occupancy Representation for Reinforcement Learning
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
34
15
0
28 Sep 2021
A Linearly Convergent Conditional Gradient Algorithm with Applications
  to Online and Stochastic Optimization
A Linearly Convergent Conditional Gradient Algorithm with Applications to Online and Stochastic Optimization
Dan Garber
Elad Hazan
61
94
0
20 Jan 2013
1