ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.15025
  4. Cited By
Constructing a Good Behavior Basis for Transfer using Generalized Policy
  Updates

Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates

30 December 2021
Safa Alver
Doina Precup
    OffRL
ArXivPDFHTML

Papers citing "Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates"

10 / 10 papers shown
Title
Constructing an Optimal Behavior Basis for the Option Keyboard
Constructing an Optimal Behavior Basis for the Option Keyboard
L. N. Alegre
A. Bazzan
André Barreto
Bruno C. da Silva
26
0
0
01 May 2025
Boosting Soft Q-Learning by Bounding
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
56
2
0
26 Jun 2024
Diversifying AI: Towards Creative Chess with AlphaZero
Diversifying AI: Towards Creative Chess with AlphaZero
Tom Zahavy
Vivek Veeriah
Shaobo Hou
Kevin Waugh
Matthew Lai
Edouard Leurent
Nenad Tomašev
Lisa Schut
Demis Hassabis
Satinder Singh
37
15
0
17 Aug 2023
Bounding the Optimal Value Function in Compositional Reinforcement
  Learning
Bounding the Optimal Value Function in Compositional Reinforcement Learning
Jacob Adamczyk
Volodymyr Makarenko
A. Arriojas
Stas Tiomkin
R. Kulkarni
OffRL
37
2
0
05 Mar 2023
Diverse Policy Optimization for Structured Action Space
Diverse Policy Optimization for Structured Action Space
Wenhao Li
Baoxiang Wang
Shanchao Yang
H. Zha
OffRL
32
1
0
23 Feb 2023
Safety-Constrained Policy Transfer with Successor Features
Safety-Constrained Policy Transfer with Successor Features
Zeyu Feng
Bowen Zhang
Jianxin Bi
Harold Soh
16
4
0
10 Nov 2022
How to Reuse and Compose Knowledge for a Lifetime of Tasks: A Survey on
  Continual Learning and Functional Composition
How to Reuse and Compose Knowledge for a Lifetime of Tasks: A Survey on Continual Learning and Functional Composition
Jorge Armando Mendez Mendez
Eric Eaton
KELM
CLL
32
27
0
15 Jul 2022
Optimistic Linear Support and Successor Features as a Basis for Optimal
  Policy Transfer
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer
L. N. Alegre
A. Bazzan
Bruno C. da Silva
30
26
0
22 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition
Generalised Policy Improvement with Geometric Policy Composition
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
19
7
0
17 Jun 2022
Skill Machines: Temporal Logic Skill Composition in Reinforcement
  Learning
Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning
Geraud Nangue Tasse
Devon Jarvis
Steven D. James
Benjamin Rosman
44
4
0
25 May 2022
1