ResearchTrend.AI
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

6 June 2024
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
    CLL
arXiv: 2406.03704

Papers citing "Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking"

Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking
Hanna Krasowski
Jakob Thumm
Marlon Müller
Lukas Schäfer
Xiao Wang
Matthias Althoff
88
19
0
13 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
339
12,003
0
04 Mar 2022