Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.03704
Cited By
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
6 June 2024
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking"
2 / 2 papers shown
Title
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking
Hanna Krasowski
Jakob Thumm
Marlon Müller
Lukas Schäfer
Xiao Wang
Matthias Althoff
85
19
0
13 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
328
11,953
0
04 Mar 2022
1