Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.01831
Cited By
Quinoa: a Q-function You Infer Normalized Over Actions
5 November 2019
Jonas Degrave
A. Abdolmaleki
Jost Tobias Springenberg
N. Heess
Martin Riedmiller
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quinoa: a Q-function You Infer Normalized Over Actions"
2 / 2 papers shown
Title
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Zhaolin Gao
Wenhao Zhan
Jonathan D. Chang
Gokul Swamy
Kianté Brantley
Jason D. Lee
Wen Sun
OffRL
81
3
0
06 Oct 2024
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
1