1912.08517
Distributional Reinforcement Learning for Energy-Based Sequential Models
18 December 2019
Tetiana Parshakova
J. Andreoli
Marc Dymetman
Papers citing "Distributional Reinforcement Learning for Energy-Based Sequential Models"
4 / 4 papers shown
PIPA: Preference Alignment as Prior-Informed Statistical Estimation
Junbo Li
Zhangyang Wang
Qiang Liu
OffRL
104
0
0
09 Feb 2025
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
53
10
0
28 Aug 2023
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Tomasz Korbak
Hady ElSahar
Germán Kruszewski
Marc Dymetman
CLL
25
51
0
01 Jun 2022
Energy-Based Models for Continual Learning
Shuang Li
Yilun Du
Gido M. van de Ven
Igor Mordatch
27
42
0
24 Nov 2020