2404.05868
Cited By
Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning
8 April 2024
Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei
MU
Papers citing
"Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning"
2 / 52 papers shown
Title
Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM
ALM
354
12,003
0
04 Mar 2022
Extracting Training Data from Large Language Models
Nicholas Carlini, Florian Tramèr, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, ..., Tom B. Brown, D. Song, Ulfar Erlingsson, Alina Oprea, Colin Raffel
MLAU
SILM
290
1,824
0
14 Dec 2020