Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.19830
Cited By
v1
v2 (latest)
GameLabel-10K: Collecting Image Preference Data Through Mobile Game Crowdsourcing
30 September 2024
Jonathan Zhou
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"GameLabel-10K: Collecting Image Preference Data Through Mobile Game Crowdsourcing"
5 / 5 papers shown
Title
Provably Robust DPO: Aligning Language Models with Noisy Feedback
Sayak Ray Chowdhury
Anush Kini
Nagarajan Natarajan
80
70
0
01 Mar 2024
Diffusion Model Alignment Using Direct Preference Optimization
Bram Wallace
Meihua Dang
Rafael Rafailov
Linqi Zhou
Aaron Lou
Senthil Purushwalkam
Stefano Ermon
Caiming Xiong
Shafiq Joty
Nikhil Naik
EGVM
142
287
0
21 Nov 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
389
4,169
0
29 May 2023
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
511
15,788
0
20 Dec 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
502
10,526
0
17 Jun 2021
1