2405.16283
Cited By
TURNIP: A "Nondeterministic" GPU Runtime with CPU RAM Offload
25 May 2024
Zhimin Ding
Jiawen Yao
Brianna Barrow
Tania Lorido-Botran
Christopher M. Jermaine
Yu-Shuen Tang
Jiehui Li
Xinyu Yao
Sleem Mahmoud Abdelghafar
Daniel Bourgeois
Papers citing "TURNIP: A 'Nondeterministic' GPU Runtime with CPU RAM Offload"
2 / 2 papers shown
Title
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
149
369
0
13 Mar 2023
ZeRO-Offload: Democratizing Billion-Scale Model Training
Jie Ren
Samyam Rajbhandari
Reza Yazdani Aminabadi
Olatunji Ruwase
Shuangyang Yang
Minjia Zhang
Dong Li
Yuxiong He
MoE
177
414
0
18 Jan 2021