Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.08791
Cited By
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters
7 April 2025
Zonghang Li
Tao Li
Wenjiao Feng
Mohsen Guizani
Hongfang Yu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters"
Title
No papers