arXiv:2504.16112
HPU: High-Bandwidth Processing Unit for Scalable, Cost-effective LLM Inference via GPU Co-processing
18 April 2025
Myunghyun Rhee
Joonseop Sim
Taeyoung Ahn
Seungyong Lee
Daegun Yoon
Euiseok Kim
Kyoung Park
Youngpyo Joo
Hosik Kim
Papers citing "HPU: High-Bandwidth Processing Unit for Scalable, Cost-effective LLM Inference via GPU Co-processing":
No papers