Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.12057
Cited By
NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2
11 July 2024
Tengfei Xue
Xuefeng Li
Roman Smirnov
Tahir Azim
Arash Sadrieh
Babak Pahlavan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2"
Title
No papers