2401.17644
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems
31 January 2024
Yuxin Wang, Yuhan Chen, Zeyu Li, Xueze Kang, Zhenheng Tang, Xin He, Rui Guo, Xin Wang, Qiang-qiang Wang, Amelie Chi Zhou, Xiaowen Chu
Papers citing "BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems"
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Minsu Kim, Seongmin Hong, RyeoWook Ko, S. Choi, Hunjong Lee, Junsoo Kim, J. Kim, Jongse Park
57
0
0
24 Mar 2025
Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider
Mohammad Shahrad, Rodrigo Fonseca, Íñigo Goiri, G. Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, M. Russinovich, Ricardo Bianchini
89
601
0
06 Mar 2020