Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.17644
Cited By
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems
31 January 2024
Yuxin Wang
Yuhan Chen
Zeyu Li
Xueze Kang
Zhenheng Tang
Xin He
Rui Guo
Xin Wang
Qiang-qiang Wang
Amelie Chi Zhou
Xiaowen Chu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems"
3 / 3 papers shown
Title
ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor
Seungbeom Choi
Jeonghoe Goo
Eunjoo Jeon
Mingyu Yang
Minsung Jang
21
0
0
14 May 2025
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Minsu Kim
Seongmin Hong
RyeoWook Ko
S. Choi
Hunjong Lee
Junsoo Kim
Joo-Young Kim
Jongse Park
57
0
0
24 Mar 2025
Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider
Mohammad Shahrad
Rodrigo Fonseca
Íñigo Goiri
G. Chaudhry
Paul Batum
Jason Cooke
Eduardo Laureano
Colby Tresness
M. Russinovich
Ricardo Bianchini
89
601
0
06 Mar 2020
1