2404.11502
Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models
17 April 2024
Yushuo Chen, Tianyi Tang, Erge Xiang, Linjiang Li, Wayne Xin Zhao, Jing Wang, Yunpeng Chai, Ji-Rong Wen
Papers citing "Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models"
1 / 1 papers shown
Title
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim, Coleman Hooper, Thanakul Wattanawong, Minwoo Kang, Ruohan Yan, ..., Qijing Huang, Kurt Keutzer, Michael W. Mahoney, Y. Shao, A. Gholami
MQ
126
104
0
27 Feb 2023