Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Papers citing "Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs"

12 / 12 papers shown
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi, Fangming Liu, Xiwen Zhang, Nicholas D. Lane, Mengwei Xu
24 Sep 2024