Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware
  Neuron Management

Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management

Papers citing "Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management"

1 / 1 papers shown
Title
Small Language Models: Survey, Measurements, and Insights
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu
Xiang Li
Dongqi Cai
Rongjie Yi
Fangming Liu
Xiwen Zhang
Nicholas D. Lane
Mengwei Xu
55
36
0
24 Sep 2024