Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.09049
Cited By
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables
18 April 2023
Darshan C. Ganji
Saad Ashfaq
Ehsan Saboori
Sudhakar Sah
Saptarshi Mitra
Mohammadhossein Askarihemmat
Alexander Hoffman
Ahmed Hassanien
Mathieu Léonardon
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables"
2 / 2 papers shown
Title
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
Jianyu Wei
Shijie Cao
Ting Cao
Lingxiao Ma
Lei Wang
Yanyong Zhang
Mao Yang
MQ
53
11
0
25 Jun 2024
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li
Mingzhu Shen
Jian Ma
Yan Ren
Mingxin Zhao
Qi Zhang
Ruihao Gong
F. Yu
Junjie Yan
MQ
35
49
0
05 Nov 2021
1