Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.06178
Cited By
Look-Up mAI GeMM: Increasing AI GeMMs Performance by Nearly 2.5x via msGeMM
9 October 2023
Saeed Maleki
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Look-Up mAI GeMM: Increasing AI GeMMs Performance by Nearly 2.5x via msGeMM"
3 / 3 papers shown
Title
MixPE: Quantization and Hardware Co-design for Efficient LLM Inference
Yu Zhang
Hao Wu
Lancheng Zou
Wulong Liu
Hui-Ling Zhen
M. Yuan
Bei Yu
MQ
79
1
0
25 Nov 2024
LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference
Zhiwen Mo
Lei Wang
Jianyu Wei
Zhichen Zeng
Shijie Cao
...
Naifeng Jing
Ting Cao
Jilong Xue
Fan Yang
Mao Yang
54
0
0
12 Aug 2024
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
Jianyu Wei
Shijie Cao
Ting Cao
Lingxiao Ma
Lei Wang
Yanyong Zhang
Mao Yang
MQ
53
11
0
25 Jun 2024
1