Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.12356
Cited By
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
21 May 2023
Yijia Zhang
Lingran Zhao
Shijie Cao
Wenqiang Wang
Ting Cao
Fan Yang
Mao Yang
Shanghang Zhang
Ningyi Xu
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models"
2 / 2 papers shown
Title
NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics
Zhihang Cai
Xingjun Zhang
Zhendong Tan
Zheng Wei
MQ
195
0
0
22 May 2025
Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Jordan Dotzel
Yuzong Chen
Bahaa Kotb
Sushma Prasad
Gang Wu
Sheng Li
Mohamed S. Abdelfattah
Zhiru Zhang
76
9
0
06 May 2024
1