Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.13304
Cited By
QuIP: 2-Bit Quantization of Large Language Models With Guarantees
25 July 2023
Jerry Chee
Yaohui Cai
Volodymyr Kuleshov
Chris De Sa
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
2 / 152 papers shown
Title
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
36
193
0
15 Aug 2023
SqueezeLLM: Dense-and-Sparse Quantization
Sehoon Kim
Coleman Hooper
A. Gholami
Zhen Dong
Xiuyu Li
Sheng Shen
Michael W. Mahoney
Kurt Keutzer
MQ
32
167
0
13 Jun 2023
Previous
1
2
3
4