Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.07657
Cited By
SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
7 March 2025
Jaewoo Song
Fangzhen Lin
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs"
6 / 6 papers shown
Title
SpinQuant: LLM quantization with learned rotations
Zechun Liu
Changsheng Zhao
Igor Fedorov
Bilge Soran
Dhruv Choudhary
Raghuraman Krishnamoorthi
Vikas Chandra
Yuandong Tian
Tijmen Blankevoort
MQ
214
117
0
21 Feb 2025
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Elias Frantar
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
124
989
0
31 Oct 2022
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
Elias Frantar
Sidak Pal Singh
Dan Alistarh
MQ
98
237
0
24 Aug 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao
Reza Yazdani Aminabadi
Minjia Zhang
Xiaoxia Wu
Conglong Li
Yuxiong He
VLM
MQ
119
477
0
04 Jun 2022
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
496
42,449
0
03 Dec 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
94
311
0
28 Jan 2019
1