
L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models
Papers citing "L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models"
14 / 14 papers shown
Title |
---|
![]() Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models Jung Hwan Heo Jeonghoon Kim Beomseok Kwon Byeongwook Kim Se Jung Kwon Dongsoo Lee |