
Extreme Compression of Large Language Models via Additive Quantization
Papers citing "Extreme Compression of Large Language Models via Additive Quantization"
22 / 22 papers shown
Title |
---|
![]() Mixtral of Experts Albert Q. Jiang Alexandre Sablayrolles Antoine Roux A. Mensch Blanche Savary ...Théophile Gervet Thibaut Lavril Thomas Wang Timothée Lacroix William El Sayed |