ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models

Papers citing "ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models"