Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models

Papers citing "Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models"

14 / 14 papers shown
The case for 4-bit precision: k-bit Inference Scaling Laws
Tim Dettmers, Luke Zettlemoyer
19 Dec 2022
8-bit Optimizers via Block-wise Quantization
Tim Dettmers, M. Lewis, Sam Shleifer, Luke Zettlemoyer
06 Oct 2021