How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study • Paper • 2404.14047 • Published Apr 22
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs • Paper • 2309.05516 • Published Sep 11, 2023