Update README.md
README.md
CHANGED
@@ -13,8 +13,7 @@ This repo contains AWQ model files for [PMC_LLaMA_13B](https://huggingface.co/ax
 
 ### About AWQ
 
-AWQ is
-
+[AWQ](https://arxiv.org/abs/2306.00978) is a rapid, precise, and efficient low-bit weight quantization method, enabling 4-bit quantization with remarkable speed.
 
 Example of usage with vLLM library:
 