Update README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ license_link: LICENSE
|
|
20 |
# Quant Infos
|
21 |
|
22 |
- quants done with an importance matrix for improved quantization loss
|
23 |
-
- K & IQ quants in basically all variants from Q6_K down to
|
24 |
- fixed end token for instruct mode (<|eot_id|>[128009])
|
25 |
|
26 |
Quantized with [llama.cpp](https://github.com/ggerganov/llama.cpp) commit [0d56246f4b9764158525d894b96606f6163c53a8](https://github.com/ggerganov/llama.cpp/commit/0d56246f4b9764158525d894b96606f6163c53a8) (master from 2024-04-18)
|
|
|
20 |
# Quant Infos
|
21 |
|
22 |
- quants done with an importance matrix for improved quantization loss
|
23 |
+
- K & IQ quants in basically all variants from Q6_K down to IQ1_S
|
24 |
- fixed end token for instruct mode (<|eot_id|>[128009])
|
25 |
|
26 |
Quantized with [llama.cpp](https://github.com/ggerganov/llama.cpp) commit [0d56246f4b9764158525d894b96606f6163c53a8](https://github.com/ggerganov/llama.cpp/commit/0d56246f4b9764158525d894b96606f6163c53a8) (master from 2024-04-18)
|