Update README.md
Browse files
README.md
CHANGED
@@ -61,7 +61,7 @@ bin/falcon -m /path/to/WizardLM-Uncensored-Falcon-40b.ggmlv3.q4_0.bin -t 10 -n 2
|
|
61 |
| WizardLM-Uncensored-Falcon-40b.ggmlv3.q5_0.bin | q5_0 | 5 | 28.77 GB | 31.27 GB | 5-bit. Higher accuracy, higher resource usage and slower inference. |
|
62 |
| WizardLM-Uncensored-Falcon-40b.ggmlv3.q5_1.bin | q5_1 | 5 | 31.38 GB | 33.88 GB | 5-bit. Even higher accuracy, resource usage and slower inference. |
|
63 |
|
64 |
-
A q8_0
|
65 |
|
66 |
<!-- footer start -->
|
67 |
## Discord
|
|
|
61 |
| WizardLM-Uncensored-Falcon-40b.ggmlv3.q5_0.bin | q5_0 | 5 | 28.77 GB | 31.27 GB | 5-bit. Higher accuracy, higher resource usage and slower inference. |
|
62 |
| WizardLM-Uncensored-Falcon-40b.ggmlv3.q5_1.bin | q5_1 | 5 | 31.38 GB | 33.88 GB | 5-bit. Even higher accuracy, resource usage and slower inference. |
|
63 |
|
64 |
+
A q8_0 file will be provided shortly. There is currently an issue preventing it from working. Once this is fixed, it will be uploaded.
|
65 |
|
66 |
<!-- footer start -->
|
67 |
## Discord
|