TheBloke committed on
Commit fad6157
1 Parent(s): 5db19b0

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -61,7 +61,7 @@ bin/falcon -m /path/to/WizardLM-Uncensored-Falcon-40b.ggmlv3.q4_0.bin -t 10 -n 2
 | WizardLM-Uncensored-Falcon-40b.ggmlv3.q5_0.bin | q5_0 | 5 | 28.77 GB | 31.27 GB | 5-bit. Higher accuracy, higher resource usage and slower inference. |
 | WizardLM-Uncensored-Falcon-40b.ggmlv3.q5_1.bin | q5_1 | 5 | 31.38 GB | 33.88 GB | 5-bit. Even higher accuracy, resource usage and slower inference. |
 
-A q8_0 quant will also be possible in future, but is currently not working. It will be uploaded once this issue is fixed.
+A q8_0 file will be provided shortly. There is currently an issue preventing it from working. Once this is fixed, it will be uploaded.
 
 <!-- footer start -->
 ## Discord