Update README.md
Browse files
README.md
CHANGED
@@ -28,7 +28,8 @@ This model was created by [jphme](https://huggingface.co/jphme). It's a fine-tun
|
|
28 |
| **Quantization Formats** | 8 Bit, 5 Bit (K_M) |
|
29 |
|
30 |
|
31 |
-
##
|
|
|
32 |
1. Clone and install llama.cpp *(at time of writing, we used commit 9e20231)*, then compile.
|
33 |
```
|
34 |
cd llama.cpp && make
|
|
|
28 |
| **Quantization Formats** | 8 Bit, 5 Bit (K_M) |
|
29 |
|
30 |
|
31 |
+
## Quantization Process
|
32 |
+
If you want to create your own GGUF quantizations of HuggingFace models, use llama.cpp.
|
33 |
1. Clone and install llama.cpp *(at time of writing, we used commit 9e20231)*, then compile.
|
34 |
```
|
35 |
cd llama.cpp && make
|