Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,10 @@ tags:
|
|
8 |
- gemma
|
9 |
- gguf
|
10 |
- imatrix
|
11 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
- gemma
|
9 |
- gguf
|
10 |
- imatrix
|
11 |
+
---
|
12 |
+
|
13 |
+
# Google Gemma 9B IT GGUF
|
14 |
+
|
15 |
+
- just f32 gguf from official kaggle repo reuploaded for now
|
16 |
+
- imatrix quants are running
|
17 |
+
- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp
|