qwp4w3hyb/gemma-2-9b-it-iMat-GGUF


qwp4w3hyb committed on Jun 27

Commit ae2e095 • 1 Parent(s): 7449d16

Update README.md

Files changed (1)

README.md +7 -1


@@ -8,4 +8,10 @@ tags:
 - gemma
 - gguf
 - imatrix
----
+---
+# Google Gemma 9B IT GGUF
+- just f32 gguf from official kaggle repo reuploaded for now
+- imatrix quants are running
+- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp
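
The last README bullet asks for the gemma2 PR to be applied to a local llama.cpp checkout. One way to do that is to fetch the PR head through GitHub's `pull/<number>/head` ref; the sketch below only assembles the git commands. The branch name `gemma2-pr`, the clone directory, and the helper names are assumptions for illustration and do not come from the README. Rebuilding llama.cpp afterwards is left to whatever build procedure you already use.

```python
import subprocess

# Repository and PR number are taken from the link in the README.
LLAMA_CPP_REPO = "https://github.com/ggerganov/llama.cpp"
PR_NUMBER = 8156


def pr_checkout_commands(pr_number, branch, repo_url=LLAMA_CPP_REPO, clone_dir="llama.cpp"):
    """Return the git commands that clone the repo and check out a PR head.

    `branch` and `clone_dir` are hypothetical local names chosen by the caller.
    """
    return [
        ["git", "clone", repo_url, clone_dir],
        # GitHub exposes each pull request head as refs/pull/<number>/head.
        ["git", "-C", clone_dir, "fetch", "origin", f"pull/{pr_number}/head:{branch}"],
        ["git", "-C", clone_dir, "checkout", branch],
    ]


def run(commands):
    """Execute the commands in order, stopping at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Print the commands instead of running them, so nothing touches the network by default.
    for cmd in pr_checkout_commands(PR_NUMBER, "gemma2-pr"):
        print(" ".join(cmd))
```

Passing the returned list to `run(...)` would execute the steps. Once the PR is merged upstream, a plain up-to-date checkout should make this unnecessary.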