namtran
/

Mistral-7b-v0.2-AWQ-GGUF

Model card Files Files and versions Community

namtran commited on Jan 12

Commit

81e31f7

•

1 Parent(s): 70662c1

Update README.md

Files changed (1) hide show

README.md +15 -1

README.md CHANGED Viewed

@@ -1,3 +1,17 @@
 ---
-license: mit
 ---

 ---
+inference: false
+license: other
+model_type: llama
 ---
+# Mistral 7B v0.2 - AWQ GGUF
+These files are in GGUF format.
+- Model creator: [Mistralai](https://huggingface.co/mistralai)
+- Original model: [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)
+The model was converted by the combination of [llama.cpp](https://github.com/ggerganov/llama.cpp) and quantization method [AWQ](https://github.com/mit-han-lab/llm-awq)
+## How to use models in `llama.cpp`
+```
+./main -m Mistral-7b-v0.1-Q2_K.gguf -n 128 --prompt "Once upon a time"
+```