namtran commited on
Commit
81e31f7
1 Parent(s): 70662c1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -1,3 +1,17 @@
1
  ---
2
- license: mit
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ inference: false
3
+ license: other
4
+ model_type: llama
5
  ---
6
+ # Mistral 7B v0.2 - AWQ GGUF
7
+ These files are in GGUF format.
8
+ - Model creator: [Mistralai](https://huggingface.co/mistralai)
9
+ - Original model: [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)
10
+
11
+ The model was converted by the combination of [llama.cpp](https://github.com/ggerganov/llama.cpp) and quantization method [AWQ](https://github.com/mit-han-lab/llm-awq)
12
+
13
+ ## How to use models in `llama.cpp`
14
+
15
+ ```
16
+ ./main -m Mistral-7b-v0.1-Q2_K.gguf -n 128 --prompt "Once upon a time"
17
+ ```