hfl
/

chinese-mixtral-instruct-gguf

Mixture of Experts

Inference Endpoints

Model card Files Files and versions

hfl-rc commited on Jan 28

Commit

cc7fe76

•

1 Parent(s): d45a7b9

Update README.md

Files changed (1) hide show

README.md +31 -1

README.md CHANGED Viewed

@@ -4,4 +4,34 @@ language:
 - zh
 - en
 ---
-Work-in-progress (WIP)

 - zh
 - en
 ---
+# Chinese-Mixtral-Instruct-GGUF
+This repository contains the GGUF-v3 models (llama.cpp compatible) for **Chinese-Mixtral-Instruct** (chat/instruction model).
+## Performance
+Metric: PPL, lower is better
+| Quant | PPL  |
+| ----- | ---- |
+| IQ2_XXS | - |
+| IQ2_XS | - |
+| Q2_K  | -    |
+| Q3_K  | -     |
+| Q4_0  | -      |
+| Q4_K  | -    |
+| Q5_0  | -     |
+| Q5_K  | -    |
+| Q6_K  | -     |
+| Q8_0  | -     |
+| F16   |   x   |
+Due to the file size limitation, for F16 model, please use `cat` command to concatenate all parts into a single file. **You must concatenate these parts in order.**
+## Others
+For Hugging Face version, please see: https://huggingface.co/hfl/chinese-mixtral-instruct
+Please refer to [https://github.com/ymcui/Chinese-Mixtral/](https://github.com/ymcui/Chinese-Mixtral/) for more details.