hfl
/

chinese-alpaca-2-1.3b-gguf

Inference Endpoints

Model card Files Files and versions Community

hfl-rc commited on Jan 16

Commit

026be4b

•

1 Parent(s): afea6a3

Update README.md

Files changed (1) hide show

README.md +18 -0

README.md CHANGED Viewed

@@ -9,6 +9,24 @@ language:
 This repository contains the GGUF-v3 models (llama.cpp compatible) for **Chinese-Alpaca-2-1.3B**.
 For Hugging Face version, please see: https://huggingface.co/hfl/chinese-alpaca-2-1.3b
 Please refer to [https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/) for more details.

 This repository contains the GGUF-v3 models (llama.cpp compatible) for **Chinese-Alpaca-2-1.3B**.
+## Performance
+Metric: PPL, lower is better
+| Quant | original | imatrix (`-im`) |
+|-----|------|------|
+| Q2_K | 19.9339 +/- 0.29752 | 18.8935 +/- 0.28558 |
+| Q3_K | 17.2487 +/- 0.27668 | 17.2950 +/- 0.27994 |
+| Q4_K | 16.4583 +/- 0.26453 | 16.2688 +/- 0.26216 |
+| Q5_K | 15.7547 +/- 0.25207 | 16.0190 +/- 0.25782 |
+| Q6_K | 15.8166 +/- 0.25359 | 15.7357 +/- 0.25210 |
+*The model with `-im` suffix is generated with important matrix, which has generally better performance (not always though).*
+## Others
 For Hugging Face version, please see: https://huggingface.co/hfl/chinese-alpaca-2-1.3b
 Please refer to [https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/) for more details.