dranger003/dbrx-instruct-iMat.GGUF

dranger003 committed on Apr 14

Commit 17680aa • 1 Parent(s): c56e918

Update README.md

Files changed (1)

README.md +1 -1

@@ -12,7 +12,7 @@ The quants here are meant to test imatrix quantized weights.
 **Added `ggml-dbrx-instruct-16x12b-f16_imatrix-wiki.dat` which is a 2K batches (1M tokens) on FP16 weights using wiki.train.**
-| Precision | Quant/Dataset | Size (GiB) | PPL |
+| Precision | Quant/imatrix | Size (GiB) | PPL |
 | -- | -- | -- | -- |
 | IQ4_XS | Q8_0/wiki.train | 65.29 | 5.2260 +/- 0.03558 |
 | IQ4_XS | FP16/wiki.train | 65.29 | 5.2241 +/- 0.03559 |