dranger003
committed on
Commit 17680aa · 1 Parent(s): c56e918
Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ The quants here are meant to test imatrix quantized weights.
 
 **Added `ggml-dbrx-instruct-16x12b-f16_imatrix-wiki.dat` which is a 2K batches (1M tokens) on FP16 weights using wiki.train.**
 
-| Precision | Quant/
+| Precision | Quant/imatrix | Size (GiB) | PPL |
 | -- | -- | -- | -- |
 | IQ4_XS | Q8_0/wiki.train | 65.29 | 5.2260 +/- 0.03558 |
 | IQ4_XS | FP16/wiki.train | 65.29 | 5.2241 +/- 0.03559 |