Update README.md
README.md
CHANGED
@@ -16,6 +16,9 @@ Quantized using the [cleaned PIPPA](https://huggingface.co/datasets/royallab/PIP
 
 [4.0bpw8h quants](https://huggingface.co/luigi86/magnum-72b-v1-exl2-rpcal/tree/4.0bpw8h) (tested and working on two 3090s with Q4 cache at 32k context)
 
+[8.0bpw8h quants](https://huggingface.co/luigi86/magnum-72b-v1-exl2-rpcal/tree/8.0bpw8h)
+
+
 
 See [original model](https://huggingface.co/alpindale/magnum-72b-v1) for further details.
 