Melvin56
/

DeepScaleR-1.5B-Preview-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Melvin56 commited on 8 days ago

Commit

a3a47b8

·

verified ·

1 Parent(s): 347f5a2

Update README.md

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -18,6 +18,23 @@ Original Model : [agentica-org/DeepScaleR-1.5B-Preview](https://huggingface.co/a
 All quants are made using the imatrix option.
 | Model                                            |   Size (GB)   |
 |:-------------------------------------------------|:-------------:|

 All quants are made using the imatrix option.
+|               | CPU (AVX2) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute |
+| :------------ | :---------: | :---: | :----: | :-----: | :---: | :------: | :----: | :------: |
+| K-quants      |      ✅     |   ✅  |   ✅   |    ✅   |  ✅  |   ✅ 🐢5  |  ✅ 🐢5 |    ❌    |
+| I-quants      |    ✅ 🐢4   |  ✅ 🐢4 |   ✅   |    ✅   | Partial¹ |    ❌    |   ❌  |    ❌    |
+```
+✅: feature works.
+🚫: feature does not work
+❓: unknown, please contribute if you can test it youself
+🐢: feature is slow
+¹: IQ3_S and IQ1_S, see #5886
+²: Only with -ngl 0
+³: Inference is 50% slower
+⁴: Slower than K-quants of comparable size
+⁵: Slower than cuBLAS/rocBLAS on similar cards
+⁶: Only q8_0 and iq4_nl
+```
 | Model                                            |   Size (GB)   |
 |:-------------------------------------------------|:-------------:|