Update README.md
Browse files
README.md
CHANGED
@@ -2,15 +2,15 @@
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
|
5 |
-
|
6 |
|
7 |
-
|
8 |
|
9 |
-
|
10 |
|
11 |
-
|
12 |
|
13 |
-
# RAM USAGE
|
14 |
Model | Initial RAM usage
|
15 |
:--:|:--:
|
16 |
ggml-pythia-70m-deduped-q4_3.bin | 121.2 MiB
|
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
|
5 |
+
**NOTE:** Tested with KoboldCpp. I can't promise that it will work with other frontends. Use at your own risk!
|
6 |
|
7 |
+
**2023-04-20:** Converted using ggerganov/ggml's "stablelm" conversion script and quantization code as of [commit 05f3079](https://github.com/ggerganov/ggml/tree/05f307971862b83df12fada0c42ee027ba5a82b5/examples/stablelm).
|
8 |
|
9 |
+
**2023-04-30:** Converted using ggerganov/ggml's "stablelm" conversion script and quantization code as of [commit 5dd92f4](https://github.com/ggerganov/ggml/tree/5dd92f421ee44f18b8fde0afbf5ca8fc7bf93841/examples/stablelm).
|
10 |
|
11 |
+
These conversions are based off of the updated Pythia Deduped checkpoints, not the original v0 trainings.
|
12 |
|
13 |
+
# RAM USAGE:
|
14 |
Model | Initial RAM usage
|
15 |
:--:|:--:
|
16 |
ggml-pythia-70m-deduped-q4_3.bin | 121.2 MiB
|