license: apache-2.0

NOTE: Tested with KoboldCpp. I can't promise that it will work with other frontends. Use at your own risk!

2023-04-20: Converted using ggerganov/ggml's "stablelm" conversion script and quantization code as of commit 05f3079.

2023-04-30: Converted using ggerganov/ggml's "stablelm" conversion script and quantization code as of commit 5dd92f4.

These conversions are based on the updated Pythia Deduped checkpoints, not the original v0 training runs.

RAM USAGE:

| Model | Initial RAM usage |
|---|---|
| ggml-pythia-70m-deduped-q4_3.bin | 121.2 MiB |
| ggml-pythia-160m-deduped-q4_3.bin | 225.2 MiB |
| ggml-pythia-410m-deduped-q4_3.bin | 498.1 MiB |
| ggml-pythia-1b-deduped-q4_3.bin | 951.5 MiB |
| ggml-pythia-1.4b-deduped-q4_3.bin | 1.3 GiB |
| ggml-pythia-2.8b-deduped-q4_3.bin | 2.4 GiB |
| ggml-pythia-6.9b-deduped-q4_3.bin | 5.4 GiB |
| ggml-pythia-12b-deduped-q4_3.bin | 9.2 GiB |
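
As a rough aid for picking a file, the sketch below chooses the largest model whose initial RAM usage fits a given memory budget. The figures are copied from the table above (GiB values converted at 1 GiB = 1024 MiB). The function name `largest_model_that_fits` and the 512 MiB default headroom are assumptions made for illustration, not measured values; since the table reports only initial usage, actual usage during generation will likely be higher.

```python
# Initial RAM usage in MiB, copied from the table above.
# GiB entries are converted with 1 GiB = 1024 MiB.
INITIAL_RAM_MIB = {
    "ggml-pythia-70m-deduped-q4_3.bin": 121.2,
    "ggml-pythia-160m-deduped-q4_3.bin": 225.2,
    "ggml-pythia-410m-deduped-q4_3.bin": 498.1,
    "ggml-pythia-1b-deduped-q4_3.bin": 951.5,
    "ggml-pythia-1.4b-deduped-q4_3.bin": 1.3 * 1024,
    "ggml-pythia-2.8b-deduped-q4_3.bin": 2.4 * 1024,
    "ggml-pythia-6.9b-deduped-q4_3.bin": 5.4 * 1024,
    "ggml-pythia-12b-deduped-q4_3.bin": 9.2 * 1024,
}


def largest_model_that_fits(available_mib, headroom_mib=512.0):
    """Return the largest model file whose initial RAM usage fits the budget.

    headroom_mib is a hypothetical safety margin (an assumption, not a
    measured figure) reserved for context and the rest of the system.
    Returns None if no model fits.
    """
    budget = available_mib - headroom_mib
    candidates = [
        (ram, name) for name, ram in INITIAL_RAM_MIB.items() if ram <= budget
    ]
    if not candidates:
        return None
    # max() compares by RAM usage first, so this picks the biggest model that fits.
    return max(candidates)[1]


if __name__ == "__main__":
    # Example: a machine with 4 GiB of free RAM.
    print(largest_model_that_fits(4 * 1024))
```

Adjust `headroom_mib` to match your own setup; the default is only a placeholder.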