3B version?
#1
by
Chriffe
- opened
Great job!
I would love a 3b version for running on lower spec hardware. Would like to try using that with Home Assistant.
Thanks!
Would a Q3 quant do( 3.5GB)? I tested a newer version as Q3 and hardly noticed any difference between it and the Q8(!). I don't mind finetuning a 3b, but the tricky part is finding one that has somewhat decent Swedish.
That would be cool to try out! :)
I hope to have a new version up in a couple of days. I'll make a Q3 quant for it.
neph1
changed discussion status to
closed
There's a now a q3_k_m with the latest quantization stuff from llama.cpp. Let me know how it performs.
neph1
changed discussion status to
open