3B version?

#1
by Chriffe - opened

Great job!
I would love a 3B version for running on lower-spec hardware. I'd like to try using it with Home Assistant.

Owner

Thanks!
Would a Q3 quant do (3.5GB)? I tested a newer version as Q3 and hardly noticed any difference between it and the Q8(!). I don't mind fine-tuning a 3B, but the tricky part is finding one that has somewhat decent Swedish.
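
For a rough sense of how quant level and parameter count trade off in file size, here is a minimal back-of-the-envelope sketch. The parameter counts and bits-per-weight figures below are illustrative assumptions, not measurements of this model, and the helper name `quant_size_gb` is made up for the example. Real GGUF files also carry some metadata and mix tensor types, so actual sizes will differ somewhat.

```python
def quant_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (10^9 bytes), ignoring file metadata."""
    return n_params * bits_per_weight / 8 / 1e9


# Hypothetical inputs: both the parameter counts and the average
# bits-per-weight values are assumptions chosen for illustration.
examples = [
    ("7B, Q3-class (~3.9 bits/weight)", 7e9, 3.9),
    ("7B, Q8-class (~8.5 bits/weight)", 7e9, 8.5),
    ("3B, Q8-class (~8.5 bits/weight)", 3e9, 8.5),
]

for label, n_params, bpw in examples:
    print(f"{label}: {quant_size_gb(n_params, bpw):.1f} GB")
```

Under these assumptions, a Q3-class quant of a 7B model lands in the same size range as a Q8-class quant of a 3B model, which is why a lower quant may be a reasonable alternative to a smaller model.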

That would be cool to try out! :)

Owner

I hope to have a new version up in a couple of days. I'll make a Q3 quant for it.

neph1 changed discussion status to closed
Owner

There's now a q3_k_m with the latest quantization stuff from llama.cpp. Let me know how it performs.

neph1 changed discussion status to open
