3B version?

by Chriffe - opened Jan 21

Jan 21

Great job!
I would love a 3b version for running on lower spec hardware. Would like to try using that with Home Assistant.

neph1

Owner Jan 21

Thanks!
Would a Q3 quant do( 3.5GB)? I tested a newer version as Q3 and hardly noticed any difference between it and the Q8(!). I don't mind finetuning a 3b, but the tricky part is finding one that has somewhat decent Swedish.

Chriffe

Jan 21

That would be cool to try out! :)

neph1

Owner Jan 22

I hope to have a new version up in a couple of days. I'll make a Q3 quant for it.

neph1 changed discussion status to closed Jan 22

neph1

Owner Jan 24

There's a now a q3_k_m with the latest quantization stuff from llama.cpp. Let me know how it performs.

neph1 changed discussion status to open Feb 5

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment