Why the model is so large?

#1
by JohnLaw - opened

a 2b model in gguf format is 10Gb?

Google org

That's right! The weights are store in fp32 precision (see here) meaning 2.6B params * 4 bytes per param = ~10GB.
image.png

Google org

Hi @JohnLaw , Could you please confirm if this issue is resolved for above Answered . We can close this issue or else if you have any concerns let us know. Thank you.

Sign up or log in to comment