How was this quantized?

by jlinux - opened

Can you share how this was quantized? I am unable to convert the model with convert.py from llama.cpp and then load the result successfully with either the BPE or SPM vocab. Your insights are appreciated :).
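For reference, a minimal sketch of the conversion path I was attempting (paths and model names here are illustrative; the vocab flag has been spelled `--vocabtype` in older llama.cpp checkouts and `--vocab-type` in newer ones, so check `python convert.py --help` for your version):

```bash
# Convert the Hugging Face model directory to an f16 GGUF.
# The vocab flag spelling depends on the llama.cpp checkout:
# older trees use --vocabtype, newer ones --vocab-type.
python convert.py /path/to/model-dir --outfile model-f16.gguf --vocabtype bpe

# Quantize the f16 GGUF down to 4-bit (q4_0 shown as one example type).
./quantize model-f16.gguf model-q4_0.gguf q4_0
```

On master at the time, the convert step is where it fails for this model regardless of which vocab type I pass.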

Closing: llama.cpp has a pending merge request adding support for this model; with that branch, conversion successfully generates a GGUF.

jlinux changed discussion status to closed
