How I can get gguf model from your models?
#1
by
m-fireson
- opened
Hello!
I want to test your model, because it looks very cool!
But I don't know how I can get gguf model from yours, can you help me with link to information about it?
Hey, thanks for your interest!
You should be able to follow the instructions in the llama.cpp repo to quantize this model: https://github.com/ggerganov/llama.cpp
There's detailed documentation in the readme there, but in short the process should look something like this:
- Clone llama.cpp
make
- Download weights from HF and put in subfolder, like
models/dodo
python3 -m pip install -r requirements.txt
python3 convert.py models/dodo/
./quantize ./models/dodo/ggml-model-f16.gguf ./models/dodo/ggml-model-q4_0.gguf q4_0
Thanks for the detailed description, now I’ll try to follow your steps! 😊
m-fireson
changed discussion status to
closed