minosu/godot_dodo_4x_60k_llama_7b · How can I get a GGUF model from your models?

May 8

Hello!
I want to test your model because it looks very cool!
But I don't know how to get a GGUF model from it. Could you point me to a link with information about this?

minosu

Owner May 9

Hey, thanks for your interest!

You should be able to follow the instructions in the llama.cpp repo to quantize this model: https://github.com/ggerganov/llama.cpp

There's detailed documentation in the README there, but in short, the process should look something like this:

1. Clone llama.cpp
2. make
3. Download the weights from HF and put them in a subfolder, like models/dodo
4. python3 -m pip install -r requirements.txt
5. python3 convert.py models/dodo/
6. ./quantize ./models/dodo/ggml-model-f16.gguf ./models/dodo/ggml-model-q4_0.gguf q4_0
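
As a rough sketch, the steps above can be collected into a small shell helper that only prints the commands for a given model folder and quantization type, so they can be reviewed before running. The function name `gguf_steps`, its defaults, and fetching the weights with `git clone` from the model's HF page (which needs git-lfs) are assumptions for illustration, not something stated in this thread. The script and binary names are the ones from the steps above and may differ in other llama.cpp checkouts.

```shell
#!/bin/sh
# Print the conversion commands listed above for a model folder and a
# quantization type. Nothing is executed here; pipe the output to `sh`
# (or copy it line by line) to actually run the steps.
# Assumptions (not from the thread): the helper name, the defaults, and
# downloading the weights via `git clone` with git-lfs installed.
gguf_steps() {
    model_dir="${1:-models/dodo}"   # subfolder inside the llama.cpp checkout
    quant="${2:-q4_0}"              # quantization type passed to ./quantize
    cat <<EOF
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make
git clone https://huggingface.co/minosu/godot_dodo_4x_60k_llama_7b $model_dir
python3 -m pip install -r requirements.txt
python3 convert.py $model_dir/
./quantize ./$model_dir/ggml-model-f16.gguf ./$model_dir/ggml-model-$quant.gguf $quant
EOF
}

gguf_steps "$@"
```

Called with no arguments, it prints the same commands as the list above, using models/dodo and q4_0. Passing a different folder or quantization type as the first and second argument changes only those parts of the printed commands.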

m-fireson

May 15

Thanks for the detailed description! I'll try following your steps now. 😊

m-fireson changed discussion status to closed May 15