Quantized GGUF available

by MaziyarPanahi - opened Mar 12, 2024

Mar 12, 2024

Hi,

Thanks for sharing your model. I quantized it to GGUF for those with limited hardware resources: https://huggingface.co/MaziyarPanahi/luxia-21.4b-alignment-v1.0-GGUF

Thanks again

Yuuru

Mar 12, 2024

GGML_ASSERT: D:\a\llama-cpp-python-cuBLAS-wheels\llama-cpp-python-cuBLAS-wheels\vendor\llama.cpp\llama.cpp:3493: codepoints_from_utf8(word).size() > 0

davideuler

Mar 18, 2024

GGML_ASSERT: D:\a\llama-cpp-python-cuBLAS-wheels\llama-cpp-python-cuBLAS-wheels\vendor\llama.cpp\llama.cpp:3493: codepoints_from_utf8(word).size() > 0

I've got the same error message.

MaziyarPanahi

Mar 18, 2024

Sorry for the inconvenience. I have reported the issue to llama.cpp: https://github.com/ggerganov/llama.cpp/issues/6132
