
Linked imatrix GGUF quants are based on older llama.cpp without rope fix

#3
by bartowski - opened

Just thought you should know.

The static ones are updated, but @MarsupialAI, you may want to consider updating yours.

In the meantime, if anyone needs them, mine are updated:

https://huggingface.co/bartowski/L3.1-8B-Celeste-V1.5-GGUF

Nothing is Real org

Thank you for letting me know and for providing up-to-date quants. I updated the card to include them.
