Replete-Coder-Llama3-8B-GGUF / imatrix-q2_k.dat

Commit History