Joseph717171's picture
Create README.md
ff553b2 verified
|
raw
history blame
232 Bytes

Custom GGUF quants of Meta’s Llama-3.2-3B-Instruct, where the Output Tensors are quantized to Q8_0 or kept at F32, and the Embeddings are kept at F32. Enjoy! 🧠🔥🚀