GGUF Mistral-Nemo-2407-Instruct OQ8_0.EF32 IQuants
Collection
Custom GGUF quants of Mistral-Nemo-2407-Instruct, where the Output Tensors are quantized to Q8_0 while the Embeddings are kept at F32. π§ π₯π
β’
1 item
β’
Updated