qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_q4km_embeddingf16_outputf16.gguf

Commit History

Good speed reference quant for older CPUs; however, not much improvement from the f16 embedding
dac48df
verified

nisten committed on