qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_iq4xs_embedding8_outputq8.gguf

Commit History

great quant if your chip has 8bit acceleration, slightly better than 4k embedding
0bc4249
verified

nisten commited on