Model Details

This is 01-ai/Yi-6B, quantized to 4-bit and serialized with AutoGPTQ.

Details here:

Yi: Fine-tune and Run One of the Best Bilingual LLMs on Your Computer
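
As a rough illustration of how a GPTQ checkpoint like this one is typically loaded, here is a minimal sketch using Hugging Face Transformers. It assumes that `transformers`, `optimum`, `auto-gptq`, and `accelerate` are installed and that a CUDA GPU is available; none of this is stated on this page. The repository name is the one shown on this page, while the `generate` helper and its defaults are hypothetical. Early Yi releases also required `trust_remote_code=True`; whether this checkpoint does is not stated here.

```python
MODEL_ID = "kaitchup/Yi-6B-gptq-4bit"  # repository name as shown on this page


def generate(prompt: str, max_new_tokens: int = 50) -> str:
    """Hypothetical helper: load the quantized model and complete `prompt`.

    Weights are downloaded on the first call.
    """
    # Imported lazily so the heavy dependencies are only needed when called.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # The GPTQ settings are read from the checkpoint's own config;
    # device_map="auto" (needs accelerate) places the weights on the GPU.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("The capital of France is")` should then return the prompt followed by the model's continuation.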

Safetensors · Model size: 1.27B params · Tensor types: I32, FP16