
Llama 2 7B quantized with AutoGPTQ v0.3.0.

  • Group size: 32
  • Data type: INT4

This model is compatible with the first version of QA-LoRA.

To fine-tune it with QA-LoRA, follow this tutorial: Fine-tune Quantized Llama 2 on Your GPU with QA-LoRA
