bdambrosio
/

Mixtral-8x22b-instruct-oh-6.0bpw-exl2

Text Generation

Transformers

Safetensors

mixtral

conversational

text-generation-inference

Inference Endpoints

6-bit

exl2

Model card Files Files and versions Community

6.0 bit exl2 quant (8 vut head) of Fireworks Hermes 2.5 fine tune of Mixtral-8x22b

Use Vicuna prompt template

needs ~ 120GB vRam (2xA100 or 3X RTX 6000)

Downloads last month: 6

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.