Pixtral-Large-Instruct-2411 🧡 ExLlamaV2 4.5bpw Quant

ExLlamaV2 quant of Pixtral-Large-Instruct-2411 at 4.5 bits per weight (bpw).

Vision inputs work on the dev branch of ExLlamaV2.

Tokenizer And Prompt Template

Uses a conversion of the v7m1 tokenizer with a 32k vocab size.

Chat template in chat_template.json uses the v7 instruct template:

<s>[SYSTEM_PROMPT] <system prompt>[/SYSTEM_PROMPT][INST] <user message>[/INST] <assistant response></s>[INST] <user message>[/INST]
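For anyone assembling prompts by hand rather than through chat_template.json, here is a minimal sketch of how the template line above could be rendered from a list of messages. The function name `build_v7_prompt` and the message dict layout are assumptions made for illustration, not part of this repo. The whitespace simply mirrors the template line as written, and the sketch covers text-only turns (no image tokens).

```python
def build_v7_prompt(messages, bos="<s>", eos="</s>"):
    """Render {"role", "content"} dicts following the v7 instruct template line.

    Hypothetical helper: hand-written from the template shown above,
    not extracted from chat_template.json.
    """
    parts = [bos]
    for message in messages:
        role, content = message["role"], message["content"]
        if role == "system":
            parts.append(f"[SYSTEM_PROMPT] {content}[/SYSTEM_PROMPT]")
        elif role == "user":
            parts.append(f"[INST] {content}[/INST]")
        elif role == "assistant":
            # Assistant turns are preceded by a space and closed with EOS.
            parts.append(f" {content}{eos}")
        else:
            raise ValueError(f"unsupported role: {role!r}")
    return "".join(parts)


if __name__ == "__main__":
    example = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ]
    print(build_v7_prompt(example))
```

If the backend you use applies chat_template.json itself, prefer that over a hand-rolled string, since the file is the authoritative definition of the format.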

Available Sizes

