Text Generation
Transformers
Safetensors
PyTorch
English
mistral
finetuned
quantized
4-bit precision
AWQ
conversational
Inference Endpoints
text-generation-inference
awq
Suparious's picture
Create quant_config.json
09b619c verified
raw history blame
No virus
90 Bytes
{
"zero_point": true,
"q_group_size": 128,
"w_bit": 4,
"version": "GEMM"
}