Tags: Safetensors, llama, falcon3, 8-bit precision, gptq
Falcon3-3B-Instruct-GPTQ-Int8 / quantize_config.json

Commit History