Safetensors
llama
falcon3
8-bit precision
gptq
Falcon3-7B-Instruct-GPTQ-Int8 / quantize_config.json

Commit History