Quantized with these parameters:

--bits 4

--group_size 128

--desc_act 1

--damp 0.1

--seqlen 16384

--num_samples 512

Quantization Dataset: Erotiquant XL

Downloads last month
26
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Space using openerotica/Llama-3-lima-nsfw-16k-test-GPTQ 1