⇲ Click here to expand/hide information – General chart with relative quant parformances.

Recommended read:

"Which GGUF is right for me? (Opinionated)" by Artefact2

Click the image to view full size.

GGUF

Model size

1.1B params

Architecture

llama

Inference API (serverless) has been turned off for this model.

Collection including LWDCLS/KobbleTinyV2-1.1B-GGUF-IQ-Imatrix-Request