Add model in GGUF format for inference in llama.cpp.

This is the 110M parameter Llama 2 architecture model trained on the TinyStories dataset. These are converted from karpathy/tinyllamas. See the llama2.c project for more details.

Downloads last month: 17

GGUF

Model size

134M params

Architecture

llama

16-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for deniskirbaba/tinyllama-110M-F16-GGUF

Base model

nickypro/tinyllama-110M

Quantized

(3)

this model