Here are a few GGUF (v2) quantizations of the model conceptofmind/Open-LLongMA-3b.

Open LLongMA 3B is a language model trained for a context size of 8192 tokens using linear RoPE scaling (rope_scaling) with a factor of 0.25. With a factor of 1.0 it will output gibberish.
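
To illustrate what a linear scaling factor of 0.25 does, here is a minimal sketch of rotary position angles with linear position scaling. It is not the model's actual implementation: the function name `rope_angles`, the tiny `head_dim`, the `base` of 10000, and the 2048-token original context are all assumptions made for illustration. The idea is that each position is multiplied by the factor before the rotary angles are computed, so position 8191 is mapped back into the range the base model saw during pretraining.

```python
def rope_angles(position, head_dim=8, base=10000.0, scale=1.0):
    """Rotary angles for one token position.

    `scale` linearly compresses the position before the angles are computed
    (hypothetical helper, for illustration only).
    """
    scaled_pos = position * scale
    return [scaled_pos / (base ** (2 * i / head_dim)) for i in range(head_dim // 2)]


ORIGINAL_CTX = 2048                  # assumed context length before extension
EXTENDED_CTX = 8192                  # context length stated in this model card
SCALE = ORIGINAL_CTX / EXTENDED_CTX  # 0.25, the factor stated in this model card

last_position = EXTENDED_CTX - 1

# The first frequency has a divisor of 1, so its angle equals the scaled position.
scaled_first_angle = rope_angles(last_position, scale=SCALE)[0]
unscaled_first_angle = rope_angles(last_position, scale=1.0)[0]

print(scaled_first_angle)    # stays below ORIGINAL_CTX
print(unscaled_first_angle)  # far outside the assumed original range
```

With the factor left at 1.0, long positions produce angles outside the range the model was trained on, which is consistent with the gibberish output described above. When loading the GGUF files, the factor has to be passed to the runtime; in llama.cpp this is, to my knowledge, the `--rope-freq-scale` option, but check the documentation of the version you use.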

Format: GGUF
Model size: 3.43B params
Architecture: llama
Quantizations: 4-bit, 5-bit


Repository: Aryanne/Open-LLongMA-3B-gguf
