
This repo includes .gguf files built for HuggingFace/Candle. They will not work with llama.cpp.
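As a quick sanity check on a downloaded file, the fixed-size GGUF header can be inspected with a few lines of Python. This is only a minimal sketch: it assumes the GGUF v2+ header layout (4-byte magic, 32-bit version, 64-bit tensor count, 64-bit metadata key/value count, all little-endian), and `parse_gguf_header` is a hypothetical helper name, not part of Candle or llama.cpp. It confirms that a file is GGUF at all; it cannot tell whether a given runtime will accept the file.

```python
import struct

GGUF_MAGIC = b"GGUF"
# Assumed GGUF v2+ header: magic, version, tensor count, metadata key/value count.
HEADER_FORMAT = "<4sIQQ"
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)  # 24 bytes


def parse_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size header from the first bytes of a GGUF file."""
    if len(data) < HEADER_SIZE:
        raise ValueError(f"need at least {HEADER_SIZE} bytes, got {len(data)}")
    magic, version, tensor_count, kv_count = struct.unpack(
        HEADER_FORMAT, data[:HEADER_SIZE]
    )
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file (magic was {magic!r})")
    if version < 2:
        # Version 1 used 32-bit counts, so the layout assumed here does not apply.
        raise ValueError(f"unsupported GGUF version {version}")
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

To use it, read the first 24 bytes of the file in binary mode (for example `open(path, "rb").read(24)`, where `path` points at one of the .gguf files from this repo) and pass them to `parse_gguf_header`.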

Refer to the original repo for more details.

Model size: 7.24B params (GGUF)