Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

4-bit HQQ quantized version of Meta-Llama-3.1-405B (base version). Quantization parameters:

nbits=2, group_size=128, quant_zero=True, quant_scale=True, axis=0

Shards have been split with "split", to recombine:

cat qmodel_shard* > qmodel.pt

Downloads last month
6
Inference API
Unable to determine this model's library. Check the docs .