Edit model card

Base Llama3 model after quantization into Int8 in weights and activations.

Downloads last month
2
Inference API
This model can be loaded on Inference API (serverless).