license: other | |
license_name: nvidia-open-model-license | |
license_link: https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf | |
tags: | |
- mlx | |
# sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8 | |
The Model [sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8](https://huggingface.co/sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8) was converted to MLX format from [nvidia/Llama-3.1-Minitron-4B-Width-Base](https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Width-Base) using mlx-lm version **0.17.0**. | |
## Use with mlx | |
```bash | |
pip install mlx-lm | |
``` | |
```python | |
from mlx_lm import load, generate | |
model, tokenizer = load("sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8") | |
response = generate(model, tokenizer, prompt="hello", verbose=True) | |
``` | |