File size: 787 Bytes
cede670
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---

license: other
license_name: nvidia-open-model-license
license_link: https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf
tags:
- mlx
---


# sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8

The Model [sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8](https://huggingface.co/sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8) was converted to MLX format from [nvidia/Llama-3.1-Minitron-4B-Width-Base](https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Width-Base) using mlx-lm version **0.17.0**.

## Use with mlx

```bash
pip install mlx-lm
```

```python
from mlx_lm import load, generate

model, tokenizer = load("sigjhl/Llama-3.1-Minitron-4B-Width-Base_mlx_q8")
response = generate(model, tokenizer, prompt="hello", verbose=True)
```