---
language:
- en
license: apache-2.0
tags:
- mlx
datasets:
- Locutusque/hercules-v5.0
inference:
  parameters:
    do_sample: true
    temperature: 0.8
    top_p: 0.95
    top_k: 40
    min_p: 0.1
    max_new_tokens: 250
    repetition_penalty: 1.1
---
# mlx-community/Hercules-5.0-Qwen2-1.5B-4bits
The model [mlx-community/Hercules-5.0-Qwen2-1.5B-4bits](https://huggingface.co/mlx-community/Hercules-5.0-Qwen2-1.5B-4bits) was converted to MLX format from [M4-ai/Hercules-5.0-Qwen2-1.5B](https://huggingface.co/M4-ai/Hercules-5.0-Qwen2-1.5B) using mlx-lm version **0.14.0**.
## Use with mlx
```bash
pip install mlx-lm
```
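Once installed, mlx-lm also exposes a command-line generator, which is handy for a quick smoke test. A minimal sketch; the flag names below assume the mlx-lm 0.14-era CLI (`--temp`, `--top-p`, `--max-tokens`):

```bash
python -m mlx_lm.generate \
  --model mlx-community/Hercules-5.0-Qwen2-1.5B-4bits \
  --prompt "hello" \
  --max-tokens 250 --temp 0.8 --top-p 0.95
```

The same model can be driven from the Python API: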
```python
from mlx_lm import load, generate

# Download the quantized weights and tokenizer from the Hugging Face Hub
model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits")

# verbose=True streams the generated text to stdout as it is produced
response = generate(model, tokenizer, prompt="hello", verbose=True)
```
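Hercules 5.0 is a chat-tuned model, so prompts should normally go through the tokenizer's chat template, and the sampling values from this card's frontmatter can be passed to `generate`. A minimal sketch, assuming the mlx-lm 0.14.x `generate` keyword arguments (`temp`, `top_p`, `repetition_penalty`, `max_tokens`); `top_k` and `min_p` are omitted because that interface does not accept them:

```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits")

# Wrap the user message in the model's chat template before generating
messages = [{"role": "user", "content": "Explain what 4-bit quantization does to a model."}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Sampling values mirror the inference parameters in this card's frontmatter
response = generate(
    model,
    tokenizer,
    prompt=prompt,
    max_tokens=250,
    temp=0.8,
    top_p=0.95,
    repetition_penalty=1.1,
    verbose=True,
)
```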