--- language: - en license: apache-2.0 tags: - mlx datasets: - Locutusque/hercules-v5.0 inference: parameters: do_sample: true temperature: 0.8 top_p: 0.95 top_k: 40 min_p: 0.1 max_new_tokens: 250 repetition_penalty: 1.1 --- # mlx-community/Hercules-5.0-Qwen2-1.5B-4bits The Model [mlx-community/Hercules-5.0-Qwen2-1.5B-4bits](https://huggingface.co/mlx-community/Hercules-5.0-Qwen2-1.5B-4bits) was converted to MLX format from [M4-ai/Hercules-5.0-Qwen2-1.5B](https://huggingface.co/M4-ai/Hercules-5.0-Qwen2-1.5B) using mlx-lm version **0.14.0**. ## Use with mlx ```bash pip install mlx-lm ``` ```python from mlx_lm import load, generate model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits") response = generate(model, tokenizer, prompt="hello", verbose=True) ```