---
language:
- en
license: apache-2.0
tags:
- mlx
datasets:
- Locutusque/hercules-v5.0
inference:
  parameters:
    do_sample: true
    temperature: 0.8
    top_p: 0.95
    top_k: 40
    min_p: 0.1
    max_new_tokens: 250
    repetition_penalty: 1.1
---
# mlx-community/Hercules-5.0-Qwen2-1.5B-4bits
The model [mlx-community/Hercules-5.0-Qwen2-1.5B-4bits](https://huggingface.co/mlx-community/Hercules-5.0-Qwen2-1.5B-4bits) was converted to MLX format from [M4-ai/Hercules-5.0-Qwen2-1.5B](https://huggingface.co/M4-ai/Hercules-5.0-Qwen2-1.5B) using mlx-lm version **0.14.0**.
## Use with mlx
```bash
pip install mlx-lm
```
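Once installed, mlx-lm also exposes a command-line generator, which is handy for a quick smoke test. A minimal sketch; the flag names below assume the mlx-lm 0.14-era CLI (`--temp`, `--top-p`, `--max-tokens`):

```bash
python -m mlx_lm.generate \
  --model mlx-community/Hercules-5.0-Qwen2-1.5B-4bits \
  --prompt "hello" \
  --max-tokens 250 --temp 0.8 --top-p 0.95
```

The same model can be driven from the Python API: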
```python
from mlx_lm import load, generate

# Download the quantized weights and tokenizer from the Hugging Face Hub
model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits")

# verbose=True streams the generated text to stdout as it is produced
response = generate(model, tokenizer, prompt="hello", verbose=True)
```
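Hercules 5.0 is a chat-tuned model, so prompts should normally go through the tokenizer's chat template, and the sampling values from this card's frontmatter can be passed to `generate`. A minimal sketch, assuming the mlx-lm 0.14.x `generate` keyword arguments (`temp`, `top_p`, `repetition_penalty`, `max_tokens`); `top_k` and `min_p` are omitted because that interface does not accept them:

```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Hercules-5.0-Qwen2-1.5B-4bits")

# Wrap the user message in the model's chat template before generating
messages = [{"role": "user", "content": "Explain what 4-bit quantization does to a model."}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Sampling values mirror the inference parameters in this card's frontmatter
response = generate(
    model,
    tokenizer,
    prompt=prompt,
    max_tokens=250,
    temp=0.8,
    top_p=0.95,
    repetition_penalty=1.1,
    verbose=True,
)
```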