Monor/Llama3-8B-Chinese-Chat-gguf

---
license: apache-2.0
---

## Introduction

This repository provides GGUF builds of [shenzhi-wang/Llama3-8B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat), converted to f16 and quantized to q2, q3, q4, q5, q6, and q8 with llama.cpp.
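
The quantization step can be sketched roughly as follows. This is a minimal illustration, not the exact procedure used for this repository: it assumes llama.cpp's `llama-quantize` tool (invoked as `llama-quantize <input> <output> <type>`), and the mapping from the short labels above to specific llama.cpp type names (for example `q4` to `Q4_K_M`) is a guess, since the README does not say which variants were used. The file names are hypothetical. The function only builds the command lines and does not run them.

```python
from pathlib import Path

# ASSUMPTION: the README only lists short labels (q2 ... q8). The llama.cpp
# type names below are common choices, not confirmed by the source.
ASSUMED_TYPES = {
    "q2": "Q2_K",
    "q3": "Q3_K_M",
    "q4": "Q4_K_M",
    "q5": "Q5_K_M",
    "q6": "Q6_K",
    "q8": "Q8_0",
}


def build_quantize_commands(f16_path, out_dir, binary="llama-quantize"):
    """Return one argv list per quantization level; nothing is executed."""
    f16_path = Path(f16_path)
    out_dir = Path(out_dir)
    # Drop a trailing "-f16" from the file stem, if present (Python 3.9+).
    stem = f16_path.stem.removesuffix("-f16")
    commands = []
    for label, qtype in ASSUMED_TYPES.items():
        # Hypothetical output naming scheme: <stem>-<label>.gguf
        out_file = out_dir / f"{stem}-{label}.gguf"
        commands.append([binary, str(f16_path), str(out_file), qtype])
    return commands


if __name__ == "__main__":
    # Hypothetical input file name, used only for illustration.
    for cmd in build_quantize_commands("Llama3-8B-Chinese-Chat-f16.gguf", "out"):
        print(" ".join(cmd))
```

Each returned list can be passed to `subprocess.run` once the f16 GGUF file exists and the llama.cpp binaries are on the `PATH`.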