hfl
/

chinese-mixtral-gguf

Mixture of Experts

Inference Endpoints

Model card Files Files and versions

chinese-mixtral-gguf / README.md

hfl-rc's picture

Update README.md

c210776 verified 10 months ago

|

820 Bytes

	---
	license: apache-2.0
	language:
	- zh
	- en
	---

	# Chinese-Mixtral-GGUF

	This repository contains the GGUF-v3 models (llama.cpp compatible) for Chinese-Mixtral (this is not a chat/instruction model).

	## Performance

	Metric: PPL, lower is better

	\| Quant \| PPL \|
	\| ----- \| ---- \|
	\| Q2_K \| \|
	\| Q3_K \| \|
	\| Q4_0 \| \|
	\| Q4_K \| \|
	\| Q5_0 \| \|
	\| Q5_K \| \|
	\| Q6_K \| \|
	\| Q8_0 \| \|
	\| F16 \| \|

	Due to the file size limitation, for F16 model, please use `cat` command to concatenate all parts into a single file. You must concatenate these parts in order.


	## Others


	For Hugging Face version, please see: https://huggingface.co/hfl/chinese-mixtral

	Please refer to [https://github.com/ymcui/Chinese-Mixtral/](https://github.com/ymcui/Chinese-Mixtral/) for more details.