alpindale/Mistral-7B-Instruct-v0.2-EETQ



This model was quantized using a modified version of the [EETQ](https://github.com/NetEase-FuXi/EETQ) repo. I am currently working on decoupling its kernels from CUTLASS to make this a bit easier to use.

Precision: 8-bit.
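For readers new to 8-bit weight quantization, the general idea can be sketched roughly as follows. This is only an illustration under assumptions: it uses symmetric, per-row int8 scaling in plain Python, and the function names `quantize_row_int8` and `dequantize_row` are hypothetical. It is not the code from EETQ or from the modified repo used for this model, whose kernels run on the GPU.

```python
def quantize_row_int8(row):
    """Symmetric int8 quantization of one weight row (illustrative only)."""
    # The largest magnitude in the row maps to 127; an all-zero row gets scale 1.
    max_abs = max(abs(x) for x in row)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the signed 8-bit range.
    q = [max(-127, min(127, round(x / scale))) for x in row]
    return q, scale


def dequantize_row(q, scale):
    """Recover approximate float weights from int8 values and their scale."""
    return [v * scale for v in q]


# Toy weight matrix (made-up values), quantized one row at a time.
weights = [[0.10, -0.52, 0.33, 0.00], [1.50, -2.25, 0.75, 3.00]]
quantized = [quantize_row_int8(row) for row in weights]
restored = [dequantize_row(q, s) for q, s in quantized]
```

Each restored weight differs from the original by at most half a quantization step (`scale / 2`), which is the usual trade-off for storing weights in 8 bits.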