Custom GGUF Quants with iMatrix for:

https://huggingface.co/MarsupialAI/LaDameBlanche-v2-95b

- Q8_0 used as quant base: https://huggingface.co/mradermacher/LaDameBlanche-v2-95b-GGUF
- iMatrix here: https://huggingface.co/mradermacher/LaDameBlanche-v2-95b-i1-GGUF

(Yes, I'm lazy, but I can live with a 0.01 ppl bump ^^)

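A requant along these lines can be produced with llama.cpp's quantize tool, feeding it the downloaded Q8_0 GGUF and the importance matrix. A minimal sketch, with placeholder file names; `IQ2_XS` stands in for the custom IQ2_LR mix here, since IQ2_LR is not a stock llama.cpp quant type:

```shell
# Sketch: requantize a Q8_0 GGUF down to a 2-bit iMatrix quant with llama.cpp.
# File names are placeholders -- point them at the downloaded Q8_0 and imatrix files.
./llama-quantize \
    --imatrix LaDameBlanche-v2-95b.imatrix \
    LaDameBlanche-v2-95b.Q8_0.gguf \
    LaDameBlanche-v2-95b.IQ2_XS.gguf \
    IQ2_XS
```

Requantizing from the Q8_0 instead of the original FP16 weights is what costs the small ppl bump mentioned above.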
The model is a great merge, sensical and creative; in my opinion it works better under modest hardware requirements than the 100b+ Miqu merges, which are worthwhile only for those with 48GB of VRAM or more.

In IQ2_LR (2.7 BPW, for 8k context with 36GB of VRAM and an iGPU running the OS display), it scores 57 on ARC Challenge and 77 on ARC Easy, with a PPL-512 of 4.5860.

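As a sanity check on the VRAM figure: the raw weight footprint at a given bits-per-weight is simply parameter count × BPW / 8. A quick sketch (the ~95e9 parameter count is an approximation from the model name):

```python
def weight_size_gb(params: float, bpw: float) -> float:
    """Approximate quantized weight size in decimal GB at a given bits-per-weight."""
    return params * bpw / 8 / 1e9

# ~95e9 parameters at 2.7 BPW -> roughly 32 GB of weights, leaving a few GB
# of the 36 GB budget for the 8k-context KV cache and compute buffers.
print(round(weight_size_gb(95e9, 2.7), 2))  # -> 32.06
```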
Ladies and gentlemen, you are served!