Update README.md
README.md CHANGED

@@ -14,6 +14,8 @@ quantized_by: bartowski
 pipeline_tag: text-generation
 ---
 
+# Eric has pulled this model due to decreased performance, will leave the quants up but downloader beware, performance isn't what was expected
+
 ## Exllama v2 Quantizations of dolphin-2.6.1-mixtral-8x7b
 
 Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.11">turboderp's ExLlamaV2 v0.0.11</a> for quantization.
@@ -24,7 +26,7 @@ Conversion was done using the default calibration dataset.
 
 Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
 
-Original model: https://huggingface.co/cognitivecomputations/dolphin-2.6.1-mixtral-8x7b
+Original model: ~https://huggingface.co/cognitivecomputations/dolphin-2.6.1-mixtral-8x7b~
 
 <a href="https://huggingface.co/bartowski/dolphin-2.6.1-mixtral-8x7b-exl2/tree/3_0">3.0 bits per weight</a>
 
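For context on the quantization settings described in the README above, here is a minimal sketch of how a >6.0 bpw ExLlamaV2 quant with an 8-bit lm_head could be produced with exllamav2's convert.py. This is not part of the model card; the bpw value and all paths are placeholders, and only the documented convert.py flags (-i, -o, -cf, -b, -hb) are used.

```python
# Sketch: reproduce an exl2 quant where bpw > 6.0 also bumps lm_head to 8 bits.
# Assumes convert.py from the exllamav2 repo is in the current directory and the
# source model has been downloaded locally; directory names are placeholders.
import subprocess

bpw = 6.5  # example target; anything above 6.0 per the note in the README

cmd = [
    "python", "convert.py",
    "-i", "dolphin-2.6.1-mixtral-8x7b",                    # unquantized source model
    "-o", "work_dir",                                       # scratch/working directory
    "-cf", f"dolphin-2.6.1-mixtral-8x7b-{bpw}bpw-exl2",     # output directory for the finished quant
    "-b", str(bpw),                                         # target bits per weight
]
if bpw > 6.0:
    cmd += ["-hb", "8"]  # quantize lm_head at 8 bits instead of the default 6

subprocess.run(cmd, check=True)
```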