DavidAU committed
Commit
3424995
1 Parent(s): 097d21b

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -58,12 +58,14 @@ Example outputs below.
 - This is not a "happy ever after" model. It has a negative bias.
 - Output length will vary, however this model prefers shorter outputs unless you state the size.
 - For creative uses, different quants will produce slightly different output.
+- Due to the high stability and compressed nature of this model, all quants will operate at above average levels.
 - If you use rope to extend context, increase temp AND instruction detail levels to compensate for "rope issues".
 - Source code for this model (Bfloat16), Float 32 master GGUFs (and source), and Imatrix GGUF versions will be uploaded shortly at separate repos.
 
 Note the "float32" version of this model behaves VERY differently, which is why it was not uploaded first.
 
-The Imatrix versions of this model have even lower perplexity then both this model and Llama3 Instruct and enhanced output.
+The Imatrix versions of this model have even lower perplexity (1/2 an order of magnitude lower than this model, a full order of magnitude
+lower than Llama3 Instruct) than both this model and Llama3 Instruct, plus enhanced output.
 
 This is a LLAMA3 model, and requires the Llama3 template, but may work with other template(s); it has a maximum context of 8k / 8192.
 However this can be extended using "rope" settings up to 32k.
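The Llama3 template the README requires wraps each turn in header and end-of-turn special tokens. A minimal sketch of a single-turn prompt (the helper name is hypothetical; most runtimes apply this template automatically via their chat API):

```python
def llama3_prompt(system: str, user: str) -> str:
    """Build a single-turn Llama3-instruct prompt string.

    Hypothetical helper for illustration; uses the Llama 3
    special tokens (<|begin_of_text|>, header ids, <|eot_id|>).
    """
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        # Ends with an open assistant header so generation continues here.
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = llama3_prompt("You are a dark-fiction writer.", "Write a short scene.")
```

Using a different template may still work, per the note above, but the negative-bias behavior was tuned against this format.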
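Extending the 8k context to 32k with "rope" amounts to scaling the rotary-embedding frequencies so more positions fit into the trained angular range. A rough sketch of the linear-scaling arithmetic (values are illustrative; flag names vary by runtime):

```python
def rope_freq_scale(trained_ctx: int, target_ctx: int) -> float:
    # Linear RoPE scaling: shrink the per-position rotation so that
    # target_ctx positions span the same angular range the model
    # actually saw during training on trained_ctx positions.
    return trained_ctx / target_ctx

scale = rope_freq_scale(8192, 32768)
# With scale = 0.25, position 32768 rotates by the same angle that
# position 8192 did unscaled: 32768 * 0.25 == 8192.
```

In llama.cpp, for example, this corresponds (if I recall the flags correctly) to something like `--rope-freq-scale 0.25` together with a larger `--ctx-size`; stretched positions are also why the README suggests raising temp and instruction detail to compensate.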