---
license: apache-2.0
language:
- en
tags:
- story
- general usage
- roleplay
- creative
- rp
- fantasy
- story telling
- ultra high precision
---

<B>NEO CLASS Ultra Quants for: Daredevil-8B-abliterated-Ultra</B>

The NEO Class tech was created after countless investigations and over 120 lab experiments, backed by real-world testing and qualitative results.

<b>NEO Class results:</b>

Better overall function, instruction following, output quality, and stronger connections to ideas, concepts, and the world in general.

In addition, quants now operate above their "grade", so to speak:

IE: Q4 / IQ4 operate at Q5KM/Q6 levels.

Likewise, Q3 / IQ3 operate at Q4KM/Q5 levels.

Perplexity drop of 724 points for the NEO Class Imatrix quant of IQ4XS vs the regular quant of IQ4XS.

(lower is better)
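For reference, perplexity is the exponential of the average negative log-likelihood per token, which is why a lower score is better. A minimal sketch of the comparison (the per-token log-probabilities below are made-up illustration values, not measurements from this model):

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp(average negative log-likelihood per token)."""
    avg_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(avg_nll)

# Hypothetical per-token log-probabilities for the same text under two quants.
regular_quant = [-2.1, -1.8, -2.4, -1.9]
neo_imatrix   = [-2.0, -1.7, -2.3, -1.8]

# The quant with the smaller average NLL scores the lower (better) perplexity.
print(perplexity(regular_quant) > perplexity(neo_imatrix))  # True
```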

<B>A funny thing happened on the way to the "lab"...</B>

Although this model uses a "Llama3" template, we found that Command-R's template worked better, specifically for creative purposes.

This applies to both normal quants and NEO quants.

Here is Command-R's template:

<PRE>
{
  "name": "Cohere Command R",
  "inference_params": {
    "input_prefix": "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>",
    "input_suffix": "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>",
    "antiprompt": [
      "<|START_OF_TURN_TOKEN|>",
      "<|END_OF_TURN_TOKEN|>"
    ],
    "pre_prompt_prefix": "<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>",
    "pre_prompt_suffix": ""
  }
}
</PRE>
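As a sanity check, the prefix/suffix fields above can be assembled into a full prompt by hand. A minimal sketch, assuming a single system prompt followed by one user turn (the `format_command_r` helper is a hypothetical illustration, not part of any library; it simply concatenates the template fields around the text):

```python
# Template fields copied from the JSON template above.
PRE_PROMPT_PREFIX = "<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>"
INPUT_PREFIX = "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>"
INPUT_SUFFIX = "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>"

def format_command_r(system_prompt: str, user_message: str) -> str:
    """Wrap a system prompt and one user turn in Command-R turn tokens,
    leaving the prompt open at the chatbot token for the model to continue."""
    return (PRE_PROMPT_PREFIX + system_prompt
            + INPUT_PREFIX + user_message
            + INPUT_SUFFIX)

prompt = format_command_r("You are a creative writer.", "Start a scene.")
print(prompt.startswith("<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>"))  # True
```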

This "interesting" issue was confirmed by multiple users.

<B>Model Notes:</B>

Maximum context is 32k. Please see the original model maker's page for details and usage information for this model.

Special thanks to the model creators at MLABONNE for making such a fantastic model:

[ https://huggingface.co/mlabonne/Daredevil-8B-abliterated ]

<h3>Sample Prompt and Models Compared:</h3>

Prompt tested with "temp=0" to ensure compliance, 2048 context (the model supports 32768 context / 32k), and the "chat" template for LLAMA3.

Additional parameters are also minimized.

PROMPT: <font color="red">"Start a 1000 word scene with: The sky scraper swayed, as she watched the window in front of her on the 21 floor explode..."</font>

<B>Original model IQ4XS - unaltered:</B>

<b>New NEO Class IQ4XS Imatrix:</b>
