Update README.md
totally-not-an-llm committed · Commit 825f454 · 1 Parent(s): 44748e3
README.md CHANGED
@@ -12,6 +12,8 @@ Introducing EverythingLM, a llama-2 based, general-purpose 13b model with 16k co
 
 The model is completely uncensored.
 
+This model is an early test of the EverythingLM dataset and some new experimental principles, so don't consider it SOTA.
+
 ### Notable features:
 - Automatically triggered CoT reasoning
 - Verbose and detailed replies
@@ -29,8 +31,6 @@ ASSISTANT:
 
 Training took about 1 hour using QLoRa on 1xA100, so this model can be recreated for about $3. QLoRa model can be found here: https://huggingface.co/totally-not-an-llm/EverythingLM-13b-peft.
 
-This is an early test, so here are some things to note on the model:
-
 ### Model quirks:
 - Due to the nature of the dataset, it does better with more detail. I've found it gives much better stories when I provide more requirements
 - It really likes to use numbered lists. I don't necessarily have a problem with this but it's something to note when training on the dataset
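Since the commit links the QLoRa adapter rather than merged weights, here is a minimal sketch of applying that adapter to a base llama-2 13b checkpoint with transformers + peft. The base model id and the USER:/ASSISTANT: prompt shape are assumptions; the exact prompt template lives in the part of the README not shown in this diff.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Assumed base checkpoint; substitute whichever llama-2 13b model
# the adapter was actually trained against.
base_id = "meta-llama/Llama-2-13b-hf"
# Adapter repo taken from the README above.
adapter_id = "totally-not-an-llm/EverythingLM-13b-peft"

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, device_map="auto"
)
# Attach the LoRA weights on top of the frozen base model.
model = PeftModel.from_pretrained(base, adapter_id)

# Hypothetical prompt; the USER:/ASSISTANT: framing is inferred from
# the "ASSISTANT:" context line in the second hunk.
prompt = "USER: Write a short story about a lighthouse keeper.\nASSISTANT:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```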