totally-not-an-llm committed
Commit: 44748e3
1 Parent(s): f770975
Update README.md
README.md
CHANGED
@@ -32,11 +32,11 @@ Training took about 1 hour using QLoRa on 1xA100, so this model can be recreated
 This is an early test, so here are some things to note on the model:
 
 ### Model quirks:
-- Due to the nature of the dataset, it does better with more detail. I've found it gives much better stories when I provide more requirements
-- It really likes to use numbered lists. I don't necessarily have a problem with this, but it's something to note when training on the dataset
--
--
--
+- Due to the nature of the dataset, it does better with more detail. I've found it gives much better stories when I provide more requirements
+- It really likes to use numbered lists. I don't necessarily have a problem with this, but it's something to note when training on the dataset
+- It likes to write fairy tales over anything else, which is strange. This can easily be fixed by prompting
+- Occasionally it will fall into repetition; this seems to be a common issue with llama-2 models
+- Haven't tested pushing it all the way to 16k context.
 
 ### Future plans:
 - Native finetune
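
For anyone trying out the notes added in this commit, here is a minimal usage sketch with the Hugging Face transformers library. The repo id is a placeholder (this page does not name the model), and the detailed prompt plus mild repetition_penalty are just one way to work with two of the quirks listed above (more requirements give better stories; llama-2 finetunes can fall into repetition), not settings recommended by the author.

```python
# Hypothetical usage sketch: the model repo id below is a placeholder, not taken
# from this page. It illustrates giving the model a detailed, requirement-heavy
# prompt and applying a mild repetition penalty during generation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-username/your-storywriting-finetune"  # placeholder, replace with the actual repo

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

# A prompt with explicit requirements tends to give better stories than a
# one-line request, per the quirks list above.
prompt = (
    "Write a 500-word science-fiction story set on a generation ship. "
    "Requirements: first-person narrator, a malfunctioning AI, and an ending "
    "that avoids fairy-tale tropes."
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=800,
    do_sample=True,
    temperature=0.8,
    repetition_penalty=1.15,  # mild penalty to counter the repetition quirk
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```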