Update README.md
Updated description. Minor corrections.
README.md
CHANGED
@@ -6,9 +6,13 @@ library_name: transformers
 tags:
 - mergekit
 - merge
-
+license: cc-by-nc-4.0
 ---
-#
+# kukulemon-7B
+
+Two similar models with strong reasoning were merged, hopefully resulting in a "dense" encoding of that reasoning; the result was then merged with a model targeting roleplay.
+
+I've tested with ChatML prompts at temperature=1.1 and minP=0.03. The model itself supports Alpaca format prompts. The model claims a context length of 32K, but I've only tested to 8K to date.
 
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
@@ -47,4 +51,4 @@ parameters:
     - value: 0.5 # fallback for rest of tensors
 dtype: float16
 
-```
+```
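The testing notes above mention ChatML prompts with temperature=1.1 and minP=0.03. As a minimal sketch of what that means in practice (the card names no specific runtime, so the helper function and the sampling-settings dict below are illustrative assumptions, not part of the model card):

```python
# Sketch of the ChatML prompt format the card refers to, plus the sampling
# settings it reports (temperature=1.1, min_p=0.03). The helper below is
# hypothetical; the model card does not prescribe a runtime or API.

def to_chatml(messages):
    """Render a list of {role, content} dicts in ChatML format."""
    parts = [
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>"
        for m in messages
    ]
    # Leave the prompt open so the model generates the assistant's reply.
    return "\n".join(parts) + "\n<|im_start|>assistant\n"

# Sampling settings reported in the card (keyword names are assumptions).
sampling = {"temperature": 1.1, "min_p": 0.03}

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize SLERP merging in one sentence."},
])
print(prompt)
```

The same messages could be rendered in Alpaca format instead, since the card notes the model supports both; only the delimiter tokens differ.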