lizpreciatior committed on
Commit
187e4fa
1 Parent(s): 1d37fd8
Files changed (1)
  1. README.md +42 -0
README.md CHANGED
@@ -1,3 +1,45 @@
1
  ---
2
  license: cc-by-sa-4.0
3
  ---
4
+
5
+
6
+ # lzlv_70B
7
+ ## A 13B-style merge of 70B models
8
+
9
+ A multi-model merge of several different LLaMA2 70B finetunes aimed at roleplay and creative work. The goal was to create a model that combines creativity with intelligence and prompt-following capabilities for an enhanced experience.
10
+
11
+ Did it work? I think so.
12
+
13
+
14
+ ## How it was done:
15
+
16
+ The following models were used to create this merge:
17
+ - NousResearch/Nous-Hermes-Llama2-70b - A great model for roleplay but not the best at following complex instructions
18
+ - Xwin-LM/Xwin-LM-70B-V0.1 - Excellent at following instructions and quite creative, with some drawbacks. It has been my main model since release, so I know it quite well.
19
+ - Doctor-Shotgun/mythospice-70b - The wildcard of the three. I was looking for a creative model aimed at NSFW and came across this while digging through hf. I had never heard of it, and apparently nobody had bothered to publish a quantized version before, so I downloaded it and quantized it myself. It turned out to be more or less what I was looking for, so I used it here.
20
+
21
+
22
+ The merging process was almost entirely lifted from Undi95's approach in Undi95/MXLewdMini-L2-13B. I picked three of my favourite models that would complement each other and adjusted the ratios to my preference.
23
+
24
+ To be concrete, the components are:
25
+
26
+ - Component 1: Merge of Mythospice x Xwin with SLERP gradient [0.25, 0.3, 0.5]
27
+ - Component 2: Merge of Xwin x Hermes with SLERP gradient [0.4, 0.3, 0.25]
28
+
29
+ Finally, Component 1 and Component 2 were merged with SLERP at weight 0.5.
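
For readers unfamiliar with SLERP merging, here is a minimal sketch of the idea in Python with NumPy. It is not the script used for this merge: the function names `slerp` and `layer_weights` are hypothetical, and it assumes that a gradient such as [0.25, 0.3, 0.5] is stretched linearly across layer depth and that weight 0 means "all first model" and weight 1 means "all second model". The README does not state which tool or convention was used.

```python
import numpy as np


def slerp(t, a, b, eps=1e-8):
    """Spherical linear interpolation between two weight tensors.

    t = 0 returns `a`, t = 1 returns `b` (assumed convention).
    """
    a_flat = a.ravel().astype(np.float64)
    b_flat = b.ravel().astype(np.float64)

    # The angle between the tensors is measured on their unit directions.
    a_unit = a_flat / (np.linalg.norm(a_flat) + eps)
    b_unit = b_flat / (np.linalg.norm(b_flat) + eps)
    omega = np.arccos(np.clip(np.dot(a_unit, b_unit), -1.0, 1.0))

    if np.sin(omega) < 1e-6:
        # Nearly parallel tensors: fall back to plain linear interpolation.
        out = (1.0 - t) * a_flat + t * b_flat
    else:
        out = (
            np.sin((1.0 - t) * omega) * a_flat + np.sin(t * omega) * b_flat
        ) / np.sin(omega)
    return out.reshape(a.shape)


def layer_weights(gradient, num_layers):
    """Stretch a short gradient list into one weight per layer.

    Assumption: anchor values are spaced evenly over the layer range and
    linearly interpolated in between.
    """
    anchors = np.linspace(0, num_layers - 1, num=len(gradient))
    return np.interp(np.arange(num_layers), anchors, gradient)


if __name__ == "__main__":
    # Toy example with 5 layers instead of a real 70B model.
    weights = layer_weights([0.25, 0.3, 0.5], num_layers=5)
    rng = np.random.default_rng(0)
    for layer, t in enumerate(weights):
        w_a = rng.normal(size=(4, 4))  # stand-in for model A's layer tensor
        w_b = rng.normal(size=(4, 4))  # stand-in for model B's layer tensor
        merged = slerp(t, w_a, w_b)
        print(layer, round(float(t), 3), merged.shape)
```

Under these assumptions, Component 1 and Component 2 would each be built with per-layer weights from their gradient, and the final merge would call `slerp` with a constant weight of 0.5 for every tensor.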
30
+
31
+ ## Benefits
32
+
33
+ I tested it for a day. It seems to retain the instruction-following capabilities of Xwin while appearing to have adopted a lot of the creativity of the other two models.
34
+ Is it better? Feels like it to me, subjectively. Is it truly better? I don't know, test it yourself.
35
+
36
+ ## Prompt Format:
37
+ Vicuna
38
+ USER: [prompt]
39
+ ASSISTANT:
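
As a small illustration of the format above, a helper like the following could assemble a single-turn prompt. The function name `build_prompt` is hypothetical, and the sketch assumes a single newline between the `USER:` and `ASSISTANT:` lines, as shown in the README.

```python
def build_prompt(user_message: str) -> str:
    """Format one user message in the USER/ASSISTANT layout shown above."""
    # The trailing "ASSISTANT:" leaves the model to write the reply.
    return f"USER: {user_message.strip()}\nASSISTANT:"


if __name__ == "__main__":
    print(build_prompt("Write a short scene set in a lighthouse."))
```

The model's completion is then whatever text it generates after `ASSISTANT:`.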
40
+
41
+
42
+ ## NSFW and other content
43
+ Due to the nature of many of the models that make up this merge, it can and will produce inappropriate content when prompted to do so. A jailbreak is not required. If you prompt it directly to commit a hate crime with zero additional context, there is a chance that it will refuse for the first regeneration or two. This should never happen with slightly more complex prompts or while it's acting out a character.
44
+ So be careful, or not.
45
+