Triangle104
commited on
Commit
•
cdde137
1
Parent(s):
2134b6c
Update README.md
Browse files
README.md
CHANGED
@@ -23,6 +23,58 @@ tags:
|
|
23 |
This model was converted to GGUF format from [`FallenMerick/MN-Chunky-Lotus-12B`](https://huggingface.co/FallenMerick/MN-Chunky-Lotus-12B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
24 |
Refer to the [original model card](https://huggingface.co/FallenMerick/MN-Chunky-Lotus-12B) for more details on the model.
|
25 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
## Use with llama.cpp
|
27 |
Install llama.cpp through brew (works on Mac and Linux)
|
28 |
|
|
|
23 |
This model was converted to GGUF format from [`FallenMerick/MN-Chunky-Lotus-12B`](https://huggingface.co/FallenMerick/MN-Chunky-Lotus-12B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
24 |
Refer to the [original model card](https://huggingface.co/FallenMerick/MN-Chunky-Lotus-12B) for more details on the model.
|
25 |
|
26 |
+
---
|
27 |
+
Model details:
|
28 |
+
-
|
29 |
+
I had originally planned to use this model for future/further merges, but decided to go ahead and release it since it scored rather high on my local EQ Bench testing (79.58 w/ 100% parsed @ 8-bit).
|
30 |
+
Bear in mind that most models tend to score a bit higher on my own local tests as compared to their posted scores. Still, its the highest score I've personally seen from all the models I've tested.
|
31 |
+
Its a decent model, with great emotional intelligence and acceptable adherence to various character personalities. It does a good job at roleplaying despite being a bit bland at times.
|
32 |
+
|
33 |
+
Overall, I like the way it writes, but it has a few formatting issues that show up from time to time, and it has an uncommon tendency to paste walls of character feelings/intentions at the end of some outputs without any prompting. This is something I hope to correct with future iterations.
|
34 |
+
|
35 |
+
This is a merge of pre-trained language models created using mergekit.
|
36 |
+
|
37 |
+
Merge Method
|
38 |
+
-
|
39 |
+
This model was merged using the TIES merge method.
|
40 |
+
|
41 |
+
Models Merged
|
42 |
+
-
|
43 |
+
The following models were included in the merge:
|
44 |
+
|
45 |
+
Epiculous/Violet_Twilight-v0.2
|
46 |
+
nbeerbower/mistral-nemo-gutenberg-12B-v4
|
47 |
+
flammenai/Mahou-1.5-mistral-nemo-12B
|
48 |
+
|
49 |
+
Configuration
|
50 |
+
-
|
51 |
+
The following YAML configuration was used to produce this model:
|
52 |
+
|
53 |
+
models:
|
54 |
+
- model: Epiculous/Violet_Twilight-v0.2
|
55 |
+
parameters:
|
56 |
+
weight: 1.0
|
57 |
+
density: 1.0
|
58 |
+
- model: nbeerbower/mistral-nemo-gutenberg-12B-v4
|
59 |
+
parameters:
|
60 |
+
weight: 1.0
|
61 |
+
density: 0.54
|
62 |
+
- model: flammenai/Mahou-1.5-mistral-nemo-12B
|
63 |
+
parameters:
|
64 |
+
weight: 1.0
|
65 |
+
density: 0.26
|
66 |
+
merge_method: ties
|
67 |
+
base_model: TheDrummer/Rocinante-12B-v1.1
|
68 |
+
parameters:
|
69 |
+
normalize: true
|
70 |
+
dtype: bfloat16
|
71 |
+
|
72 |
+
The idea behind this recipe was to take the long-form writing capabilities of Gutenberg, curtail it a bit with the very short output formatting of Mahou, and use Violet Twilight as an extremely solid roleplaying foundation underneath.
|
73 |
+
Rocinante is used as the base model in this merge in order to really target the delta weights from Gutenberg, since those seemed to have the highest impact on the resulting EQ of the model.
|
74 |
+
|
75 |
+
Special shoutout to @matchaaaaa for helping with testing, and for all the great model recommendations. Also, for just being an all around great person who's really inspired and motivated me to continue merging and working on models.
|
76 |
+
|
77 |
+
---
|
78 |
## Use with llama.cpp
|
79 |
Install llama.cpp through brew (works on Mac and Linux)
|
80 |
|