Update README.md
Browse files
README.md
CHANGED
@@ -1,14 +1,18 @@
|
|
1 |
---
|
2 |
-
base_model:
|
|
|
|
|
3 |
tags:
|
4 |
- mergekit
|
5 |
- merge
|
6 |
-
|
7 |
---
|
8 |
# merge
|
9 |
|
10 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
11 |
|
|
|
|
|
12 |
## Merge Details
|
13 |
### Merge Method
|
14 |
|
@@ -17,8 +21,8 @@ This model was merged using the SLERP merge method.
|
|
17 |
### Models Merged
|
18 |
|
19 |
The following models were included in the merge:
|
20 |
-
* /
|
21 |
-
*
|
22 |
|
23 |
### Configuration
|
24 |
|
@@ -27,10 +31,10 @@ The following YAML configuration was used to produce this model:
|
|
27 |
```yaml
|
28 |
|
29 |
models:
|
30 |
-
- model:
|
31 |
-
- model:
|
32 |
merge_method: slerp
|
33 |
-
base_model:
|
34 |
parameters:
|
35 |
t:
|
36 |
- value: [0, 0.1, 0.2, 0.4, 0.5, 0.6, 0.5, 0.4, 0.2, 0.1, 0]
|
@@ -38,4 +42,4 @@ parameters:
|
|
38 |
dtype: float16
|
39 |
|
40 |
|
41 |
-
```
|
|
|
1 |
---
|
2 |
+
base_model:
|
3 |
+
- NeverSleep/CausalLM-RP-34B
|
4 |
+
- anthracite-org/magnum-v3-34b
|
5 |
tags:
|
6 |
- mergekit
|
7 |
- merge
|
8 |
+
license: apache-2.0
|
9 |
---
|
10 |
# merge
|
11 |
|
12 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
13 |
|
14 |
+
First time trying to merge this large of a model, feedback on result and/or method is always appreciated.
|
15 |
+
|
16 |
## Merge Details
|
17 |
### Merge Method
|
18 |
|
|
|
21 |
### Models Merged
|
22 |
|
23 |
The following models were included in the merge:
|
24 |
+
* NeverSleep/CausalLM-RP-34B
|
25 |
+
* anthracite-org/magnum-v3-34b
|
26 |
|
27 |
### Configuration
|
28 |
|
|
|
31 |
```yaml
|
32 |
|
33 |
models:
|
34 |
+
- model: ../models/anthracite-org_magnum-v3-34b
|
35 |
+
- model: ../models/NeverSleep_CausalLM-RP-34B
|
36 |
merge_method: slerp
|
37 |
+
base_model: ../models/NeverSleep_CausalLM-RP-34B
|
38 |
parameters:
|
39 |
t:
|
40 |
- value: [0, 0.1, 0.2, 0.4, 0.5, 0.6, 0.5, 0.4, 0.2, 0.1, 0]
|
|
|
42 |
dtype: float16
|
43 |
|
44 |
|
45 |
+
```
|