Jsoo commited on
Commit
3e33fda
1 Parent(s): b4a3a9d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -49
README.md CHANGED
@@ -1,49 +1,49 @@
1
- ---
2
- base_model:
3
- - upstage/SOLAR-10.7B-Instruct-v1.0
4
- - Jsoo/solar-ko-common-merged
5
- library_name: transformers
6
- tags:
7
- - mergekit
8
- - merge
9
-
10
- ---
11
- # merged
12
-
13
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
-
15
- ## Merge Details
16
- ### Merge Method
17
-
18
- This model was merged using the SLERP merge method.
19
-
20
- ### Models Merged
21
-
22
- The following models were included in the merge:
23
- * [upstage/SOLAR-10.7B-Instruct-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0)
24
- * [Jsoo/solar-ko-common-merged](https://huggingface.co/Jsoo/solar-ko-common-merged)
25
-
26
- ### Configuration
27
-
28
- The following YAML configuration was used to produce this model:
29
-
30
- ```yaml
31
-
32
- slices:
33
- - sources:
34
- - model: upstage/SOLAR-10.7B-Instruct-v1.0
35
- layer_range: [0, 47]
36
- - model: Jsoo/solar-ko-common-merged
37
- layer_range: [0, 47]
38
- merge_method: slerp
39
- base_model: upstage/SOLAR-10.7B-Instruct-v1.0
40
- parameters:
41
- t:
42
- - filter: self_attn
43
- value: [0, 0.5, 0.3, 0.7, 1]
44
- - filter: mlp
45
- value: [0.28, 1.0, 1.0, 0.28]
46
- - value: 0.5
47
- dtype: bfloat16
48
-
49
- ```
 
1
+ ---
2
+ base_model:
3
+ - upstage/SOLAR-10.7B-Instruct-v1.0
4
+ - Jsoo/solar-ko-common-merged
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+ license: cc-by-nc-4.0
10
+ ---
11
+ # merged
12
+
13
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
+
15
+ ## Merge Details
16
+ ### Merge Method
17
+
18
+ This model was merged using the SLERP merge method.
19
+
20
+ ### Models Merged
21
+
22
+ The following models were included in the merge:
23
+ * [upstage/SOLAR-10.7B-Instruct-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0)
24
+ * [Jsoo/solar-ko-common-merged](https://huggingface.co/Jsoo/solar-ko-common-merged)
25
+
26
+ ### Configuration
27
+
28
+ The following YAML configuration was used to produce this model:
29
+
30
+ ```yaml
31
+
32
+ slices:
33
+ - sources:
34
+ - model: upstage/SOLAR-10.7B-Instruct-v1.0
35
+ layer_range: [0, 47]
36
+ - model: Jsoo/solar-ko-common-merged
37
+ layer_range: [0, 47]
38
+ merge_method: slerp
39
+ base_model: upstage/SOLAR-10.7B-Instruct-v1.0
40
+ parameters:
41
+ t:
42
+ - filter: self_attn
43
+ value: [0, 0.5, 0.3, 0.7, 1]
44
+ - filter: mlp
45
+ value: [0.28, 1.0, 1.0, 0.28]
46
+ - value: 0.5
47
+ dtype: bfloat16
48
+
49
+ ```