bamec66557 committed on
Commit
4af6dbc
1 Parent(s): fc9ae1f

Update README.md

  1. README.md +51 -47
README.md CHANGED
@@ -1,47 +1,51 @@
- ---
- base_model: []
- library_name: transformers
- tags:
- - mergekit
- - merge
-
- ---
- # VICIOUS_MESH-12B-OMEGA
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using D:\VICIOUS_MESH-12B-NEMO as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
- * D:\VICIOUS_MESH-12B-BETA
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- models:
-   - model: D:\VICIOUS_MESH-12B-NEMO
-     parameters:
-       weight: 1
-       density: 1
-   - model: D:\VICIOUS_MESH-12B-BETA
-     parameters:
-       weight: 0.7
-       density: 1
- merge_method: ties
- base_model: D:\VICIOUS_MESH-12B-NEMO
- parameters:
-   weight: 1
-   density: 1
-   normalize: true
-   int8_mask: true
- tokenizer_source: D:\VICIOUS_MESH-12B-NEMO
- dtype: bfloat16
-
- ```
 
 
 
 
 
+ ---
+ base_model:
+ - bamec66557/VICIOUS_MESH-12B-NEMO
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ - not-for-all-audiences
+ license: apache-2.0
+ language:
+ - en
+ ---
+ # VICIOUS_MESH-12B-OMEGA
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using D:\VICIOUS_MESH-12B-NEMO as a base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * D:\VICIOUS_MESH-12B-BETA
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+   - model: D:\VICIOUS_MESH-12B-NEMO
+     parameters:
+       weight: 1
+       density: 1
+   - model: D:\VICIOUS_MESH-12B-BETA
+     parameters:
+       weight: 0.7
+       density: 1
+ merge_method: ties
+ base_model: D:\VICIOUS_MESH-12B-NEMO
+ parameters:
+   weight: 1
+   density: 1
+   normalize: true
+   int8_mask: true
+ tokenizer_source: D:\VICIOUS_MESH-12B-NEMO
+ dtype: bfloat16
+
+ ```
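
For readers unfamiliar with the `ties` merge method named in the configuration, the following is a minimal sketch of the general TIES idea (task vectors, trimming by `density`, sign election, and a weighted disjoint merge with optional `normalize`) on plain Python lists. It is an illustration under simplifying assumptions, not mergekit's implementation: the function name `ties_merge`, its signature, and the toy values are all hypothetical, and details such as `int8_mask` and `dtype` are not modeled.

```python
def ties_merge(base, models, weights, density=1.0, normalize=True):
    """Simplified TIES merge over flat parameter lists (illustrative only)."""
    n = len(base)

    # 1. Task vectors: difference between each fine-tuned model and the base.
    deltas = [[m[i] - base[i] for i in range(n)] for m in models]

    # 2. Trim: keep only the top `density` fraction of entries by magnitude.
    keep = max(1, round(density * n))
    trimmed = []
    for d in deltas:
        order = sorted(range(n), key=lambda i: abs(d[i]), reverse=True)
        top = set(order[:keep])
        trimmed.append([d[i] if i in top else 0.0 for i in range(n)])

    merged = []
    for i in range(n):
        weighted = [w * t[i] for w, t in zip(weights, trimmed)]

        # 3. Elect sign: the sign of the summed weighted deltas wins.
        sign = 1.0 if sum(weighted) >= 0 else -1.0

        # 4. Disjoint merge: combine only entries that agree with that sign.
        agree = [(w, v) for w, v in zip(weights, weighted) if v * sign > 0]
        value = sum(v for _, v in agree)
        if normalize and agree:
            # Divide by the total weight of the contributing models.
            value /= sum(w for w, _ in agree)

        merged.append(base[i] + value)
    return merged


# Toy example with made-up parameter values (not taken from any real model).
base = [0.0, 1.0, -1.0]
model_a = [1.0, 1.5, -2.0]
model_b = [-0.5, 2.0, -1.0]
merged = ties_merge(base, [model_a, model_b], weights=[1.0, 0.7])
```

In this sketch, a `density` of 1 keeps every entry of each task vector, so only the sign election and the weighting affect the result. Whether mergekit treats the base model's own entry in `models` the same way is not something this example establishes.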