gghfez commited on
Commit
a112a0b
1 Parent(s): 6174b40

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - openbmb/Eurux-8x22b-nca
4
+ - alpindale/WizardLM-2-8x22B
5
+ - fireworks-ai/mixtral-8x22b-instruct-oh
6
+ - migtissera/Tess-2.0-Mixtral-8x22B
7
+ library_name: transformers
8
+ tags:
9
+ - mergekit
10
+ - merge
11
+
12
+ ---
13
+ # WizardLM-2-8x22B-BigMerge
14
+
15
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
16
+
17
+ ## Merge Details
18
+ ### Merge Method
19
+
20
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [alpindale/WizardLM-2-8x22B](https://huggingface.co/alpindale/WizardLM-2-8x22B) as a base.
21
+
22
+ ### Models Merged
23
+
24
+ The following models were included in the merge:
25
+ * [openbmb/Eurux-8x22b-nca](https://huggingface.co/openbmb/Eurux-8x22b-nca)
26
+ * [fireworks-ai/mixtral-8x22b-instruct-oh](https://huggingface.co/fireworks-ai/mixtral-8x22b-instruct-oh)
27
+ * [migtissera/Tess-2.0-Mixtral-8x22B](https://huggingface.co/migtissera/Tess-2.0-Mixtral-8x22B)
28
+
29
+ ### Configuration
30
+
31
+ The following YAML configuration was used to produce this model:
32
+
33
+ ```yaml
34
+ models:
35
+ - model: alpindale/WizardLM-2-8x22B
36
+ - model: openbmb/Eurux-8x22b-nca
37
+ - model: migtissera/Tess-2.0-Mixtral-8x22B
38
+ - model: fireworks-ai/mixtral-8x22b-instruct-oh
39
+ base_model: alpindale/WizardLM-2-8x22B
40
+ merge_method: model_stock
41
+ dtype: bfloat16
42
+
43
+ ```