T145 commited on
Commit
1edf9d7
·
verified ·
1 Parent(s): fefa98a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +179 -0
README.md ADDED
@@ -0,0 +1,179 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - arcee-ai/Llama-3.1-SuperNova-Lite
4
+ - unsloth/Llama-3.1-Storm-8B
5
+ - unsloth/Meta-Llama-3.1-8B-Instruct
6
+ - Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2
7
+ library_name: transformers
8
+ tags:
9
+ - mergekit
10
+ - merge
11
+ model-index:
12
+ - name: ZEUS-8B-V13
13
+ results:
14
+ - task:
15
+ type: text-generation
16
+ name: Text Generation
17
+ dataset:
18
+ name: IFEval (0-Shot)
19
+ type: wis-k/instruction-following-eval
20
+ split: train
21
+ args:
22
+ num_few_shot: 0
23
+ metrics:
24
+ - type: inst_level_strict_acc and prompt_level_strict_acc
25
+ value: 79.04
26
+ name: averaged accuracy
27
+ source:
28
+ url: >-
29
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=T145%2FZEUS-8B-V13
30
+ name: Open LLM Leaderboard
31
+ - task:
32
+ type: text-generation
33
+ name: Text Generation
34
+ dataset:
35
+ name: BBH (3-Shot)
36
+ type: SaylorTwift/bbh
37
+ split: test
38
+ args:
39
+ num_few_shot: 3
40
+ metrics:
41
+ - type: acc_norm
42
+ value: 32.73
43
+ name: normalized accuracy
44
+ source:
45
+ url: >-
46
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=T145%2FZEUS-8B-V13
47
+ name: Open LLM Leaderboard
48
+ - task:
49
+ type: text-generation
50
+ name: Text Generation
51
+ dataset:
52
+ name: MATH Lvl 5 (4-Shot)
53
+ type: lighteval/MATH-Hard
54
+ split: test
55
+ args:
56
+ num_few_shot: 4
57
+ metrics:
58
+ - type: exact_match
59
+ value: 20.62
60
+ name: exact match
61
+ source:
62
+ url: >-
63
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=T145%2FZEUS-8B-V13
64
+ name: Open LLM Leaderboard
65
+ - task:
66
+ type: text-generation
67
+ name: Text Generation
68
+ dataset:
69
+ name: GPQA (0-shot)
70
+ type: Idavidrein/gpqa
71
+ split: train
72
+ args:
73
+ num_few_shot: 0
74
+ metrics:
75
+ - type: acc_norm
76
+ value: 9.84
77
+ name: acc_norm
78
+ source:
79
+ url: >-
80
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=T145%2FZEUS-8B-V13
81
+ name: Open LLM Leaderboard
82
+ - task:
83
+ type: text-generation
84
+ name: Text Generation
85
+ dataset:
86
+ name: MuSR (0-shot)
87
+ type: TAUR-Lab/MuSR
88
+ args:
89
+ num_few_shot: 0
90
+ metrics:
91
+ - type: acc_norm
92
+ value: 8.39
93
+ name: acc_norm
94
+ source:
95
+ url: >-
96
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=T145%2FZEUS-8B-V13
97
+ name: Open LLM Leaderboard
98
+ - task:
99
+ type: text-generation
100
+ name: Text Generation
101
+ dataset:
102
+ name: MMLU-PRO (5-shot)
103
+ type: TIGER-Lab/MMLU-Pro
104
+ config: main
105
+ split: test
106
+ args:
107
+ num_few_shot: 5
108
+ metrics:
109
+ - type: acc
110
+ value: 32.35
111
+ name: accuracy
112
+ source:
113
+ url: >-
114
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=T145%2FZEUS-8B-V13
115
+ name: Open LLM Leaderboard
116
+ license: llama3.1
117
+ ---
118
+ # ZEUS 8B 🌩️ V13
119
+
120
+ 👍
121
+
122
+ ## Merge Details
123
+ ### Merge Method
124
+
125
+ This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [unsloth/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/unsloth/Meta-Llama-3.1-8B-Instruct) as a base.
126
+
127
+ ### Models Merged
128
+
129
+ The following models were included in the merge:
130
+ * [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite)
131
+ * [unsloth/Llama-3.1-Storm-8B](https://huggingface.co/unsloth/Llama-3.1-Storm-8B)
132
+ * [Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2](https://huggingface.co/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2)
133
+
134
+ ### Configuration
135
+
136
+ The following YAML configuration was used to produce this model:
137
+
138
+ ```yaml
139
+ base_model: unsloth/Meta-Llama-3.1-8B-Instruct
140
+ dtype: bfloat16
141
+ merge_method: dare_ties
142
+ parameters:
143
+ int8_mask: 1.0
144
+ normalize: 1.0
145
+ random_seed: 145.0
146
+ slices:
147
+ - sources:
148
+ - layer_range: [0, 32]
149
+ model: unsloth/Llama-3.1-Storm-8B
150
+ parameters:
151
+ density: 0.95
152
+ weight: 0.33
153
+ - layer_range: [0, 32]
154
+ model: arcee-ai/Llama-3.1-SuperNova-Lite
155
+ parameters:
156
+ density: 0.9
157
+ weight: 0.29
158
+ - layer_range: [0, 32]
159
+ model: Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2
160
+ parameters:
161
+ density: 0.92
162
+ weight: 0.38
163
+ - layer_range: [0, 32]
164
+ model: unsloth/Meta-Llama-3.1-8B-Instruct
165
+ tokenizer_source: union
166
+ ```
167
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
168
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/T145__ZEUS-8B-V13-details)!
169
+ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=T145%2FZEUS-8B-V13&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
170
+
171
+ | Metric |Value (%)|
172
+ |-------------------|--------:|
173
+ |**Average** | 30.50|
174
+ |IFEval (0-Shot) | 79.04|
175
+ |BBH (3-Shot) | 32.73|
176
+ |MATH Lvl 5 (4-Shot)| 20.62|
177
+ |GPQA (0-shot) | 9.84|
178
+ |MuSR (0-shot) | 8.39|
179
+ |MMLU-PRO (5-shot) | 32.35|