pszemraj leaderboard-pr-bot commited on
Commit
3f0dcc1
1 Parent(s): 8143e34

Adding Evaluation Results (#1)

Browse files

- Adding Evaluation Results (05379a645276ded1dfe8d49aa0e3f68005a20315)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +20 -7
README.md CHANGED
@@ -1,17 +1,17 @@
1
  ---
 
 
2
  license: llama3
3
- base_model: meta-llama/Meta-Llama-3-8B
4
  tags:
5
  - axolotl
6
  - generated_from_trainer
7
- model-index:
8
- - name: Meta-Llama-3-8Bee
9
- results: []
10
  datasets:
11
  - BEE-spoke-data/bees-internal
12
- language:
13
- - en
14
  pipeline_tag: text-generation
 
 
 
15
  ---
16
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -135,4 +135,17 @@ The following hyperparameters were used during training:
135
  - Transformers 4.40.0.dev0
136
  - Pytorch 2.3.0+cu118
137
  - Datasets 2.15.0
138
- - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
  license: llama3
 
5
  tags:
6
  - axolotl
7
  - generated_from_trainer
8
+ base_model: meta-llama/Meta-Llama-3-8B
 
 
9
  datasets:
10
  - BEE-spoke-data/bees-internal
 
 
11
  pipeline_tag: text-generation
12
+ model-index:
13
+ - name: Meta-Llama-3-8Bee
14
+ results: []
15
  ---
16
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
135
  - Transformers 4.40.0.dev0
136
  - Pytorch 2.3.0+cu118
137
  - Datasets 2.15.0
138
+ - Tokenizers 0.15.0
139
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
140
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BEE-spoke-data__Meta-Llama-3-8Bee)
141
+
142
+ | Metric |Value|
143
+ |-------------------|----:|
144
+ |Avg. |14.49|
145
+ |IFEval (0-Shot) |19.51|
146
+ |BBH (3-Shot) |24.20|
147
+ |MATH Lvl 5 (4-Shot)| 3.85|
148
+ |GPQA (0-shot) | 8.50|
149
+ |MuSR (0-shot) | 6.24|
150
+ |MMLU-PRO (5-shot) |24.66|
151
+