BramVanroy leaderboard-pr-bot commited on
Commit
8cefc58
1 Parent(s): d4552cd

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (5e8f0a52187f5e68b886e73f73cdbaa73ae4c1f2)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +19 -6
README.md CHANGED
@@ -1,6 +1,7 @@
1
  ---
 
 
2
  license: cc-by-nc-4.0
3
- base_model: BramVanroy/GEITje-7B-ultra-sft
4
  tags:
5
  - alignment-handbook
6
  - generated_from_trainer
@@ -8,15 +9,14 @@ tags:
8
  - dpo
9
  - geitje
10
  - conversational
 
11
  datasets:
12
  - BramVanroy/ultra_feedback_dutch
 
 
13
  model-index:
14
  - name: BramVanroy/GEITje-7B-ultra
15
  results: []
16
- language:
17
- - nl
18
- pipeline_tag: text-generation
19
- inference: false
20
  ---
21
 
22
  <p align="center" style="margin:0;padding:0">
@@ -185,4 +185,17 @@ The following hyperparameters were used during training:
185
  - Transformers 4.36.2
186
  - Pytorch 2.1.2+cu121
187
  - Datasets 2.14.6
188
- - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - nl
4
  license: cc-by-nc-4.0
 
5
  tags:
6
  - alignment-handbook
7
  - generated_from_trainer
 
9
  - dpo
10
  - geitje
11
  - conversational
12
+ base_model: BramVanroy/GEITje-7B-ultra-sft
13
  datasets:
14
  - BramVanroy/ultra_feedback_dutch
15
+ pipeline_tag: text-generation
16
+ inference: false
17
  model-index:
18
  - name: BramVanroy/GEITje-7B-ultra
19
  results: []
 
 
 
 
20
  ---
21
 
22
  <p align="center" style="margin:0;padding:0">
 
185
  - Transformers 4.36.2
186
  - Pytorch 2.1.2+cu121
187
  - Datasets 2.14.6
188
+ - Tokenizers 0.15.0
189
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
190
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BramVanroy__GEITje-7B-ultra)
191
+
192
+ | Metric |Value|
193
+ |-------------------|----:|
194
+ |Avg. |10.91|
195
+ |IFEval (0-Shot) |37.23|
196
+ |BBH (3-Shot) |12.88|
197
+ |MATH Lvl 5 (4-Shot)| 0.91|
198
+ |GPQA (0-shot) | 1.68|
199
+ |MuSR (0-shot) | 1.52|
200
+ |MMLU-PRO (5-shot) |11.24|
201
+