BramVanroy
commited on
Commit
•
2e02b83
1
Parent(s):
cb7e24b
Update README.md
Browse files
README.md
CHANGED
@@ -110,20 +110,3 @@ The following hyperparameters were used during training:
|
|
110 |
- Pytorch 2.1.2+cu121
|
111 |
- Datasets 2.18.0
|
112 |
- Tokenizers 0.15.2
|
113 |
-
|
114 |
-
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
115 |
-
|
116 |
-
Results for the English Open LLM Leaderboard. For results specific to Dutch, check out [ScandEval](https://scandeval.com/dutch-nlg/).
|
117 |
-
|
118 |
-
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BramVanroy__fietje-2-instruct)
|
119 |
-
|
120 |
-
| Metric |Value|
|
121 |
-
|-------------------|----:|
|
122 |
-
|Avg. |10.20|
|
123 |
-
|IFEval (0-Shot) |27.90|
|
124 |
-
|BBH (3-Shot) |17.57|
|
125 |
-
|MATH Lvl 5 (4-Shot)| 0.53|
|
126 |
-
|GPQA (0-shot) | 0.00|
|
127 |
-
|MuSR (0-shot) | 2.91|
|
128 |
-
|MMLU-PRO (5-shot) |12.26|
|
129 |
-
|
|
|
110 |
- Pytorch 2.1.2+cu121
|
111 |
- Datasets 2.18.0
|
112 |
- Tokenizers 0.15.2
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|