Commit 82f55b9 by BramVanroy
1 parent: f434cf0

Update README.md

README.md CHANGED
@@ -108,20 +108,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.1.2+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2
-
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-
-Results for the English Open LLM Leaderboard. For results specific to Dutch, check out [ScandEval](https://scandeval.com/dutch-nlg/).
-
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BramVanroy__fietje-2-chat)
-
-|      Metric       |Value|
-|-------------------|----:|
-|Avg.               |10.39|
-|IFEval (0-Shot)    |29.17|
-|BBH (3-Shot)       |17.72|
-|MATH Lvl 5 (4-Shot)| 0.53|
-|GPQA (0-shot)      | 0.00|
-|MuSR (0-shot)      | 3.20|
-|MMLU-PRO (5-shot)  |11.72|
-