Adding Evaluation Results
#2
by
leaderboard-pr-bot
- opened
README.md
CHANGED
@@ -156,3 +156,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
|
|
156 |
|MuSR (0-shot) |15.21|
|
157 |
|MMLU-PRO (5-shot) |29.62|
|
158 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
156 |
|MuSR (0-shot) |15.21|
|
157 |
|MMLU-PRO (5-shot) |29.62|
|
158 |
|
159 |
+
|
160 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
161 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/bamec66557__VICIOUS_MESH-12B-GAMMA-details)
|
162 |
+
|
163 |
+
| Metric |Value|
|
164 |
+
|-------------------|----:|
|
165 |
+
|Avg. |26.77|
|
166 |
+
|IFEval (0-Shot) |63.62|
|
167 |
+
|BBH (3-Shot) |31.49|
|
168 |
+
|MATH Lvl 5 (4-Shot)|12.16|
|
169 |
+
|GPQA (0-shot) | 8.50|
|
170 |
+
|MuSR (0-shot) |15.21|
|
171 |
+
|MMLU-PRO (5-shot) |29.62|
|
172 |
+
|