Adding Evaluation Results
#2
by
T145
- opened
README.md
CHANGED
@@ -322,3 +322,17 @@ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-
|
|
322 |
|GPQA (0-shot) | 7.05|
|
323 |
|MuSR (0-shot) | 9.59|
|
324 |
|MMLU-PRO (5-shot) | 31.94|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
322 |
|GPQA (0-shot) | 7.05|
|
323 |
|MuSR (0-shot) | 9.59|
|
324 |
|MMLU-PRO (5-shot) | 31.94|
|
325 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
326 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/T145__ZEUS-8B-V2-abliterated-details)!
|
327 |
+
Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=T145%2FZEUS-8B-V2-abliterated&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
|
328 |
+
|
329 |
+
| Metric |Value (%)|
|
330 |
+
|-------------------|--------:|
|
331 |
+
|**Average** | 29.71|
|
332 |
+
|IFEval (0-Shot) | 78.95|
|
333 |
+
|BBH (3-Shot) | 30.98|
|
334 |
+
|MATH Lvl 5 (4-Shot)| 20.62|
|
335 |
+
|GPQA (0-shot) | 8.39|
|
336 |
+
|MuSR (0-shot) | 7.92|
|
337 |
+
|MMLU-PRO (5-shot) | 31.39|
|
338 |
+
|