Adding Evaluation Results

#2
by T145 - opened
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -322,3 +322,17 @@ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-
322
  |GPQA (0-shot) | 7.05|
323
  |MuSR (0-shot) | 9.59|
324
  |MMLU-PRO (5-shot) | 31.94|
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
322
  |GPQA (0-shot) | 7.05|
323
  |MuSR (0-shot) | 9.59|
324
  |MMLU-PRO (5-shot) | 31.94|
325
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
326
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/T145__ZEUS-8B-V2-abliterated-details)!
327
+ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=T145%2FZEUS-8B-V2-abliterated&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
328
+
329
+ | Metric |Value (%)|
330
+ |-------------------|--------:|
331
+ |**Average** | 29.71|
332
+ |IFEval (0-Shot) | 78.95|
333
+ |BBH (3-Shot) | 30.98|
334
+ |MATH Lvl 5 (4-Shot)| 20.62|
335
+ |GPQA (0-shot) | 8.39|
336
+ |MuSR (0-shot) | 7.92|
337
+ |MMLU-PRO (5-shot) | 31.39|
338
+