cstr committed on
Commit
f880b55
1 Parent(s): 7a33f85

Update README.md

  1. README.md +18 -0
README.md CHANGED
@@ -20,8 +20,26 @@ language:

  llama3.1-8b-spaetzle-v90 is a progressive merge of merges.

+ # evaluation
+
  German EQ-Bench v2_de: 69.93 (171/171). English (v2): 77.88 (171/171)

+ [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cstr__llama3.1-8b-spaetzle-v90)
+
+ | Metric |Value|
+ |-------------------|----:|
+ |Avg. |27.59|
+ |IFEval (0-Shot) |73.56|
+ |BBH (3-Shot) |32.76|
+ |MATH Lvl 5 (4-Shot)|13.37|
+ |GPQA (0-shot) | 4.36|
+ |MuSR (0-shot) |11.15|
+ |MMLU-PRO (5-shot) |30.34|
+
+
+ # merge tree
+
  The merge tree involves the following models:

  - NousResearch/Hermes-3-Llama-3.1-8B