csabakecskemeti commited on
Commit
f816168
·
verified ·
1 Parent(s): c825d3c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -0
README.md CHANGED
@@ -2,6 +2,73 @@
2
  license: llama3.1
3
  base_model:
4
  - DevQuasar/HermesNova-Llama-3.1-8B
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  ---
6
 
7
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64e6d37e02dee9bcb9d9fa18/QQTtLQDPLHAU_Zg5doADx.jpeg)
 
2
  license: llama3.1
3
  base_model:
4
  - DevQuasar/HermesNova-Llama-3.1-8B
5
+ model-index:
6
+ - name: HermesNova-Llama-3.1-8B
7
+ results:
8
+ - task:
9
+ type: text-generation
10
+ dataset:
11
+ type: lm-evaluation-harness
12
+ name: bbh
13
+ metrics:
14
+ - name: acc_norm
15
+ type: acc_norm
16
+ value: 0.5418
17
+ verified: false
18
+ - task:
19
+ type: text-generation
20
+ dataset:
21
+ type: lm-evaluation-harness
22
+ name: gpqa
23
+ metrics:
24
+ - name: acc_norm
25
+ type: acc_norm
26
+ value: 0.3365
27
+ verified: false
28
+ - task:
29
+ type: text-generation
30
+ dataset:
31
+ type: lm-evaluation-harness
32
+ name: math
33
+ metrics:
34
+ - name: exact_match
35
+ type: exact_match
36
+ value: 0.1148
37
+ verified: false
38
+ - task:
39
+ type: text-generation
40
+ dataset:
41
+ type: lm-evaluation-harness
42
+ name: mmlu
43
+ metrics:
44
+ - name: acc_norm
45
+ type: acc_norm
46
+ value: 0.3729
47
+ verified: false
48
+ - task:
49
+ type: text-generation
50
+ dataset:
51
+ type: lm-evaluation-harness
52
+ name: musr
53
+ metrics:
54
+ - name: acc_norm
55
+ type: acc_norm
56
+ value: 0.4330
57
+ verified: false
58
+ - task:
59
+ type: text-generation
60
+ dataset:
61
+ type: lm-evaluation-harness
62
+ name: hellaswag
63
+ metrics:
64
+ - name: acc
65
+ type: acc
66
+ value: 0.6306512646883091
67
+ verified: false
68
+ - name: acc_norm
69
+ type: acc_norm
70
+ value: 0.818263294164509
71
+ verified: false
72
  ---
73
 
74
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64e6d37e02dee9bcb9d9fa18/QQTtLQDPLHAU_Zg5doADx.jpeg)