arnocandel
commited on
Commit
·
237e779
1
Parent(s):
1575e79
commit files to HF hub
Browse files
README.md
CHANGED
@@ -129,6 +129,27 @@ GPTNeoXConfig {
|
|
129 |
|
130 |
```
|
131 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
132 |
## Disclaimer
|
133 |
|
134 |
Please read this disclaimer carefully before using the large language model provided in this repository. Your use of the model signifies your agreement to the following terms and conditions.
|
|
|
129 |
|
130 |
```
|
131 |
|
132 |
+
## Model Validation
|
133 |
+
|
134 |
+
Model validation results using [EleutherAI lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) out of the box:
|
135 |
+
|
136 |
+
|
137 |
+
| Task |Version| Metric |Value | |Stderr|
|
138 |
+
|-------------|------:|--------|-----:|---|-----:|
|
139 |
+
|boolq | 1|acc |0.6266|± |0.0085|
|
140 |
+
|arc_challenge| 0|acc |0.3225|± |0.0137|
|
141 |
+
| | |acc_norm|0.3396|± |0.0138|
|
142 |
+
|openbookqa | 0|acc |0.2660|± |0.0198|
|
143 |
+
| | |acc_norm|0.3660|± |0.0216|
|
144 |
+
|arc_easy | 0|acc |0.6776|± |0.0096|
|
145 |
+
| | |acc_norm|0.6195|± |0.0100|
|
146 |
+
|hellaswag | 0|acc |0.4822|± |0.0050|
|
147 |
+
| | |acc_norm|0.6465|± |0.0048|
|
148 |
+
|winogrande | 0|acc |0.6219|± |0.0136|
|
149 |
+
|piqa | 0|acc |0.7530|± |0.0101|
|
150 |
+
| | |acc_norm|0.7606|± |0.0100|
|
151 |
+
|
152 |
+
|
153 |
## Disclaimer
|
154 |
|
155 |
Please read this disclaimer carefully before using the large language model provided in this repository. Your use of the model signifies your agreement to the following terms and conditions.
|