| Tasks | Version | Filter | n-shot | Metric | | Value | | Stderr |
|---|---|---|---|---|---|---|---|---|
| arc_challenge | 1 | none | 0 | acc | ↑ | 0.2176 | ± | 0.0121 |
| | | none | 0 | acc_norm | ↑ | 0.2628 | ± | 0.0129 |
| arc_easy | 1 | none | 0 | acc | ↑ | 0.2538 | ± | 0.0089 |
| | | none | 0 | acc_norm | ↑ | 0.2475 | ± | 0.0089 |
| boolq | 2 | none | 0 | acc | ↑ | 0.4239 | ± | 0.0086 |
| hellaswag | 1 | none | 0 | acc | ↑ | 0.2566 | ± | 0.0044 |
| | | none | 0 | acc_norm | ↑ | 0.2606 | ± | 0.0044 |
| openbookqa | 1 | none | 0 | acc | ↑ | 0.1340 | ± | 0.0152 |
| | | none | 0 | acc_norm | ↑ | 0.2660 | ± | 0.0198 |
| piqa | 1 | none | 0 | acc | ↑ | 0.5381 | ± | 0.0116 |
| | | none | 0 | acc_norm | ↑ | 0.5190 | ± | 0.0117 |
| winogrande | 1 | none | 0 | acc | ↑ | 0.4704 | ± | 0.0140 |
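
The Stderr column can be read as the standard error of a mean over per-example 0/1 scores. The sketch below shows one way to reproduce such a value and turn it into an approximate 95% interval. It assumes the binomial formula with an `n - 1` denominator and an example count of 500 for openbookqa; neither the formula nor the count is stated in the table, and `binomial_stderr` is a hypothetical helper, not part of any evaluation library.

```python
import math


def binomial_stderr(acc: float, n: int) -> float:
    """Standard error of the mean of n 0/1 scores, using the sample (n - 1) variance."""
    return math.sqrt(acc * (1.0 - acc) / (n - 1))


# Assumption: 500 scored examples for openbookqa (not given in the table).
n_examples = 500
acc = 0.1340  # openbookqa acc from the table

stderr = binomial_stderr(acc, n_examples)

# Normal-approximation 95% interval around the reported accuracy.
low, high = acc - 1.96 * stderr, acc + 1.96 * stderr

print(f"stderr = {stderr:.4f}")
print(f"95% interval = [{low:.4f}, {high:.4f}]")
```

Under these assumptions the computed value comes out close to the 0.0152 reported for openbookqa `acc`. The same interval logic helps when comparing a score against a chance baseline, since a difference smaller than about two standard errors is hard to distinguish from noise.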