
This is yet another mergekit abomination.
This is probably more of a "dense" MoE than a sparse one.
Unfortunately, in most of my testing this model works well for a couple of sentences, then starts spouting gibberish. Don't waste your bandwidth.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 73.77 |
| AI2 Reasoning Challenge (25-shot) | 72.61 |
| HellaSwag (10-shot) | 89.57 |
| MMLU (5-shot) | 71.67 |
| TruthfulQA (0-shot) | 66.49 |
| Winogrande (5-shot) | 84.37 |
| GSM8k (5-shot) | 57.92 |
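
For anyone wondering where the "Avg." row comes from, here is a minimal sketch. It assumes the average is just the unweighted mean of the six benchmark scores in the table, which matches the reported number. The variable names are made up for illustration and are not part of any leaderboard tooling.

```python
# Scores copied from the results table; the dict itself is only illustrative.
scores = {
    "AI2 Reasoning Challenge": 72.61,
    "HellaSwag": 89.57,
    "MMLU": 71.67,
    "TruthfulQA": 66.49,
    "Winogrande": 84.37,
    "GSM8k": 57.92,
}

# Assumption: the leaderboard average is a plain unweighted mean.
average = sum(scores.values()) / len(scores)

print(f"{average:.2f}")  # prints 73.77
```

Note that GSM8k is the clear outlier and pulls the mean down, while none of these benchmarks measure the long-generation coherence problem described above.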