metadata
license: other
library_name: transformers
tags:
- generated_from_trainer
base_model: Qwen/Qwen2.5-3B
license_name: qwen-research
license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
model-index:
- name: outputs/gelato-3b
results: []
Prompt Format: ChatML
This is an experimental which was heavily optimized for reasoning tasks and not meant for production-use.
GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 15.40 |
IFEval (0-Shot) | 31.54 |
BBH (3-Shot) | 19.53 |
MATH Lvl 5 (4-Shot) | 7.63 |
GPQA (0-shot) | 3.69 |
MuSR (0-shot) | 9.41 |
MMLU-PRO (5-shot) | 20.60 |