Uploaded model

Developed by: Daemontatox
License: apache-2.0
Finetuned from model : Qwen/Qwen2-Math-7B-Instruct

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here! Summarized results can be found here!

Metric	Value (%)
Average	22.49
IFEval (0-Shot)	37.84
BBH (3-Shot)	28.47
MATH Lvl 5 (4-Shot)	33.91
GPQA (0-shot)	7.38
MuSR (0-shot)	9.37
MMLU-PRO (5-shot)	17.96

Downloads last month: 24

Safetensors

Model size

7.62B params

Tensor type

FP16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Daemontatox/RA2.0

Base model

Qwen/Qwen2-Math-7B-Instruct

Finetuned

(9)

this model

Quantizations

2 models

Evaluation results

averaged accuracy on IFEval (0-Shot)
Open LLM Leaderboard

37.840
normalized accuracy on BBH (3-Shot)
test set Open LLM Leaderboard

28.470
exact match on MATH Lvl 5 (4-Shot)
test set Open LLM Leaderboard

33.910
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

7.380
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

9.370
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

17.960

View on Papers With Code