Jellon
/

Lyra4-Gutenberg-12B-6bpw

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

6bpw exl2 quant of: https://huggingface.co/nbeerbower/Lyra4-Gutenberg-12B

Lyra4-Gutenberg-12B

Sao10K/MN-12B-Lyra-v4 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

ORPO Finetuned using an RTX 3090 + 4060 Ti for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.63
IFEval (0-Shot)	22.12
BBH (3-Shot)	34.24
MATH Lvl 5 (4-Shot)	11.71
GPQA (0-shot)	9.17
MuSR (0-shot)	11.97
MMLU-PRO (5-shot)	28.57

Downloads last month: 11

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for Jellon/Lyra4-Gutenberg-12B-6bpw

Base model

Sao10K/MN-12B-Lyra-v4

Quantized

(27)

this model

Dataset used to train Jellon/Lyra4-Gutenberg-12B-6bpw

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

22.120
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

34.240
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

11.710
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

9.170
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

11.970
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

28.570

View on Papers With Code