nbeerbower
/

Hermes2-Gutenberg2-Mistral-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Hermes2-Gutenberg2-Mistral-7B

NousResearch/Hermes-2-Pro-Mistral-7B finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.

Method

ORPO tuned with 2x RTX 3090 for 3 epochs.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.35
IFEval (0-Shot)	37.21
BBH (3-Shot)	28.91
MATH Lvl 5 (4-Shot)	5.66
GPQA (0-shot)	5.26
MuSR (0-shot)	16.92
MMLU-PRO (5-shot)	22.14

Downloads last month: 30

Safetensors

Model size

7.24B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for nbeerbower/Hermes2-Gutenberg2-Mistral-7B

Base model

mistralai/Mistral-7B-v0.1

Finetuned

NousResearch/Hermes-2-Pro-Mistral-7B

Finetuned

(13)

this model

Finetunes

1 model

Quantizations

Datasets used to train nbeerbower/Hermes2-Gutenberg2-Mistral-7B

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

37.210
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

28.910
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

5.660
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

5.260
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

16.920
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

22.140

View on Papers With Code