Adding Evaluation Results

#6
Files changed (1)
  1. README.md +18 -4
README.md CHANGED
@@ -1,12 +1,9 @@
 ---
 license: llama3
-base_model: meta-llama/Meta-Llama-3-70B
 tags:
 - generated_from_trainer
 - axolotl
-model-index:
-- name: out
-  results: []
+base_model: meta-llama/Meta-Llama-3-70B
 datasets:
 - cognitivecomputations/Dolphin-2.9
 - teknium/OpenHermes-2.5
@@ -16,6 +13,9 @@ datasets:
 - microsoft/orca-math-word-problems-200k
 - Locutusque/function-calling-chatml
 - internlm/Agent-FLAN
+model-index:
+- name: out
+  results: []
 ---
 
 # Dolphin 2.9.1 Llama 3 70b 🐬
@@ -510,3 +510,17 @@ The following hyperparameters were used during training:
 - Pytorch 2.2.2+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1
+
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b)
+
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |23.41|
+|IFEval (0-Shot)    |37.60|
+|BBH (3-Shot)       |31.10|
+|MATH Lvl 5 (4-Shot)| 5.44|
+|GPQA (0-shot)      | 7.83|
+|MuSR (0-shot)      |23.70|
+|MMLU-PRO (5-shot)  |34.78|
+
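Once a revision like this is merged, the edited YAML front matter can be read back programmatically to confirm the `base_model` and `model-index` entries landed as intended. A minimal sketch, assuming the repo id `cognitivecomputations/dolphin-2.9.1-llama-3-70b` (taken from the results link in the diff above):

```python
import yaml
from huggingface_hub import hf_hub_download

# Download the README and parse its YAML front matter directly.
# Repo id is an assumption inferred from the detailed-results link above.
path = hf_hub_download(
    repo_id="cognitivecomputations/dolphin-2.9.1-llama-3-70b",
    filename="README.md",
)
with open(path, encoding="utf-8") as f:
    text = f.read()

# The metadata sits between the first two `---` fences at the top of the file.
front_matter = text.split("---")[1]
meta = yaml.safe_load(front_matter)

print(meta["base_model"])   # meta-llama/Meta-Llama-3-70B
print(meta["model-index"])  # [{'name': 'out', 'results': []}]
```

Note that `results` is left as an empty list here; the human-readable scores added by this PR live in the markdown table at the end of the card, not in the structured metadata.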