cstr committed c29116a (parent: c8736e2): Update README.md

llama3-8b-spaetzle-v13 is a merge of the following models:
* [Azure99/blossom-v5-llama3-8b](https://huggingface.co/Azure99/blossom-v5-llama3-8b)
* [VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct](https://huggingface.co/VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct)

No change in llama3 prompt format template.
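Since the merge keeps the stock Llama-3 instruct template, a prompt can be assembled with plain string formatting, as in this minimal sketch (the special tokens below are the standard Llama-3 ones from the base tokenizer, nothing specific to this merge; the helper name is illustrative):

```python
# Minimal sketch of the stock Llama-3 instruct prompt format.
# The special tokens are the standard Llama-3 ones; build_llama3_prompt
# is a hypothetical helper, not an API of this model.
def build_llama3_prompt(system: str, user: str) -> str:
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = build_llama3_prompt("You are a helpful assistant.", "Hallo, wie geht's?")
```

In practice the same string is produced by the tokenizer's chat template (`tokenizer.apply_chat_template`), so the sketch is mainly useful for checking what the model actually sees.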
## Benchmarks

It should work reasonably well for German and English: it achieves 64.14(.10) on EQ-Bench v2_de with 170(171)/171 parseable, per the [q4km GGUF](https://huggingface.co/cstr/llama3-8b-spaetzle-v13-GGUF) (bpe fixed). The English EQ-Bench score (v2) is 75.59, with 171 parseable.
| Model                       | Average | ARC   | HellaSwag | MMLU  | TruthfulQA | Winogrande | GSM8K |
|-----------------------------|---------|-------|-----------|-------|------------|------------|-------|
| cstr/llama3-8b-spaetzle-v13 | 71.26   | 68.69 | 85.05     | 68.06 | 59.43      | 79.24      | 67.10 |
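The Average column follows the usual Open LLM Leaderboard convention of an unweighted mean over the six tasks, which can be checked in a couple of lines:

```python
# Sanity check: the Average column is the plain mean of the six
# benchmark scores reported in the table above.
scores = {
    "ARC": 68.69,
    "HellaSwag": 85.05,
    "MMLU": 68.06,
    "TruthfulQA": 59.43,
    "Winogrande": 79.24,
    "GSM8K": 67.10,
}
average = round(sum(scores.values()) / len(scores), 2)
print(average)  # 71.26
```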
 
## Sample output