anakin87/Llama-3-8b-ita-ties



anakin87 committed on May 24

Commit 1e83c67 • 1 Parent(s): c4b98ef

Update README.md

Files changed (1)

README.md +13 -1


@@ -11,10 +11,22 @@ license: llama3
 language:
 - it
 ---
-# merge
+# Llama-3-8b-ita-ties
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+I tried to merge two of the best Italian LLMs using Mergekit. The results are acceptable, but I could not improve on the best existing model.
+## Evaluation
+For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
+Here's a breakdown of the performance metrics:
+| Metric                      | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
+|:----------------------------|:----------------------|:----------------|:---------------------|:--------|
+| **Accuracy Normalized**     | 0.6621               | 0.5535        | 0.5749              | 0.5968  |
 ## Merge Details
 ### Merge Method