Update README.md
Browse files
README.md
CHANGED
@@ -11,10 +11,22 @@ license: llama3
|
|
11 |
language:
|
12 |
- it
|
13 |
---
|
14 |
-
#
|
15 |
|
16 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
## Merge Details
|
19 |
### Merge Method
|
20 |
|
|
|
11 |
language:
|
12 |
- it
|
13 |
---
|
14 |
+
# Llama-3-8b-ita-ties
|
15 |
|
16 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
17 |
|
18 |
+
I tried to merge two of the best Italian LLMs using Mergekit. The results are acceptable, but I could not improve on the best existing model.
|
19 |
+
|
20 |
+
## Evaluation
|
21 |
+
|
22 |
+
For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
|
23 |
+
|
24 |
+
Here's a breakdown of the performance metrics:
|
25 |
+
|
26 |
+
| Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|
27 |
+
|:----------------------------|:----------------------|:----------------|:---------------------|:--------|
|
28 |
+
| **Accuracy Normalized** | 0.6621 | 0.5535 | 0.5749 | 0.5968 |
|
29 |
+
|
30 |
## Merge Details
|
31 |
### Merge Method
|
32 |
|