Update README.md
Browse files
README.md
CHANGED
@@ -10,10 +10,22 @@ license: llama3
|
|
10 |
language:
|
11 |
- it
|
12 |
---
|
13 |
-
#
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
## Merge Details
|
18 |
### Merge Method
|
19 |
|
|
|
10 |
language:
|
11 |
- it
|
12 |
---
|
13 |
+
# Llama-3-8b-ita-slerp
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
17 |
+
I tried to merge two of the best Italian LLMs using Mergekit. The results are acceptable, but I could not improve on the best existing model.
|
18 |
+
|
19 |
+
## Evaluation
|
20 |
+
|
21 |
+
For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
|
22 |
+
|
23 |
+
Here's a breakdown of the performance metrics:
|
24 |
+
|
25 |
+
| Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|
26 |
+
|:----------------------------|:----------------------|:----------------|:---------------------|:--------|
|
27 |
+
| **Accuracy Normalized** | 0.6879 | 0.5714 | 0.5732 | 0.6109 |
|
28 |
+
|
29 |
## Merge Details
|
30 |
### Merge Method
|
31 |
|