FinancialSupport committed on
Commit f61de92
1 Parent(s): 860a6f7

Update README.md

added evaluation :)

Files changed (1)
  1. README.md +9 -0
README.md CHANGED
@@ -36,6 +36,15 @@ This is the model card of a 🤗 transformers model that has been pushed on the
  - **License:** cc-by-nc-sa-4.0
  - **Finetuned from model:**: [Minerva-3B-base-v1.0](https://huggingface.co/sapienzanlp/Minerva-3B-base-v1.0), developed by [Sapienza NLP](https://nlp.uniroma1.it) in collaboration with [Future Artificial Intelligence Research (FAIR)](https://fondazione-fair.it/) and [CINECA](https://www.cineca.it/)
 
+ ## Evaluation
+
+ For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
+
+ Here's a breakdown of the performance metrics:
+ | Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
+ |:----------------------------|:----------------------|:----------------|:---------------------|:--------|
+ | **Accuracy Normalized** | 0.5187 | 0.3045 | 0.2612 | 0.361 |
+
 
  ## Uses
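
The Average column in the added table appears to be the unweighted mean of the three benchmark scores. The commit does not say how it was computed, so the sketch below only checks that reading of the arithmetic. The dictionary and variable names are illustrative and not part of the repository.

```python
# Scores copied from the table added in this commit.
scores = {
    "hellaswag_it acc_norm": 0.5187,
    "arc_it acc_norm": 0.3045,
    "m_mmlu_it 5-shot acc": 0.2612,
}

# Assumption: "Average" is the plain (unweighted) mean of the three scores.
average = sum(scores.values()) / len(scores)

# Round to three decimals to match the precision shown in the table.
print(f"Average: {average:.3f}")
```

Under that assumption the result matches the 0.361 reported in the table.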