Update README.md
ITA-BENCH link update
README.md
CHANGED
@@ -168,7 +168,7 @@ For more details, please check [our tech report](https://nlp.uniroma1.it/minerva
 
 ## Model Evaluation
 
-For Minerva's evaluation process, we utilized ITA-Bench, a new evaluation suite to test the capabilities of Italian-speaking models.
+For Minerva's evaluation process, we utilized [ITA-Bench](https://huggingface.co/collections/sapienzanlp/ita-bench-italian-benchmarks-for-llms-66337ca59e6df7d7d4933896), a new evaluation suite to test the capabilities of Italian-speaking models.
 ITA-Bench is a collection of 18 benchmarks that assess the performance of language models on various tasks, including scientific knowledge,
 commonsense reasoning, and mathematical problem-solving.
 