jarodrigues commited on
Commit
3518217
·
verified ·
1 Parent(s): 098f75b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -130,8 +130,8 @@ For testing, we reserved the translated datasets MRPC (similarity) and RTE (infe
130
  | Model | MRPC (F1) | RTE (F1) | COPA (F1) |
131
  |--------------------------|----------------|----------------|-----------|
132
  | **Gervásio 7B PTBR** | **0.7822** | **0.8321** | 0.2134 |
133
- | **LLaMA-2 (EN)** | 0.0369 | 0.0516 | 0.4867 |
134
- | **LLaMA-2 Chat (EN)** | 0.5432 | 0.3807 | **0.5493**|
135
  <br>
136
 
137
  For further testing our decoder, in addition to the testing data described above, we also reused some of the datasets that had been resorted for American Portuguese to test the state-of-the-art Sabiá model and that were originally developed with materials from Portuguese: ASSIN2 RTE (entailment) and ASSIN2 STS (similarity), BLUEX (question answering), ENEM 2022 (question answering) and FaQuAD (extractive question-answering).
 
130
  | Model | MRPC (F1) | RTE (F1) | COPA (F1) |
131
  |--------------------------|----------------|----------------|-----------|
132
  | **Gervásio 7B PTBR** | **0.7822** | **0.8321** | 0.2134 |
133
+ | **LLaMA-2 (English)** | 0.0369 | 0.0516 | 0.4867 |
134
+ | **LLaMA-2 Chat (English)** | 0.5432 | 0.3807 | **0.5493**|
135
  <br>
136
 
137
  For further testing our decoder, in addition to the testing data described above, we also reused some of the datasets that had been resorted for American Portuguese to test the state-of-the-art Sabiá model and that were originally developed with materials from Portuguese: ASSIN2 RTE (entailment) and ASSIN2 STS (similarity), BLUEX (question answering), ENEM 2022 (question answering) and FaQuAD (extractive question-answering).