BSC-LT/salamandraTA-7b-instruct

fdelucaf committed 15 days ago

Commit f5c46e3 · verified · 1 Parent(s): 87c6b12

Update README.md

Files changed (1)

README.md +2 -1

@@ -322,7 +322,8 @@ including all of the official European languages plus Catalan, Basque, Galician,
 It amounts to 6,574,251,526 parallel sentence pairs.
 This highly multilingual corpus is predominantly composed of data sourced from [OPUS](https://opus.nlpl.eu/),
-with additional data taken from the [NTEU project](https://nteu.eu/), [Aina Project](https://projecteaina.cat/), and other sources (see: [Data Sources#](#pre-data-sources) and [References below](#pre-references)).
+with additional data taken from the [NTEU project](https://nteu.eu/), [Aina Project](https://projecteaina.cat/), and other sources
+(see: [Data Sources](#pre-data-sources) and [References](#pre-references)).
 Where little parallel Catalan <-> xx data could be found, synthetic Catalan data was generated from the Spanish side of the collected Spanish <-> xx corpora using
 [Projecte Aina’s Spanish-Catalan model](https://huggingface.co/projecte-aina/aina-translator-es-ca). The final distribution of languages was as below: