Training dataset

by HappyLemon - opened Sep 28

Sep 28

Do I understand correctly, that SONAR was trained on short sentences? As in the paper it was said, that the same training data from NLLB (https://arxiv.org/pdf/2207.04672) was used (which is FLORES 200, right?) and the it consist of 3001 sentence translated to 200 languages with average length of 21 word?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment