Non-English language capabilities

by oliviermills - opened Apr 18, 2024

Apr 18, 2024

Curious to know how good it performs in non English and non Latin base scripts. As a base model for multilingual fine-tuning.

Apr 18, 2024

It would be nice to report a list of languages included in the training data and the amount of tokens in millions.

Apr 18, 2024

It would also be interesting to run this benchmark: https://huggingface.co/datasets/caro-holt/MultiQ

Measure accuracy in different languages + fidelity (replying in the same language as the query).

Apr 18, 2024

for italian:
Model Arc-c HellaS MMUL
LLama3 8b instruct 44.3 59.9 55.7

Apr 18, 2024

for italian:
Model Arc-c HellaS MMUL
LLama3 8b instruct 44.3 59.9 55.7

@FinancialSupport How did you test so I can test for other languages?

Apr 18, 2024

Apr 22, 2024

On a very quick test on private german and french data it beats ybelkada/Mixtral-8x7B-Instruct-v0.1-AWQ

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment