license: mit | |
datasets: | |
- mor40/chitanka_raw_document | |
language: | |
- bg | |
metrics: | |
- perplexity | |
library_name: transformers | |
pipeline_tag: fill-mask | |
# Model Card for Model ID | |
A LLM trained from scratch on bulgarian data. | |
The model and the model's tokenizer are trained _from scratch_ on bulgarian data from the **chitanka** dataset. | |
#### Metrics | |
Perprelixty - 6.75 | |