File size: 620 Bytes
f7483ec
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
---
language:
- ru
- myv
tags:
- erzya
- mordovian
- fill-mask
- pretraining
- embeddings
- masked-lm
- feature-extraction
- sentence-similarity
license: cc-by-sa-4.0
datasets:
  - slone/myv_ru_2022
---

This is a version of [LaBSE-en-ru-myv-v1](https://huggingface.co/slone/LaBSE-en-ru-myv-v1), fine-tuned for about 150K steps 
on the [myv_ru_2022](https://huggingface.co/datasets/slone/myv_ru_2022) dataset, in 
[this notebook](https://colab.research.google.com/drive/1SxeraKZS6KYKobzVNNyIQZa4WnhpJ_nb?usp=sharing).

It demonstrates slighly better results than the v1 model, both on bitext mining and on the MLM task.