Update README.md
README.md
CHANGED
@@ -12,11 +12,14 @@ tags:
 - torch
 ---
 
-# ROBERTA BASE (cased)
+# ROBERTA BASE (cased) trained on private Bulgarian-English parallel data
+This is a Multilingual Roberta model. It could be used for creating embeddings of Bulgarian sentences.
+
+Using the ideas from [Sentence-BERT](https://arxiv.org/abs/2004.09813), the training is based on the idea that a translated sentence should be mapped to the same location in the vector space as the original sentence.
 
 This model is cased: it does make a difference between bulgarian and Bulgarian.
 
-It was
+It was trained on private Bulgarian-English parallel data.
 
 Then, it was compressed via [progressive module replacing](https://arxiv.org/abs/2002.02925).
 
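
Reviewer note: the added README text says a translated sentence should map to the same point in vector space as the original. A minimal sketch of that objective follows, assuming it matches the teacher-student setup in the linked paper (arXiv 2004.09813), where a student model is trained with mean squared error against a teacher's embedding of the source sentence. The function names, the toy vectors, and the choice of English as the teacher-side language are illustrative assumptions, not taken from this repository.

```python
# Sketch only: the vectors below are made-up toy values, not real model outputs.

def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def distillation_loss(teacher_src, student_src, student_tgt):
    """Pull both student embeddings toward the teacher embedding of the source.

    teacher_src: teacher embedding of the source sentence (assumed English)
    student_src: student embedding of the same source sentence
    student_tgt: student embedding of its translation (assumed Bulgarian)
    """
    return mse(teacher_src, student_src) + mse(teacher_src, student_tgt)


teacher_en = [1.0, 0.0, 2.0]
student_en = [1.0, 0.0, 2.0]  # already matches the teacher
student_bg = [0.0, 0.0, 2.0]  # translation still differs in the first dimension

loss = distillation_loss(teacher_en, student_en, student_bg)
```

The loss reaches zero only when the student places both the source sentence and its translation exactly where the teacher places the source, which is the "same location in the vector space" property the README describes.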