Migrate model card from transformers-repo
Browse filesRead announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/kiri-ai/distiluse-base-multilingual-cased-et/README.md
README.md
ADDED
@@ -0,0 +1,28 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: et
|
3 |
+
---
|
4 |
+
## Model Description
|
5 |
+
|
6 |
+
This model is based off **Sentence-Transformer's** `distiluse-base-multilingual-cased` multilingual model that has been extended to understand sentence embeddings in Estonian.
|
7 |
+
|
8 |
+
## Sentence-Transformers
|
9 |
+
|
10 |
+
This model can be imported directly via the SentenceTransformers package as shown below:
|
11 |
+
|
12 |
+
```python
|
13 |
+
from sentence_transformers import SentenceTransformer
|
14 |
+
model = SentenceTransformer('kiri-ai/distiluse-base-multilingual-cased-et')
|
15 |
+
sentences = ['Here is a sample sentence','Another sample sentence']
|
16 |
+
embeddings = model.encode(sentences)
|
17 |
+
|
18 |
+
print("Sentence embeddings:")
|
19 |
+
print(embeddings)
|
20 |
+
```
|
21 |
+
|
22 |
+
## Fine-tuning
|
23 |
+
|
24 |
+
The fine-tuning and training processes were inspired by [sbert's](https://www.sbert.net/) multilingual training techniques which are available [here](https://www.sbert.net/examples/training/multilingual/README.html). The documentation shows and explains the step-by-step process of using parallel sentences to train models in a different language.
|
25 |
+
|
26 |
+
### Resources
|
27 |
+
|
28 |
+
The model was fine-tuned on English-Estonian parallel sentences taken from [OPUS](http://opus.nlpl.eu/) and [ParaCrawl](https://paracrawl.eu/).
|