julien-c HF staff commited on
Commit
4cc7064
·
1 Parent(s): 1346415

Migrate model card from transformers-repo

Browse files

Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/kiri-ai/distiluse-base-multilingual-cased-et/README.md

Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: et
3
+ ---
4
+ ## Model Description
5
+
6
+ This model is based off **Sentence-Transformer's** `distiluse-base-multilingual-cased` multilingual model that has been extended to understand sentence embeddings in Estonian.
7
+
8
+ ## Sentence-Transformers
9
+
10
+ This model can be imported directly via the SentenceTransformers package as shown below:
11
+
12
+ ```python
13
+ from sentence_transformers import SentenceTransformer
14
+ model = SentenceTransformer('kiri-ai/distiluse-base-multilingual-cased-et')
15
+ sentences = ['Here is a sample sentence','Another sample sentence']
16
+ embeddings = model.encode(sentences)
17
+
18
+ print("Sentence embeddings:")
19
+ print(embeddings)
20
+ ```
21
+
22
+ ## Fine-tuning
23
+
24
+ The fine-tuning and training processes were inspired by [sbert's](https://www.sbert.net/) multilingual training techniques which are available [here](https://www.sbert.net/examples/training/multilingual/README.html). The documentation shows and explains the step-by-step process of using parallel sentences to train models in a different language.
25
+
26
+ ### Resources
27
+
28
+ The model was fine-tuned on English-Estonian parallel sentences taken from [OPUS](http://opus.nlpl.eu/) and [ParaCrawl](https://paracrawl.eu/).