Create a model card
Browse files
README.md
ADDED
@@ -0,0 +1,55 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: ["ru", "en"]
|
3 |
+
tags:
|
4 |
+
- russian
|
5 |
+
license: mit
|
6 |
+
widget:
|
7 |
+
- text: "translate ru to en: Интересный момент. Модель не видела русских диалогов, но может их понимать"
|
8 |
+
---
|
9 |
+
This pruned model of mt5-base [google/mt5-base](https://huggingface.co/google/mt5-base) with only some Rusian and English embeddings left.
|
10 |
+
|
11 |
+
|
12 |
+
The model has been fine-tuned for several tasks:
|
13 |
+
* translation (opus100 dataset)
|
14 |
+
* dialog (daily dialog dataset)
|
15 |
+
|
16 |
+
|
17 |
+
How to use:
|
18 |
+
```
|
19 |
+
# !pip install transformers sentencepiece
|
20 |
+
|
21 |
+
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer, T5Tokenizer
|
22 |
+
import torch
|
23 |
+
|
24 |
+
model_name = 'artemnech/enrut5-base'
|
25 |
+
|
26 |
+
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
|
27 |
+
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
28 |
+
|
29 |
+
def generate(text, **kwargs):
|
30 |
+
model.eval()
|
31 |
+
inputs = tokenizer(text, return_tensors='pt')
|
32 |
+
with torch.no_grad():
|
33 |
+
hypotheses = model.generate(**inputs, **kwargs)
|
34 |
+
return tokenizer.decode(hypotheses[0], skip_special_tokens=True)
|
35 |
+
|
36 |
+
print(generate('translate ru to en: Интересный момент. Модель не видела русских диалогов, но может их понимать', num_beams=4,))
|
37 |
+
# The Model didn't see Russian dialogues, but can understand them.
|
38 |
+
|
39 |
+
print(generate("translate en to ru: The Model didn't see Russian dialogues, but can understand them.", num_beams=4,))
|
40 |
+
# Модель не видела русских диалога, но может понимать их.
|
41 |
+
|
42 |
+
print(generate('dialog: user1>>: Hello', num_beams=2))
|
43 |
+
# Hi
|
44 |
+
|
45 |
+
print(generate('dialog: user1>>: Hello user2>>: Hi user1>>: Would you like to drink something?', num_beams=2))
|
46 |
+
# I'd like to drink a cup of coffee.
|
47 |
+
|
48 |
+
#An interesting point. The model has not seen Russian dialogues, but can understand them
|
49 |
+
|
50 |
+
print(generate('dialog: user1>>: Привет'))
|
51 |
+
# Hi
|
52 |
+
|
53 |
+
print(generate('dialog: user1>>: Привет user2>>: Hi user1>>: Хочешь выпить что-нибудь?', num_beams=2))
|
54 |
+
# I'd like to have a cup of coffee.
|
55 |
+
```
|