Jordan Myers commited on
Commit
340a52c
1 Parent(s): 66724ca

description update

Browse files
Files changed (2) hide show
  1. README.md +3 -3
  2. app.py +34 -2
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
- title: English Tokipona Translator
3
- emoji: 📊
4
  colorFrom: indigo
5
- colorTo: blue
6
  sdk: gradio
7
  sdk_version: 3.15.0
8
  app_file: app.py
 
1
  ---
2
+ title: English / toki pona Translator
3
+ emoji: 💬
4
  colorFrom: indigo
5
+ colorTo: purple
6
  sdk: gradio
7
  sdk_version: 3.15.0
8
  app_file: app.py
app.py CHANGED
@@ -54,10 +54,42 @@ app = gr.Interface(
54
  ["Its not always perfect, but its pretty okay!", "English", "toki pona", 3],
55
  ["ilo pi ante toki ni li pona a!", "toki pona", "English", 3],
56
  ["kijetesantakalu li pona", "toki pona", "English", 3],
 
57
  ],
58
  cache_examples=False,
59
- title="A simple English / toki pona Neural Translation App",
60
- description="A simple English / toki pona Neural Translation App"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
61
  )
62
 
63
  app.launch()
 
54
  ["Its not always perfect, but its pretty okay!", "English", "toki pona", 3],
55
  ["ilo pi ante toki ni li pona a!", "toki pona", "English", 3],
56
  ["kijetesantakalu li pona", "toki pona", "English", 3],
57
+ ["mi li toki e toki pona", "toki pona", "toki pona", 3]
58
  ],
59
  cache_examples=False,
60
+ title="English / toki pona Translation",
61
+ description="""
62
+ # A simple English / toki pona Neural Machine Translation App!
63
+
64
+ ### toki a! 💬
65
+
66
+ This is a simple english to toki pona / toki pona to english neural machine translation app.
67
+
68
+ Input your text to translate, a source language and target language, and desired number of return sequences!
69
+
70
+ ### Grammaticality / Regularization
71
+ English -> English and/or toki pona -> toki pona will result in some form of regularization.
72
+
73
+ This can approximate grammaticality, but it isn't always the best.
74
+
75
+ For example, "mi li toki e toki pona" [src: toki pona, tgt: toki pona] will result in ['mi toki e toki pona.', 'mi toki pona.', 'mi toki e toki pona']
76
+ (Thus, the ungrammatical "li" is dropped)
77
+
78
+ ### Model and Data
79
+ This app utilizes a fine-tuned version of Facebook/Meta AI's M2M100 418M param model.
80
+
81
+ By leveraging the pretrained weights of the massively multilingual M2M100 model,
82
+ we can jumpstart our transfer learning to accomplish machine translation for toki pona!
83
+
84
+ The model was fine-tuned on the English/toki pona bitexts found at https://tatoeba.org/
85
+
86
+ ### This app is a work in progress and obviously not all translations will be perfect.
87
+ In addition to parameter quantity and the hyper-parameters used while training,
88
+ the *quality of data* found on Tatoeba directly influences the perfomance of projects like this!
89
+
90
+ If you wish to contribute, please simply add high quality and diverse translations to Tatoeba!
91
+
92
+ """
93
  )
94
 
95
  app.launch()