fix typo readme
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ For the moment, only the tokenizer is available. The tokenizer is based on [Sent
|
|
20 |
|
21 |
Taking into account certain characteristics of the language, we chose that:
|
22 |
|
23 |
-
- the tokenizer passes in lower case all the texts because
|
24 |
- the sentence pieces can't go beyond the boundary of a word because the words are spaced by white spaces in the Bengali language.
|
25 |
|
26 |
## Intended uses & limitations
|
|
|
20 |
|
21 |
Taking into account certain characteristics of the language, we chose that:
|
22 |
|
23 |
+
- the tokenizer passes in lower case all the texts because the Bengali language is a monocameral scrip (no difference between capital and lower case);
|
24 |
- the sentence pieces can't go beyond the boundary of a word because the words are spaced by white spaces in the Bengali language.
|
25 |
|
26 |
## Intended uses & limitations
|