1-800-BAD-CODE
commited on
Commit
•
4567479
1
Parent(s):
585fa7a
Update README.md
Browse files
README.md
CHANGED
@@ -142,6 +142,24 @@ Outputs:
|
|
142 |
|
143 |
</details>
|
144 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
145 |
|
146 |
# Training Data
|
147 |
For all languages except Catalan, this model was trained with ~10M lines of text per language from StatMT's [News Crawl](https://data.statmt.org/news-crawl/).
|
|
|
142 |
|
143 |
</details>
|
144 |
|
145 |
+
If you prefer your output to not be broken into separate sentences, you can disable sentence boundary detection
|
146 |
+
in the API call:
|
147 |
+
|
148 |
+
```python
|
149 |
+
input_texts: List[str] = [
|
150 |
+
"hola amigo cómo estás es un día lluvioso hoy",
|
151 |
+
]
|
152 |
+
results: List[str] = m.infer(input_texts, apply_sbd=False)
|
153 |
+
print(results[0])
|
154 |
+
```
|
155 |
+
|
156 |
+
Instead of a `List[List[str]]` (a list of output sentences for each input), we get a `List[str]` (one output
|
157 |
+
sentence per input):
|
158 |
+
|
159 |
+
```text
|
160 |
+
Hola, amigo. ¿Cómo estás? Es un día lluvioso hoy.
|
161 |
+
```
|
162 |
+
|
163 |
|
164 |
# Training Data
|
165 |
For all languages except Catalan, this model was trained with ~10M lines of text per language from StatMT's [News Crawl](https://data.statmt.org/news-crawl/).
|