1-800-BAD-CODE
/

punctuation_fullstop_truecase_romance

Text2Text Generation

Model card Files Files and versions Community

1-800-BAD-CODE commited on Apr 8, 2023

Commit

4567479

·

1 Parent(s): 585fa7a

Update README.md

Files changed (1) hide show

README.md +18 -0

README.md CHANGED Viewed

@@ -142,6 +142,24 @@ Outputs:
 </details>
 # Training Data
 For all languages except Catalan, this model was trained with ~10M lines of text per language from StatMT's [News Crawl](https://data.statmt.org/news-crawl/).

 </details>
+If you prefer your output to not be broken into separate sentences, you can disable sentence boundary detection
+in the API call:
+```python
+input_texts: List[str] = [
+    "hola amigo cómo estás es un día lluvioso hoy",
+]
+results: List[str] = m.infer(input_texts, apply_sbd=False)
+print(results[0])
+```
+Instead of a `List[List[str]]` (a list of output sentences for each input), we get a `List[str]` (one output
+sentence per input):
+```text
+Hola, amigo. ¿Cómo estás? Es un día lluvioso hoy.
+```
 # Training Data
 For all languages except Catalan, this model was trained with ~10M lines of text per language from StatMT's [News Crawl](https://data.statmt.org/news-crawl/).