pstroe
/

roberta-base-latin-cased

Inference Endpoints

Model card Files Files and versions Community

pstroe commited on Dec 6, 2021

Commit

cfcf8c2

·

1 Parent(s): 02170f9

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -21,6 +21,8 @@ I undertook the following preprocessing steps:
 The result is a corpus of ~390 million tokens.
 ### Contact
 For contact, reach out to Phillip Ströbel [via mail](mailto:pstroebel@cl.uzh.ch) or [via Twitter](https://twitter.com/CLingophil).

 The result is a corpus of ~390 million tokens.
+The dataset used to train this model is available [HERE](https://huggingface.co/datasets/pstroe/cc100-latin).
 ### Contact
 For contact, reach out to Phillip Ströbel [via mail](mailto:pstroebel@cl.uzh.ch) or [via Twitter](https://twitter.com/CLingophil).