pdelobelle
commited on
Commit
•
2602fea
1
Parent(s):
59898b9
Update README.md
Browse files
README.md
CHANGED
@@ -20,10 +20,12 @@ tweety-7b-dutch is a foundation model with a focus on the Dutch language, incorp
|
|
20 |
|
21 |
Our tweety-7b-dutch model has an Apache 2.0 license, encouraging applications in research, content creation, and language analysis.
|
22 |
|
|
|
|
|
|
|
23 |
- **Developed by:** KU Leuven and UGent
|
24 |
- **Funded by:** KU Leuven BOF, VSC (Flemish Supercomputer Center), [Vlaams AI-onderzoeksprogramma](https://www.flandersairesearch.be/nl)
|
25 |
- **Model type:** Foundation model
|
26 |
-
- **Language(s) (NLP):** Dutch
|
27 |
- **License:** Apache 2.0
|
28 |
|
29 |
## Uses
|
|
|
20 |
|
21 |
Our tweety-7b-dutch model has an Apache 2.0 license, encouraging applications in research, content creation, and language analysis.
|
22 |
|
23 |
+
- **Tokenizer:** Dutch, 50k tokens ([yhavinga/gpt-neo-1.3B-dutch](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch))
|
24 |
+
- **Pre-training data:** Scraped Dutch ([yhavinga/mc4_nl_cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned))
|
25 |
+
- **Context window**: 8196 tokens
|
26 |
- **Developed by:** KU Leuven and UGent
|
27 |
- **Funded by:** KU Leuven BOF, VSC (Flemish Supercomputer Center), [Vlaams AI-onderzoeksprogramma](https://www.flandersairesearch.be/nl)
|
28 |
- **Model type:** Foundation model
|
|
|
29 |
- **License:** Apache 2.0
|
30 |
|
31 |
## Uses
|