pdelobelle committed 224f136 (parent: 2602fea)
Update README.md

README.md: CHANGED
```diff
@@ -23,6 +23,7 @@ Our tweety-7b-dutch model has an Apache 2.0 license, encouraging applications in
 - **Tokenizer:** Dutch, 50k tokens ([yhavinga/gpt-neo-1.3B-dutch](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch))
 - **Pre-training data:** Scraped Dutch ([yhavinga/mc4_nl_cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned))
 - **Context window**: 8192 tokens
+- **Training data**: 8.5B tokens
 - **Developed by:** KU Leuven and UGent
 - **Funded by:** KU Leuven BOF, VSC (Flemish Supercomputer Center), [Vlaams AI-onderzoeksprogramma](https://www.flandersairesearch.be/nl)
 - **Model type:** Foundation model
@@ -35,7 +36,9 @@ As a base model, tweety-7b-dutch is primed for direct applications across text g
 ## Technical Specifications
 
 ### Compute Infrastructure
-
+Training used Nvidia H100 and A100 GPUs. Inference is accessible on lower-end hardware: essentially any GPU capable of running Mistral models.
 
-
+### Model Weights
 
+- This model was trained in bfloat16.
+- [GGUF weights](https://huggingface.co/BramVanroy/tweety-7b-dutch-v24a-GGUF) are released by Bram Vanroy.
```
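The updated card documents bfloat16 training but includes no loading snippet. A minimal inference sketch with the standard transformers API follows; the repository id Tweeties/tweety-7b-dutch-v24a is an assumption inferred from the GGUF mirror's name, so check the model card for the exact id.

```python
# A minimal inference sketch, not an official snippet from the card.
# Assumptions: the base weights live at Tweeties/tweety-7b-dutch-v24a
# (inferred from the GGUF mirror's name), and transformers plus
# accelerate are installed with a GPU that fits a 7B model in bfloat16.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Tweeties/tweety-7b-dutch-v24a"  # assumed repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)

# The card states the model was trained in bfloat16, so load the weights
# in that precision; device_map="auto" places layers on available GPUs.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "Vlaanderen en Nederland delen"  # illustrative Dutch prompt
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```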
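For the GGUF route mentioned in the new Model Weights bullet (CPU or low-VRAM inference), a sketch using llama-cpp-python is below; the quantization filename is a hypothetical placeholder, so list the files in BramVanroy/tweety-7b-dutch-v24a-GGUF to pick a real one.

```python
# Sketch for running the community GGUF conversion with llama-cpp-python.
# The filename below is hypothetical; check the repository's file list
# for the actual quantizations published by Bram Vanroy.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

gguf_path = hf_hub_download(
    repo_id="BramVanroy/tweety-7b-dutch-v24a-GGUF",
    filename="tweety-7b-dutch-v24a-Q5_K_M.gguf",  # hypothetical filename
)

llm = Llama(model_path=gguf_path, n_ctx=8192)  # match the training context
out = llm("Vlaanderen en Nederland delen", max_tokens=50)
print(out["choices"][0]["text"])
```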