BramVanroy
/

Llama-2-13b-chat-dutch

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

BramVanroy commited on Aug 14, 2023

Commit

8e395ff

•

1 Parent(s): b960df1

Update README.md

Files changed (1) hide show

README.md +6 -3

README.md CHANGED Viewed

@@ -24,15 +24,18 @@ See the original [meta-llama/Llama-2-13b-hf](https://huggingface.co/meta-llama/L
 ## Model description
-I could not get Llama 2 13B to produce much Dutch, even though the description paper indicates that it was trained on a (small) portion of Dutch data. I therefore
-continue training the original Llama 2 13B checkpoint on Dutch data [in regular CLM](https://huggingface.co/BramVanroy/llama2-13b-ft-mc4_nl_cleaned_tiny). In a second
-step I finetuned that model on a collection of synthetic (translated) instruction and chat datasets that I have [collected](https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets). See their pages for licensing, usage, creation, and citation information.
 - https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch
 - https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch-baize
 - https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch
 - https://huggingface.co/datasets/BramVanroy/quora-chat-dutch
 ## Intended uses & limitations

 ## Model description
+I could not get the original Llama 2 13B to produce much Dutch, even though the description paper indicates that it was trained on a (small) portion of Dutch data. I therefore
+continued training the original Llama 2 13B checkpoint on Dutch data [in regular CLM](https://huggingface.co/BramVanroy/llama2-13b-ft-mc4_nl_cleaned_tiny). In a second
+step I finetuned that model on a collection of synthetic (translated) instruction and chat datasets that I have [collected](https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets).
+See their pages for licensing, usage, creation, and citation information.
 - https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch
 - https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch-baize
 - https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch
 - https://huggingface.co/datasets/BramVanroy/quora-chat-dutch
+This model is the result of that process. While not perfect by any means, it can perform reasonably well in Dutch depending on the prompts. It is also decent at helping with programming tasks.
 ## Intended uses & limitations