BramVanroy commited on
Commit
8e395ff
1 Parent(s): b960df1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -24,15 +24,18 @@ See the original [meta-llama/Llama-2-13b-hf](https://huggingface.co/meta-llama/L
24
 
25
  ## Model description
26
 
27
- I could not get Llama 2 13B to produce much Dutch, even though the description paper indicates that it was trained on a (small) portion of Dutch data. I therefore
28
- continue training the original Llama 2 13B checkpoint on Dutch data [in regular CLM](https://huggingface.co/BramVanroy/llama2-13b-ft-mc4_nl_cleaned_tiny). In a second
29
- step I finetuned that model on a collection of synthetic (translated) instruction and chat datasets that I have [collected](https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets). See their pages for licensing, usage, creation, and citation information.
 
30
 
31
  - https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch
32
  - https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch-baize
33
  - https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch
34
  - https://huggingface.co/datasets/BramVanroy/quora-chat-dutch
35
 
 
 
36
 
37
  ## Intended uses & limitations
38
 
 
24
 
25
  ## Model description
26
 
27
+ I could not get the original Llama 2 13B to produce much Dutch, even though the description paper indicates that it was trained on a (small) portion of Dutch data. I therefore
28
+ continued training the original Llama 2 13B checkpoint on Dutch data [in regular CLM](https://huggingface.co/BramVanroy/llama2-13b-ft-mc4_nl_cleaned_tiny). In a second
29
+ step I finetuned that model on a collection of synthetic (translated) instruction and chat datasets that I have [collected](https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets).
30
+ See their pages for licensing, usage, creation, and citation information.
31
 
32
  - https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch
33
  - https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch-baize
34
  - https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch
35
  - https://huggingface.co/datasets/BramVanroy/quora-chat-dutch
36
 
37
+ This model is the result of that process. While not perfect by any means, it can perform reasonably well in Dutch depending on the prompts. It is also decent at helping with programming tasks.
38
+
39
 
40
  ## Intended uses & limitations
41