BramVanroy
commited on
Commit
•
d5db176
1
Parent(s):
00da1c0
Update README.md
Browse files
README.md
CHANGED
@@ -24,6 +24,23 @@ datasets:
|
|
24 |
|
25 |
This is a GGUF version of [BramVanroy/GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra), a powerful Dutch chatbot, which ultimately is Mistral-based model, further pretrained on Dutch and additionally treated with supervised-finetuning and DPO alignment. For more information on the model, data, licensing, usage, see the main model's README.
|
26 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
Available quantization types and expected performance differences compared to base `f16`, higher perplexity=worse (from llama.cpp):
|
28 |
|
29 |
```
|
|
|
24 |
|
25 |
This is a GGUF version of [BramVanroy/GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra), a powerful Dutch chatbot, which ultimately is Mistral-based model, further pretrained on Dutch and additionally treated with supervised-finetuning and DPO alignment. For more information on the model, data, licensing, usage, see the main model's README.
|
26 |
|
27 |
+
|
28 |
+
## Citation
|
29 |
+
|
30 |
+
If you use GEITje 7B Ultra (SFT) or any of its derivatives or quantizations, place cite the following paper:
|
31 |
+
|
32 |
+
```bibtex
|
33 |
+
@misc{vanroy2024geitje7bultraconversational,
|
34 |
+
title={GEITje 7B Ultra: A Conversational Model for Dutch},
|
35 |
+
author={Bram Vanroy},
|
36 |
+
year={2024},
|
37 |
+
eprint={2412.04092},
|
38 |
+
archivePrefix={arXiv},
|
39 |
+
primaryClass={cs.CL},
|
40 |
+
url={https://arxiv.org/abs/2412.04092},
|
41 |
+
}
|
42 |
+
```
|
43 |
+
|
44 |
Available quantization types and expected performance differences compared to base `f16`, higher perplexity=worse (from llama.cpp):
|
45 |
|
46 |
```
|