BramVanroy/GEITje-7B-ultra-GGUF

BramVanroy committed about 1 month ago

Commit d5db176 • 1 Parent(s): 00da1c0

Update README.md

Files changed (1)

README.md +17 -0

README.md CHANGED

@@ -24,6 +24,23 @@ datasets:
 This is a GGUF version of [BramVanroy/GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra), a powerful Dutch chatbot that is ultimately a Mistral-based model, further pretrained on Dutch and additionally treated with supervised fine-tuning and DPO alignment. For more information on the model, data, licensing, and usage, see the main model's README.
 Available quantization types and expected performance differences compared to the base `f16`; higher perplexity is worse (from llama.cpp):
 ```

 This is a GGUF version of [BramVanroy/GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra), a powerful Dutch chatbot that is ultimately a Mistral-based model, further pretrained on Dutch and additionally treated with supervised fine-tuning and DPO alignment. For more information on the model, data, licensing, and usage, see the main model's README.
+## Citation
+If you use GEITje 7B Ultra (SFT) or any of its derivatives or quantizations, please cite the following paper:
+```bibtex
+@misc{vanroy2024geitje7bultraconversational,
+      title={GEITje 7B Ultra: A Conversational Model for Dutch},
+      author={Bram Vanroy},
+      year={2024},
+      eprint={2412.04092},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2412.04092},
+}
+```
 Available quantization types and expected performance differences compared to the base `f16`; higher perplexity is worse (from llama.cpp):
 ```