Update README.md
Browse files
README.md
CHANGED
@@ -22,7 +22,7 @@ pipeline_tag: text-generation
|
|
22 |
<div style="margin:auto; text-align:center">
|
23 |
<h1 style="margin-bottom: 0">Llama 3 8B - Dutch</h1>
|
24 |
<em>A conversational model for Dutch, based on Llama 3 8B</em>
|
25 |
-
<em><a href="https://huggingface.co/spaces/ReBatch/Llama-3-Dutch">Try chatting with the model!</a></em>
|
26 |
</div>
|
27 |
|
28 |
This model is a [QLORA](https://huggingface.co/blog/4bit-transformers-bitsandbytes) and [ORPO](https://huggingface.co/docs/trl/main/en/orpo_trainer) fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the synthetic feedback dataset [BramVanroy/ultra_feedback_dutch](https://huggingface.co/datasets/BramVanroy/ultra_feedback_dutch)
|
|
|
22 |
<div style="margin:auto; text-align:center">
|
23 |
<h1 style="margin-bottom: 0">Llama 3 8B - Dutch</h1>
|
24 |
<em>A conversational model for Dutch, based on Llama 3 8B</em>
|
25 |
+
<p><em><a href="https://huggingface.co/spaces/ReBatch/Llama-3-Dutch">Try chatting with the model!</a></em></p>
|
26 |
</div>
|
27 |
|
28 |
This model is a [QLORA](https://huggingface.co/blog/4bit-transformers-bitsandbytes) and [ORPO](https://huggingface.co/docs/trl/main/en/orpo_trainer) fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the synthetic feedback dataset [BramVanroy/ultra_feedback_dutch](https://huggingface.co/datasets/BramVanroy/ultra_feedback_dutch)
|