Update README.md
README.md CHANGED
@@ -47,8 +47,9 @@ The model **SauerkrautLM-Qwen-32b** is a **joint effort** between **VAGO solutio
 **Contact:** [VAGO solutions](https://vago-solutions.ai), [Hyperspace.ai](https://hyperspace.computer/)
 
 ### Training procedure:
-We trained this model for 2 epochs on 160k data samples with SFT.
-Afterwards we applied DPO for 1 epoch with 110k data.
+- We trained this model for 2 epochs on 160k data samples with SFT.
+- Afterwards we applied DPO for 1 epoch with 110k data samples.
+- A LaserRMT version is coming soon.
 
 **We taught this model German language skills.** As far as we know, it is the first Qwen 32B model with bilingual skills in German and English. Nevertheless, formulations that are not entirely correct may still occur (work in progress).
 
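As a quick sanity check on the figures in the training-procedure bullets above, the totals implied by the stated epoch and sample counts can be computed directly (only the epoch/sample numbers come from the diff; everything else here is illustrative):

```python
# Sample counts taken from the training-procedure bullets in the diff above.
SFT_EPOCHS, SFT_SAMPLES = 2, 160_000   # SFT: 2 epochs over 160k samples
DPO_EPOCHS, DPO_SAMPLES = 1, 110_000   # DPO: 1 epoch over 110k preference samples

# Total training examples seen in each stage.
sft_seen = SFT_EPOCHS * SFT_SAMPLES
dpo_seen = DPO_EPOCHS * DPO_SAMPLES

print(sft_seen, dpo_seen)  # 320000 110000
```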