### DPO

We use the HelpSteer2 preference dataset, binarized into chosen-rejected pairs using the helpfulness score as recommended in the [HelpSteer2](https://arxiv.org/abs/2406.08673) paper. We translated the dataset into Finnish using Poro.
- **English**: [HelpSteer2](https://huggingface.co/datasets/nvidia/HelpSteer2)
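
The binarization step can be sketched as follows. The field names (`prompt`, `response`, `helpfulness`) match the HelpSteer2 schema, but the pairing logic is an illustrative assumption, not the exact script used:

```python
from collections import defaultdict

def binarize(rows):
    """Group HelpSteer2-style rows by prompt and emit chosen/rejected pairs.

    `rows` is assumed to be a list of dicts with "prompt", "response", and
    "helpfulness" keys (HelpSteer2 has two responses per prompt). Pairs with
    tied helpfulness scores are dropped, since neither response is preferred.
    """
    by_prompt = defaultdict(list)
    for row in rows:
        by_prompt[row["prompt"]].append(row)

    pairs = []
    for prompt, cands in by_prompt.items():
        if len(cands) != 2:
            continue  # expect exactly two candidate responses per prompt
        a, b = cands
        if a["helpfulness"] == b["helpfulness"]:
            continue  # tie carries no preference signal
        chosen, rejected = (a, b) if a["helpfulness"] > b["helpfulness"] else (b, a)
        pairs.append({
            "prompt": prompt,
            "chosen": chosen["response"],
            "rejected": rejected["response"],
        })
    return pairs
```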
## Recipes
We used 4 nodes (8 x AMD MI250X each) to obtain a global batch size of 128 for SFT and 64 for DPO. We used the [Alignment Handbook](https://github.com/huggingface/alignment-handbook/) codebase for finetuning.
**SFT**