Cartinoe5930
commited on
Commit
•
14fdbc6
1
Parent(s):
d358026
Update README.md
Browse files
README.md
CHANGED
@@ -35,7 +35,7 @@ For more details, please check the GitHub Repository!
|
|
35 |
## Training Details
|
36 |
|
37 |
- **Hardward:** We utilized A100 80G for finetuning
|
38 |
-
- **Training factors:** The [
|
39 |
- **Training Details:** DPO training 1 epoch on [ko_Ultrafeedback_binarized](https://huggingface.co/datasets/maywell/ko_Ultrafeedback_binarized) dataset. [KoRAE-13b](https://huggingface.co/Cartinoe5930/KoRAE-13b) model was used.
|
40 |
|
41 |
For more details, please check the GitHub Repository!
|
|
|
35 |
## Training Details
|
36 |
|
37 |
- **Hardward:** We utilized A100 80G for finetuning
|
38 |
+
- **Training factors:** The [TRL DPOTrainer](https://huggingface.co/docs/trl/main/en/dpo_trainer) and [Huggingface PEFT](https://huggingface.co/docs/peft/index) were utilized for finetuning.
|
39 |
- **Training Details:** DPO training 1 epoch on [ko_Ultrafeedback_binarized](https://huggingface.co/datasets/maywell/ko_Ultrafeedback_binarized) dataset. [KoRAE-13b](https://huggingface.co/Cartinoe5930/KoRAE-13b) model was used.
|
40 |
|
41 |
For more details, please check the GitHub Repository!
|