# ENERGY-DRINK-LOVE/eeve_dpo-v3

### Our Team

* Jingyeom Kim
* Youjin Chung

## Model

### Base Model

* [yanolja/EEVE-Korean-Instruct-10.8B-v1.0](https://huggingface.co/yanolja/EEVE-Korean-Instruct-10.8B-v1.0)
### Hardware and Software

* Hardware: 8 × A100 GPUs for training our model
* Software: DeepSpeed library & Hugging Face TRL Trainer
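The README does not include the DeepSpeed configuration used for training. For reference, a minimal ZeRO stage 2 config of the kind commonly passed to a Hugging Face trainer might look like the sketch below; every value here is an illustrative assumption, not the team's actual setting.

```json
{
  "train_micro_batch_size_per_gpu": "auto",
  "gradient_accumulation_steps": "auto",
  "bf16": { "enabled": "auto" },
  "zero_optimization": {
    "stage": 2,
    "overlap_comm": true,
    "contiguous_gradients": true
  }
}
```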
### Dataset

* DPO dataset
  * In-house DPO dataset (built using AI-Hub data)
  * English datasets such as OpenOrca DPO, translated into Korean (using our own model)
### Training Method

* [DPO](https://arxiv.org/abs/2305.18290)
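For readers unfamiliar with DPO: it optimizes the policy directly on preference pairs, without a separate reward model, by penalizing the policy when its log-ratio over the reference model favors the rejected response. A minimal sketch of the per-example loss follows; the log-probabilities and `beta = 0.1` are illustrative assumptions, not values taken from this model's training run.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log sigmoid(beta * difference of the
    policy-vs-reference log-ratios for chosen vs. rejected responses)."""
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_logratio - rejected_logratio)
    # Numerically stable -log(sigmoid(logits)) = log(1 + exp(-logits))
    return math.log1p(math.exp(-logits)) if logits > -30 else -logits

# Equal log-ratios give the maximum loss log(2); once the policy favors
# the chosen response more than the reference does, the loss shrinks.
print(dpo_loss(-10.0, -12.0, -10.0, -12.0))  # log(2) ≈ 0.693
print(dpo_loss(-10.0, -14.0, -11.0, -13.0))  # ≈ 0.598
```

In practice the per-token log-probabilities are summed over each response and the loss is averaged over a batch, which is what the TRL trainer mentioned above handles.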
## Benchmark