martimfasantos commited on
Commit
4d1b5f4
·
verified ·
1 Parent(s): 01fee44

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -5
README.md CHANGED
@@ -1,20 +1,24 @@
1
  ---
2
  license: gemma
3
- base_model: martimfasantos/sft-xcomet_xl_xxl-chosen-10lp-shuff-full-gemma-it
 
4
  tags:
5
  - xcomet_xl_xxl
6
  - generated_from_trainer
7
  model-index:
8
- - name: dpo-xcomet_xl_xxl-10p-shuff-1e-7-full-from-sft-gemma-it
9
  results: []
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- # dpo-xcomet_xl_xxl-10p-shuff-1e-7-full-from-sft-gemma-it
16
 
17
- This model is a fine-tuned version of [martimfasantos/sft-xcomet_xl_xxl-chosen-10lp-shuff-full-gemma-it](https://huggingface.co/martimfasantos/sft-xcomet_xl_xxl-chosen-10lp-shuff-full-gemma-it) on the Unbabel/TowerAligned-v0.1 dataset.
18
 
19
  ## Model description
20
 
@@ -56,4 +60,4 @@ The following hyperparameters were used during training:
56
  - Transformers 4.43.3
57
  - Pytorch 2.3.1+cu121
58
  - Datasets 2.20.0
59
- - Tokenizers 0.19.1
 
1
  ---
2
  license: gemma
3
+ base_model:
4
+ - martimfasantos/gemma-2-2b-it-MT-SFT
5
  tags:
6
  - xcomet_xl_xxl
7
  - generated_from_trainer
8
  model-index:
9
+ - name: gemma-2-2b-it-MT-DPO
10
  results: []
11
+ datasets:
12
+ - sardinelab/MT-pref
13
+ pipeline_tag: translation
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
+ # gemma-2-2b-it-MT-DPO
20
 
21
+ This model is a fine-tuned version of [martimfasantos/gemma-2-2b-it-MT-SFT](https://huggingface.co/martimfasantos/gemma-2-2b-it-MT-SFT) on the sardinelab/MT-pref dataset.
22
 
23
  ## Model description
24
 
 
60
  - Transformers 4.43.3
61
  - Pytorch 2.3.1+cu121
62
  - Datasets 2.20.0
63
+ - Tokenizers 0.19.1