nisten
/

shqiponja-15b-v1

Generated from Trainer

Model card Files Files and versions Community

nisten commited on Feb 3

Commit

ec7e02f

•

1 Parent(s): 63856c4

Update README.md

Files changed (1) hide show

README.md +9 -4

README.md CHANGED Viewed

@@ -1,14 +1,20 @@
 ---
-license: cc-by-nc-nd-4.0
 library_name: peft
 tags:
 - generated_from_trainer
 base_model: nisten/shqiponja-15b-v1
 model-index:
-- name: alora-out
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -75,5 +81,4 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 10
-- num_epochs: 3

 ---
+license: gpl-3.0
 library_name: peft
 tags:
 - generated_from_trainer
 base_model: nisten/shqiponja-15b-v1
 model-index:
+- name: shqiponja-15
   results: []
+datasets:
+- iamshnoo/alpaca-cleaned-albanian
+- noxneural/lilium_albanicum_eng_alb
 ---
+**15.6b 2expert MoE**
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 10
+- num_epochs: 3