teddy-f-47/phi-pl-400M-v_0_1

teddy-f-47 committed on Jan 8, 2024

Commit 12482b0 · 1 Parent(s): 8427f91

Update README.md

Files changed (1)

README.md +6 -4


@@ -13,15 +13,15 @@ should probably proofread and complete it, then remove this comment. -->
 # phi-1_5-pl-v_0_1
-This model is based on [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5). It is trained from scratch on the 20231201 Polish Wikipedia dump.
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
@@ -50,11 +50,13 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 2
 - seed: 42
 ### Training results
-train_loss: 2.727
 ### Framework versions

 # phi-1_5-pl-v_0_1
+This model is based on [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5). It was trained from scratch on the 20231201 Polish Wikipedia dump.
 ## Model description
+The model was trained for a context length of 1024 tokens.
 ## Intended uses & limitations
+The model is intended for research purposes only. It may generate fictitious, incorrect, unethical, or biased texts. At its current state, it is not suitable for production purposes.
 ## Training and evaluation data
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 2
+- precision: bf16
 - seed: 42
 ### Training results
+- runtime: 2d 21h 26m 36s
+- train_loss: 2.727
 ### Framework versions
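
The hyperparameters in this diff list `lr_scheduler_type: cosine` with `lr_scheduler_warmup_ratio: 0.1`. As a rough illustration of what such a schedule looks like, the sketch below computes the learning-rate multiplier at a given step: a linear ramp during warmup, then a half-cosine decay to zero. The function name `lr_multiplier`, the rounding of the warmup step count, and the step total of 1000 are assumptions made for illustration; the actual number of training steps is not stated in the README.

```python
import math


def lr_multiplier(step: int, total_steps: int, warmup_ratio: float = 0.1) -> float:
    """Return the factor applied to the base learning rate at `step`.

    Hypothetical helper: linear warmup followed by cosine decay.
    The warmup length is taken as ceil(total_steps * warmup_ratio),
    which is an assumption about how the ratio is rounded.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to 1 over the warmup phase.
        return step / max(1, warmup_steps)
    # Fraction of the post-warmup phase that has elapsed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay from 1 down to 0.
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    total = 1000  # assumed step count, not taken from the README
    for s in (0, 50, 100, 550, 1000):
        print(s, round(lr_multiplier(s, total), 4))
```

With a warmup ratio of 0.1, the first tenth of training ramps the learning rate up, and the remaining nine tenths decay it smoothly, which is why the multiplier reaches its peak at step 100 of 1000 in this example.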