teddy-f-47 committed on
Commit
12482b0
·
1 Parent(s): 8427f91

Update README.md

Files changed (1)
  1. README.md +6 -4
README.md CHANGED
@@ -13,15 +13,15 @@ should probably proofread and complete it, then remove this comment. -->
 
 # phi-1_5-pl-v_0_1
 
-This model is based on [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5). It is trained from scratch on the 20231201 Polish Wikipedia dump.
+This model is based on [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5). It was trained from scratch on the 20231201 Polish Wikipedia dump.
 
 ## Model description
 
-More information needed
+The model was trained for a context length of 1024 tokens.
 
 ## Intended uses & limitations
 
-More information needed
+The model is intended for research purposes only. It may generate fictitious, incorrect, unethical, or biased texts. At its current state, it is not suitable for production purposes.
 
 ## Training and evaluation data
 
@@ -50,11 +50,13 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 2
+- precision: bf16
 - seed: 42
 
 ### Training results
 
-train_loss: 2.727
+- runtime: 2d 21h 26m 36s
+- train_loss: 2.727
 
 ### Framework versions
 
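
The hyperparameter list touched by this commit names a cosine learning-rate schedule with a warmup ratio of 0.1. The sketch below shows the general shape such a schedule usually takes: a linear ramp over the first 10% of steps, followed by a half-cosine decay to zero. It is an illustration only. The function name `lr_multiplier`, the rounding of the warmup length, and the step count in the usage note are assumptions, not values taken from the README or from the training code.

```python
import math


def lr_multiplier(step: int, total_steps: int, warmup_ratio: float = 0.1) -> float:
    """Return the factor applied to the base learning rate at `step`.

    Hypothetical helper: assumes linear warmup followed by a single
    half-cosine decay to zero, which is a common reading of
    `lr_scheduler_type: cosine` with `lr_scheduler_warmup_ratio: 0.1`.
    """
    # Assumption: the warmup length is the ratio times the total, rounded up.
    warmup_steps = math.ceil(total_steps * warmup_ratio)

    if step < warmup_steps:
        # Linear ramp from 0 up to 1 during warmup.
        return step / max(1, warmup_steps)

    # Fraction of the post-warmup phase completed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay from 1 down to 0.
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))
```

With a hypothetical run of 1000 optimizer steps, `lr_multiplier(0, 1000)` is 0, the factor peaks near 1 at the end of warmup, and it decays back toward 0 by step 1000. The actual number of steps for this model is not stated in the diff.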