emdemor commited on
Commit
a627c2e
1 Parent(s): f9fbbdb

Model save

Browse files
Files changed (1) hide show
  1. README.md +18 -9
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 1.5860
22
 
23
  ## Model description
24
 
@@ -37,26 +37,35 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 0.0001
41
  - train_batch_size: 8
42
  - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_ratio: 0.1
47
- - num_epochs: 1
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
- | 1.9237 | 0.16 | 20 | 1.8701 |
55
- | 1.8479 | 0.32 | 40 | 1.7706 |
56
- | 1.7419 | 0.48 | 60 | 1.6614 |
57
- | 1.674 | 0.64 | 80 | 1.6176 |
58
- | 1.6024 | 0.8 | 100 | 1.5984 |
59
- | 1.6203 | 0.96 | 120 | 1.5860 |
 
 
 
 
 
 
 
 
 
60
 
61
 
62
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.4475
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 0.0005
41
  - train_batch_size: 8
42
  - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_ratio: 0.1
47
+ - num_epochs: 3
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
+ | 1.8852 | 0.08 | 50 | 1.7072 |
55
+ | 1.6036 | 0.16 | 100 | 1.4856 |
56
+ | 1.5186 | 0.24 | 150 | 1.4662 |
57
+ | 1.4822 | 0.32 | 200 | 1.4603 |
58
+ | 1.5035 | 0.4 | 250 | 1.4578 |
59
+ | 1.4813 | 0.48 | 300 | 1.4558 |
60
+ | 1.4878 | 0.56 | 350 | 1.4534 |
61
+ | 1.4765 | 0.64 | 400 | 1.4523 |
62
+ | 1.4803 | 0.72 | 450 | 1.4485 |
63
+ | 1.4925 | 0.8 | 500 | 1.4478 |
64
+ | 1.49 | 0.88 | 550 | 1.4467 |
65
+ | 1.4888 | 0.96 | 600 | 1.4461 |
66
+ | 1.4732 | 1.04 | 650 | 1.4470 |
67
+ | 1.4677 | 1.12 | 700 | 1.4476 |
68
+ | 1.4402 | 1.2 | 750 | 1.4475 |
69
 
70
 
71
  ### Framework versions