vxbrandon committed on
Commit a4e4990 · verified · 1 Parent(s): 31deee2

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.2401
+ - Loss: 2.2258
 
  ## Model description
 
@@ -43,7 +43,7 @@ The following hyperparameters were used during training:
  - total_train_batch_size: 8
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - training_steps: 175
+ - training_steps: 275
 
  ### Training results
 
@@ -56,6 +56,10 @@ The following hyperparameters were used during training:
  | 2.2562 | 0.01 | 125 | 2.2787 |
  | 2.4057 | 0.01 | 150 | 2.2709 |
  | 2.3147 | 0.01 | 175 | 2.2635 |
+ | 2.2796 | 0.02 | 200 | 2.2600 |
+ | 2.2157 | 0.02 | 225 | 2.2557 |
+ | 2.303 | 0.02 | 250 | 2.2542 |
+ | 2.0701 | 0.02 | 275 | 2.2511 |
 
 
  ### Framework versions
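The rows touched by this hunk can be checked quickly for the trend they show. The following is a minimal sketch, assuming the table columns are training loss, epoch, step, and validation loss (the header row lies outside the hunk, so that column order is an assumption); `parse_rows` and `ROWS` are illustrative names, not part of the repository.

```python
# Rows copied verbatim from the "Training results" hunk of README.md.
ROWS = """\
| 2.2562 | 0.01 | 125 | 2.2787 |
| 2.4057 | 0.01 | 150 | 2.2709 |
| 2.3147 | 0.01 | 175 | 2.2635 |
| 2.2796 | 0.02 | 200 | 2.2600 |
| 2.2157 | 0.02 | 225 | 2.2557 |
| 2.303 | 0.02 | 250 | 2.2542 |
| 2.0701 | 0.02 | 275 | 2.2511 |
"""


def parse_rows(text):
    """Parse markdown table rows into (step, train_loss, val_loss) tuples.

    Assumed column order: training loss, epoch, step, validation loss.
    """
    rows = []
    for line in text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        train_loss, _epoch, step, val_loss = cells
        rows.append((int(step), float(train_loss), float(val_loss)))
    return rows


rows = parse_rows(ROWS)

# Change in validation loss between consecutive logged steps.
val_deltas = [later[2] - earlier[2] for earlier, later in zip(rows, rows[1:])]

# Net change in validation loss from the first to the last row shown.
total_change = rows[-1][2] - rows[0][2]

for (step, _train, val), delta in zip(rows[1:], val_deltas):
    print(f"step {step}: val_loss={val:.4f} (change {delta:+.4f})")
print(f"net change over steps {rows[0][0]}-{rows[-1][0]}: {total_change:+.4f}")
```

Under that column assumption, validation loss decreases at every logged step in this range, while the training loss column moves up and down from row to row.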
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1994e4a7394df70c7cde65ffa4783ad0a77d5fa04c3fe731bb1a0869c9aeb4ef
+ oid sha256:b5793021cfe4b316ec904751e3a53e74be76e747642784c02894f655122c74c0
  size 4943162336
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a9f1c0cf3a18188774c1677854f5dd9c63708b97c0806af4b4afa374e06dd799
+ oid sha256:2159f4aae48e2fedc2f651548f0418158bf448130fe38a2b9c2c9551d55b170a
  size 4999819336
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:dc74101a374325023f08ef07364e92051197a9238ab0da5f8c4911b1220f8bef
+ oid sha256:1dc9d4f37654a4f1af8e1933f34363e11bae3b99234907c2e0c431858e471d39
  size 4540516344
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2112ab2def61572271b158640dd6fe00fef638a9586e61f80d38ebce709e6d07
+ oid sha256:ff3ad4b083256c5e6d6bad4c9227c91ce5c0eb52a6aea8b22be84174529ef2a0
  size 6456
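
The binary files in this commit are stored as Git LFS pointers, so each diff shows only a changed `oid sha256:` line while `size` stays the same. A downloaded file can be checked against its pointer roughly as follows. This is a minimal sketch, not the tooling used by the repository; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and only the three pointer keys visible in the diff are handled.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo!r}")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 digest recorded in the pointer."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte shards are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer content for training_args.bin, copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ff3ad4b083256c5e6d6bad4c9227c91ce5c0eb52a6aea8b22be84174529ef2a0
size 6456
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer(Path("training_args.bin"), pointer)` on a local copy (the path here is illustrative) would then report whether that copy corresponds to this commit. The size check runs first because it is cheap and rules out most mismatches before any hashing.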