Shakhovak/flan-t5-large-absa-multitask-laptops

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0761
 ## Model description
@@ -41,24 +41,30 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.4592        | 0.32  | 200  | 0.2723          |
-| 0.2422        | 0.63  | 400  | 0.1839          |
-| 0.1823        | 0.95  | 600  | 0.1460          |
-| 0.1348        | 1.26  | 800  | 0.1322          |
-| 0.1181        | 1.58  | 1000 | 0.1089          |
-| 0.1047        | 1.9   | 1200 | 0.1064          |
-| 0.0835        | 2.21  | 1400 | 0.0928          |
-| 0.0685        | 2.53  | 1600 | 0.0909          |
-| 0.0614        | 2.84  | 1800 | 0.0873          |
-| 0.0518        | 3.16  | 2000 | 0.0731          |
-| 0.0418        | 3.48  | 2200 | 0.0733          |
-| 0.0449        | 3.79  | 2400 | 0.0761          |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0404
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 6
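
The `linear` scheduler with `lr_scheduler_warmup_ratio: 0.1` listed above can be sketched as a learning-rate multiplier that ramps up over the first 10% of steps and then decays linearly to zero. This is a minimal illustration that mirrors the usual linear-warmup formula, not the training script itself. The function name `linear_warmup_multiplier` and the `total_steps` value are assumptions: the card does not state the total step count, and the 3800 used below is only a rough estimate from the results table (about 633 steps per epoch over 6 epochs).

```python
import math


def linear_warmup_multiplier(step, total_steps, warmup_ratio=0.1):
    """Return the factor applied to the base learning rate at `step`.

    Hypothetical helper: linear warmup from 0 to 1, then linear decay to 0.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Warmup phase: grow linearly from 0 towards 1.
        return step / max(1, warmup_steps)
    # Decay phase: shrink linearly from 1 down to 0 at the last step.
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    total_steps = 3800  # assumed estimate, not a value taken from the card
    for step in (0, 200, 380, 2000, 3600, 3800):
        print(step, round(linear_warmup_multiplier(step, total_steps), 4))
```

Multiplying the result by the base learning rate gives the effective rate at each logged step.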
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.6928        | 0.32  | 200  | 0.2892          |
+| 0.2913        | 0.63  | 400  | 0.1697          |
+| 0.2028        | 0.95  | 600  | 0.1296          |
+| 0.1448        | 1.26  | 800  | 0.1183          |
+| 0.1275        | 1.58  | 1000 | 0.1028          |
+| 0.11          | 1.9   | 1200 | 0.0827          |
+| 0.0821        | 2.21  | 1400 | 0.0754          |
+| 0.069         | 2.53  | 1600 | 0.0758          |
+| 0.0685        | 2.84  | 1800 | 0.0598          |
+| 0.0517        | 3.16  | 2000 | 0.0682          |
+| 0.0427        | 3.48  | 2200 | 0.0526          |
+| 0.0414        | 3.79  | 2400 | 0.0498          |
+| 0.0326        | 4.11  | 2600 | 0.0479          |
+| 0.0325        | 4.42  | 2800 | 0.0423          |
+| 0.0236        | 4.74  | 3000 | 0.0481          |
+| 0.0264        | 5.06  | 3200 | 0.0416          |
+| 0.024         | 5.37  | 3400 | 0.0401          |
+| 0.0164        | 5.69  | 3600 | 0.0404          |
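
The reported evaluation loss (0.0404) is the value at the last logged step, which is not necessarily the lowest one in the table. A small sketch for finding the best checkpoint by validation loss is given below. The `(step, validation_loss)` pairs are copied from the updated table; the variable and function names are illustrative assumptions.

```python
# (step, validation_loss) pairs copied from the updated results table.
new_run = [
    (200, 0.2892), (400, 0.1697), (600, 0.1296), (800, 0.1183),
    (1000, 0.1028), (1200, 0.0827), (1400, 0.0754), (1600, 0.0758),
    (1800, 0.0598), (2000, 0.0682), (2200, 0.0526), (2400, 0.0498),
    (2600, 0.0479), (2800, 0.0423), (3000, 0.0481), (3200, 0.0416),
    (3400, 0.0401), (3600, 0.0404),
]


def best_checkpoint(history):
    """Return the (step, loss) pair with the lowest validation loss."""
    return min(history, key=lambda row: row[1])


best_step, best_loss = best_checkpoint(new_run)
final_step, final_loss = new_run[-1]

print(f"best:  step {best_step}, validation loss {best_loss}")
print(f"final: step {final_step}, validation loss {final_loss}")
```

In this run the minimum falls at step 3400 (0.0401), slightly below the final value at step 3600. Whether the uploaded weights correspond to the final or the best step is not stated in the card.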
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:102d147697d2a0395f329d05a26377bdf4adb2abddedb58ea038a8ae621ac95e
 size 3132668808

 version https://git-lfs.github.com/spec/v1
+oid sha256:a772cbb887ff9cd07bb3e00d126ecf4fde1f603666f28cbbf4ca62435474efb3
 size 3132668808

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:491f6bb47797b9eb30da8202170b1d862dcd8f264066c3488226a3dc16d08385
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:eac21efae427b9b946c8ddbb7c2b77d246a427662027c316674d223dbf294995
 size 5112
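
Both changed files are Git LFS pointers: each records a spec version, a `sha256` object ID, and a size in bytes. In these two diffs only the `oid` changes while the size stays the same. The sketch below shows one way to check a downloaded file against such a pointer using only the standard library. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or any Hugging Face tooling.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Check that the file at `path` has the size and digest in `pointer`."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte weight files do not fill memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text copied from the new training_args.bin entry above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:eac21efae427b9b946c8ddbb7c2b77d246a427662027c316674d223dbf294995
size 5112
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy would return `True` only if both the byte count and the SHA-256 digest agree; the local path here is an assumption.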