Padomin/t5-base-TEDxJP-10front-1body-10rear

@@ -16,16 +16,16 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [sonoisa/t5-base-japanese](https://huggingface.co/sonoisa/t5-base-japanese) on the te_dx_jp dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4365
-- Wer: 0.1692
-- Mer: 0.1634
-- Wil: 0.2497
-- Wip: 0.7503
-- Hits: 55920
-- Substitutions: 6354
-- Deletions: 2313
-- Insertions: 2258
-- Cer: 0.1345
 ## Model description
@@ -47,7 +47,7 @@ The following hyperparameters were used during training:
 - learning_rate: 0.0001
 - train_batch_size: 32
 - eval_batch_size: 32
-- seed: 20
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
@@ -57,16 +57,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Wer    | Mer    | Wil    | Wip    | Hits  | Substitutions | Deletions | Insertions | Cer    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|:------:|
-| 0.5883        | 1.0   | 1457  | 0.4647          | 0.2029 | 0.1923 | 0.2814 | 0.7186 | 55034 | 6672          | 2881      | 3553       | 0.1752 |
-| 0.5189        | 2.0   | 2914  | 0.4197          | 0.1740 | 0.1686 | 0.2551 | 0.7449 | 55401 | 6342          | 2844      | 2050       | 0.1352 |
-| 0.4904        | 3.0   | 4371  | 0.4104          | 0.1727 | 0.1668 | 0.2537 | 0.7463 | 55699 | 6398          | 2490      | 2265       | 0.1366 |
-| 0.4099        | 4.0   | 5828  | 0.4057          | 0.1696 | 0.1643 | 0.2506 | 0.7494 | 55704 | 6331          | 2552      | 2069       | 0.1321 |
-| 0.3865        | 5.0   | 7285  | 0.4108          | 0.1700 | 0.1644 | 0.2498 | 0.7502 | 55831 | 6272          | 2484      | 2227       | 0.1337 |
-| 0.335         | 6.0   | 8742  | 0.4177          | 0.1688 | 0.1631 | 0.2487 | 0.7513 | 55940 | 6292          | 2355      | 2253       | 0.1322 |
-| 0.2904        | 7.0   | 10199 | 0.4221          | 0.1687 | 0.1631 | 0.2487 | 0.7513 | 55902 | 6289          | 2396      | 2210       | 0.1334 |
-| 0.2684        | 8.0   | 11656 | 0.4262          | 0.1696 | 0.1639 | 0.2504 | 0.7496 | 55879 | 6373          | 2335      | 2243       | 0.1345 |
-| 0.2681        | 9.0   | 13113 | 0.4326          | 0.1696 | 0.1639 | 0.2505 | 0.7495 | 55897 | 6379          | 2311      | 2265       | 0.1340 |
-| 0.2342        | 10.0  | 14570 | 0.4365          | 0.1692 | 0.1634 | 0.2497 | 0.7503 | 55920 | 6354          | 2313      | 2258       | 0.1345 |
 ### Framework versions

 This model is a fine-tuned version of [sonoisa/t5-base-japanese](https://huggingface.co/sonoisa/t5-base-japanese) on the te_dx_jp dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4366
+- Wer: 0.1686
+- Mer: 0.1630
+- Wil: 0.2490
+- Wip: 0.7510
+- Hits: 55913
+- Substitutions: 6325
+- Deletions: 2349
+- Insertions: 2213
+- Cer: 0.1324
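
The word-level rates in this list can be recomputed from the four alignment counts (Hits, Substitutions, Deletions, Insertions). The sketch below assumes the usual alignment-based definitions of WER, MER, WIP and WIL, as implemented for example in the `jiwer` package; the card does not state which tool produced the numbers, so that is an assumption. The function name `alignment_metrics` is hypothetical and not part of the repository.

```python
def alignment_metrics(hits, substitutions, deletions, insertions):
    """Derive word-level error rates from alignment counts.

    Assumes the standard definitions (an assumption, not confirmed by the card):
      reference length  N = H + S + D
      hypothesis length P = H + S + I
    """
    ref_len = hits + substitutions + deletions
    hyp_len = hits + substitutions + insertions
    errors = substitutions + deletions + insertions

    wer = errors / ref_len                         # word error rate
    mer = errors / (hits + errors)                 # match error rate
    wip = (hits / ref_len) * (hits / hyp_len)      # word information preserved
    wil = 1.0 - wip                                # word information lost
    return {"wer": wer, "mer": mer, "wil": wil, "wip": wip}


# Counts reported before the change (seed 20) and after it (seed 30).
old = alignment_metrics(hits=55920, substitutions=6354, deletions=2313, insertions=2258)
new = alignment_metrics(hits=55913, substitutions=6325, deletions=2349, insertions=2213)

for label, metrics in (("old", old), ("new", new)):
    print(label, {name: round(value, 4) for name, value in metrics.items()})
```

Both sets of counts imply the same reference length (64587 words), which is what one would expect if the evaluation set is unchanged between the two runs. Cer is not covered here because the card does not list character-level counts.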
 ## Model description
 - learning_rate: 0.0001
 - train_batch_size: 32
 - eval_batch_size: 32
+- seed: 30
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
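
To illustrate how the `learning_rate`, `lr_scheduler_type` and `lr_scheduler_warmup_ratio` values interact, here is a minimal pure-Python sketch of a linear schedule with warmup. It assumes the common shape (linear ramp from 0 to the peak rate, then linear decay to 0) and takes the total of 14570 steps from the final row of the training table. The function name `linear_lr_with_warmup` and the rounding of the warmup length are assumptions for illustration, not the training code itself.

```python
def linear_lr_with_warmup(step, base_lr=1e-4, total_steps=14570, warmup_ratio=0.1):
    """Learning rate at a given optimizer step for a linear warmup/decay schedule.

    Assumption: warmup length is total_steps * warmup_ratio, rounded to the
    nearest integer; the actual trainer may round differently.
    """
    warmup_steps = round(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr.
        return base_lr * (step / max(1, warmup_steps))
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * (remaining / max(1, total_steps - warmup_steps))


# Learning rate at the end of each epoch (1457 steps per epoch, per the table).
for epoch in range(0, 11):
    step = epoch * 1457
    print(f"epoch {epoch:2d}  step {step:5d}  lr {linear_lr_with_warmup(step):.2e}")
```

Under these assumptions the warmup covers 1457 steps, which coincides with the first epoch in the table, and the rate decays linearly over the remaining nine epochs.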
 | Training Loss | Epoch | Step  | Validation Loss | Wer    | Mer    | Wil    | Wip    | Hits  | Substitutions | Deletions | Insertions | Cer    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|:------:|
+| 0.5904        | 1.0   | 1457  | 0.4553          | 0.2049 | 0.1941 | 0.2824 | 0.7176 | 54935 | 6595          | 3057      | 3580       | 0.1816 |
+| 0.5001        | 2.0   | 2914  | 0.4201          | 0.1858 | 0.1776 | 0.2657 | 0.7343 | 55561 | 6554          | 2472      | 2973       | 0.1501 |
+| 0.4615        | 3.0   | 4371  | 0.4099          | 0.1748 | 0.1685 | 0.2544 | 0.7456 | 55706 | 6326          | 2555      | 2410       | 0.1414 |
+| 0.3988        | 4.0   | 5828  | 0.4040          | 0.1710 | 0.1654 | 0.2514 | 0.7486 | 55734 | 6319          | 2534      | 2189       | 0.1346 |
+| 0.3859        | 5.0   | 7285  | 0.4131          | 0.1689 | 0.1635 | 0.2487 | 0.7513 | 55808 | 6245          | 2534      | 2129       | 0.1327 |
+| 0.3259        | 6.0   | 8742  | 0.4138          | 0.1695 | 0.1639 | 0.2508 | 0.7492 | 55837 | 6400          | 2350      | 2198       | 0.1325 |
+| 0.2915        | 7.0   | 10199 | 0.4233          | 0.1696 | 0.1637 | 0.2499 | 0.7501 | 55932 | 6344          | 2311      | 2297       | 0.1329 |
+| 0.2638        | 8.0   | 11656 | 0.4298          | 0.1689 | 0.1633 | 0.2492 | 0.7508 | 55892 | 6319          | 2376      | 2213       | 0.1325 |
+| 0.2888        | 9.0   | 13113 | 0.4321          | 0.1686 | 0.1630 | 0.2492 | 0.7508 | 55909 | 6343          | 2335      | 2210       | 0.1319 |
+| 0.2614        | 10.0  | 14570 | 0.4366          | 0.1686 | 0.1630 | 0.2490 | 0.7510 | 55913 | 6325          | 2349      | 2213       | 0.1324 |
 ### Framework versions