File size: 8,604 Bytes
0710ed6 2df7f66 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 |
---
tags:
- generated_from_trainer
datasets:
- common_voice_13_0
metrics:
- wer
model-index:
- name: DeCRED_small_cv_2
results: []
---
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->
# DeCRED_small_cv_2
This model is a fine-tuned version of [](https://huggingface.co/) on the common_voice_13_0 dataset.
It achieves the following results on the evaluation set:
- Loss: 1.0669
- Cer: 0.0663
- Wer: 0.1563
- Mer: 0.1532
- Wil: 0.2546
- Wip: 0.7454
- Hits: 128002
- Substitutions: 17367
- Deletions: 2812
- Insertions: 2975
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.002
- train_batch_size: 128
- eval_batch_size: 64
- seed: 42
- distributed_type: multi-GPU
- num_devices: 2
- total_train_batch_size: 256
- total_eval_batch_size: 128
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 10000
- num_epochs: 50.0
### Training results
| Training Loss | Epoch | Step | Cer | Deletions | Hits | Insertions | Validation Loss | Mer | Substitutions | Wer | Wil | Wip |
|:-------------:|:-----:|:------:|:------:|:---------:|:------:|:----------:|:---------------:|:------:|:-------------:|:------:|:------:|:------:|
| 1.6144 | 5.0 | 14885 | 0.1526 | 5701 | 107636 | 6856 | 1.4427 | 0.3057 | 34844 | 0.3199 | 0.4765 | 0.5235 |
| 1.5552 | 6.0 | 17862 | 0.1438 | 5401 | 109845 | 6588 | 1.3968 | 0.2903 | 32935 | 0.3032 | 0.4549 | 0.5451 |
| 1.5102 | 7.0 | 20839 | 0.1327 | 5227 | 112073 | 5890 | 1.3578 | 0.2726 | 30881 | 0.2834 | 0.4305 | 0.5695 |
| 1.4504 | 8.0 | 23816 | 0.1256 | 4512 | 113841 | 5758 | 1.3262 | 0.2605 | 29828 | 0.2706 | 0.4147 | 0.5853 |
| 1.4098 | 9.0 | 26793 | 0.1195 | 4580 | 115426 | 5441 | 1.2918 | 0.2486 | 28175 | 0.2578 | 0.3967 | 0.6033 |
| 1.3717 | 10.0 | 29770 | 0.1146 | 4666 | 116052 | 4869 | 1.2777 | 0.2417 | 27463 | 0.2497 | 0.3875 | 0.6125 |
| 1.3573 | 11.0 | 32747 | 0.1123 | 4193 | 117058 | 5268 | 1.2628 | 0.2372 | 26930 | 0.2456 | 0.3804 | 0.6196 |
| 1.3433 | 12.0 | 35724 | 0.1085 | 4814 | 117567 | 4293 | 1.2455 | 0.2289 | 25800 | 0.2356 | 0.3683 | 0.6317 |
| 1.3281 | 13.0 | 38701 | 0.1068 | 3789 | 118972 | 5402 | 1.2333 | 0.2254 | 25420 | 0.2336 | 0.3623 | 0.6377 |
| 1.3068 | 14.0 | 41678 | 0.1019 | 4076 | 119434 | 4622 | 1.2159 | 0.2184 | 24671 | 0.2252 | 0.3527 | 0.6473 |
| 1.2847 | 15.0 | 44655 | 0.1017 | 4042 | 119608 | 4683 | 1.2081 | 0.2176 | 24531 | 0.2244 | 0.3513 | 0.6487 |
| 1.2753 | 16.0 | 47632 | 0.1007 | 4211 | 119928 | 4304 | 1.2023 | 0.2135 | 24042 | 0.2197 | 0.3454 | 0.6546 |
| 1.2793 | 17.0 | 50609 | 0.0950 | 3660 | 121093 | 4365 | 1.1862 | 0.2062 | 23428 | 0.2123 | 0.3354 | 0.6646 |
| 1.2676 | 18.0 | 53586 | 0.0927 | 3813 | 121198 | 4207 | 1.1843 | 0.2047 | 23170 | 0.2105 | 0.3328 | 0.6672 |
| 1.2256 | 19.0 | 56563 | 0.0936 | 3948 | 121257 | 4033 | 1.1795 | 0.2034 | 22976 | 0.2089 | 0.3308 | 0.6692 |
| 1.2238 | 20.0 | 59540 | 0.0932 | 3634 | 121864 | 4372 | 1.1736 | 0.2012 | 22683 | 0.2071 | 0.3270 | 0.6730 |
| 1.2206 | 21.0 | 62517 | 0.0892 | 3862 | 122333 | 3732 | 1.1623 | 0.1947 | 21986 | 0.1996 | 0.3178 | 0.6822 |
| 1.2018 | 22.0 | 65494 | 0.0893 | 4037 | 122051 | 3703 | 1.1614 | 0.1964 | 22093 | 0.2013 | 0.3200 | 0.6800 |
| 1.1791 | 23.0 | 68471 | 1.1510 | 0.0868 | 0.1953 | 0.1906 | 0.3114 | 0.6886 | 122943 | 21479 | 3759 | 3708 |
| 1.1958 | 24.0 | 71448 | 1.1438 | 0.0855 | 0.1931 | 0.1883 | 0.3078 | 0.6922 | 123356 | 21215 | 3610 | 3784 |
| 1.1672 | 25.0 | 74425 | 1.1420 | 0.0863 | 0.1940 | 0.1891 | 0.3088 | 0.6912 | 123289 | 21266 | 3626 | 3861 |
| 1.1595 | 26.0 | 77402 | 1.1358 | 0.0843 | 0.1898 | 0.1852 | 0.3026 | 0.6974 | 123784 | 20743 | 3654 | 3735 |
| 1.1803 | 27.0 | 80379 | 1.1343 | 0.0838 | 0.1901 | 0.1856 | 0.3041 | 0.6959 | 123595 | 20966 | 3620 | 3580 |
| 1.1488 | 28.0 | 83356 | 1.1262 | 0.0810 | 0.1855 | 0.1809 | 0.2972 | 0.7028 | 124441 | 20511 | 3229 | 3746 |
| 1.1303 | 29.0 | 86333 | 1.1233 | 0.0801 | 0.1837 | 0.1793 | 0.2946 | 0.7054 | 124600 | 20302 | 3279 | 3636 |
| 1.1266 | 30.0 | 89310 | 1.1203 | 0.0791 | 0.1818 | 0.1777 | 0.2918 | 0.7082 | 124687 | 20007 | 3487 | 3447 |
| 1.14 | 31.0 | 92287 | 1.1179 | 0.0790 | 0.1813 | 0.1769 | 0.2905 | 0.7095 | 124938 | 19925 | 3318 | 3616 |
| 1.1151 | 32.0 | 95264 | 1.1115 | 0.0776 | 0.1794 | 0.1752 | 0.2885 | 0.7115 | 125137 | 19847 | 3197 | 3534 |
| 1.1043 | 33.0 | 98241 | 1.1080 | 0.0773 | 0.1785 | 0.1744 | 0.2866 | 0.7134 | 125253 | 19624 | 3304 | 3522 |
| 1.1157 | 34.0 | 101218 | 1.1039 | 0.0762 | 0.1750 | 0.1710 | 0.2817 | 0.7183 | 125705 | 19302 | 3174 | 3458 |
| 1.0911 | 35.0 | 104195 | 1.1004 | 0.0747 | 0.1740 | 0.1700 | 0.2800 | 0.7200 | 125869 | 19160 | 3152 | 3466 |
| 1.0722 | 36.0 | 107172 | 1.0978 | 0.0743 | 0.1719 | 0.1684 | 0.2776 | 0.7224 | 125819 | 18952 | 3410 | 3111 |
| 1.092 | 37.0 | 110149 | 1.0953 | 0.0742 | 0.1714 | 0.1676 | 0.2763 | 0.7237 | 126142 | 18878 | 3161 | 3362 |
| 1.0763 | 38.0 | 113126 | 1.0914 | 0.0722 | 0.1686 | 0.1651 | 0.2726 | 0.7274 | 126377 | 18617 | 3187 | 3181 |
| 1.0667 | 39.0 | 116103 | 1.0918 | 0.0729 | 0.1690 | 0.1654 | 0.2728 | 0.7272 | 126366 | 18602 | 3213 | 3224 |
| 1.0651 | 40.0 | 119080 | 1.0845 | 0.0718 | 0.1662 | 0.1627 | 0.2690 | 0.7310 | 126749 | 18373 | 3059 | 3191 |
| 1.0761 | 41.0 | 122057 | 1.0836 | 0.0703 | 0.1648 | 0.1614 | 0.2673 | 0.7327 | 126911 | 18271 | 2999 | 3156 |
| 1.0509 | 42.0 | 125034 | 1.0828 | 0.0709 | 0.1647 | 0.1615 | 0.2670 | 0.7330 | 126714 | 18177 | 3290 | 2942 |
| 1.0409 | 43.0 | 128011 | 1.0798 | 0.0707 | 0.1640 | 0.1607 | 0.2658 | 0.7342 | 126946 | 18103 | 3132 | 3070 |
| 1.0525 | 44.0 | 130988 | 1.0760 | 0.0688 | 0.1608 | 0.1575 | 0.2614 | 0.7386 | 127451 | 17870 | 2860 | 3100 |
| 1.0359 | 45.0 | 133965 | 1.0745 | 0.0680 | 0.1601 | 0.1568 | 0.2602 | 0.7398 | 127571 | 17771 | 2839 | 3115 |
| 1.0144 | 46.0 | 136942 | 1.0738 | 0.0681 | 0.1607 | 0.1574 | 0.2614 | 0.7386 | 127503 | 17888 | 2790 | 3139 |
| 1.054 | 47.0 | 139919 | 1.0691 | 0.0672 | 0.1586 | 0.1554 | 0.2578 | 0.7422 | 127745 | 17575 | 2861 | 3062 |
| 1.0427 | 48.0 | 142896 | 1.0681 | 0.0667 | 0.1573 | 0.1542 | 0.2562 | 0.7438 | 127851 | 17473 | 2857 | 2981 |
| 1.0067 | 49.0 | 145873 | 1.0682 | 0.0668 | 0.1568 | 0.1537 | 0.2553 | 0.7447 | 127906 | 17401 | 2874 | 2957 |
| 1.0054 | 50.0 | 148850 | 1.0669 | 0.0663 | 0.1563 | 0.1532 | 0.2546 | 0.7454 | 128002 | 17367 | 2812 | 2975 |
### Framework versions
- Transformers 4.40.0.dev0
- Pytorch 2.2.0+rocm5.6
- Datasets 2.18.0
- Tokenizers 0.15.2
### Wandb run
https://wandb.ai/butspeechfit/decred_commonvoice_en/runs/DeCRED_small_cv_2_continue |