Nahuatl_Espanol_v1 / README.md
Vichentito's picture
End of training
f0915a5 verified
|
raw
history blame
15.3 kB
metadata
tags:
  - generated_from_trainer
metrics:
  - bleu
model-index:
  - name: Nahuatl_Espanol_v1
    results: []

Nahuatl_Espanol_v1

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.7412
  • Bleu: 1.5025
  • Gen Len: 17.0003

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20

Training results

Training Loss Epoch Step Validation Loss Bleu Gen Len
No log 0.1 100 2.1516 1.0466 17.3424
No log 0.2 200 2.1411 1.0414 17.2935
No log 0.3 300 2.1333 0.9982 17.3255
No log 0.4 400 2.1277 1.0204 17.3515
2.2991 0.5 500 2.1265 1.1358 17.1251
2.2991 0.6 600 2.1101 1.0457 17.3013
2.2991 0.71 700 2.1052 1.0824 17.1894
2.2991 0.81 800 2.0963 1.0598 17.0784
2.2991 0.91 900 2.0911 1.0469 17.3333
2.2683 1.01 1000 2.0851 1.0935 17.241
2.2683 1.11 1100 2.0749 1.035 17.3406
2.2683 1.21 1200 2.0685 1.0922 17.2731
2.2683 1.31 1300 2.0613 1.1029 17.2917
2.2683 1.41 1400 2.0587 1.1158 17.1735
2.2775 1.51 1500 2.0533 1.1876 17.1563
2.2775 1.61 1600 2.0449 1.1475 17.2615
2.2775 1.71 1700 2.0410 1.1033 17.2895
2.2775 1.81 1800 2.0368 1.1283 17.1944
2.2775 1.92 1900 2.0308 1.1413 17.1435
2.2414 2.02 2000 2.0257 1.1287 17.1473
2.2414 2.12 2100 2.0193 1.1557 17.1815
2.2414 2.22 2200 2.0141 1.1325 17.0784
2.2414 2.32 2300 2.0111 1.1984 17.0877
2.2414 2.42 2400 2.0075 1.2308 17.1437
2.2123 2.52 2500 2.0024 1.2126 17.1866
2.2123 2.62 2600 1.9951 1.1916 17.2325
2.2123 2.72 2700 1.9909 1.2253 17.1599
2.2123 2.82 2800 1.9878 1.2269 17.1614
2.2123 2.92 2900 1.9855 1.2308 17.1031
2.1786 3.02 3000 1.9791 1.2687 17.1392
2.1786 3.12 3100 1.9731 1.2657 17.0366
2.1786 3.23 3200 1.9677 1.2537 17.1979
2.1786 3.33 3300 1.9657 1.2297 17.1485
2.1786 3.43 3400 1.9605 1.2355 17.0915
2.1423 3.53 3500 1.9592 1.2284 17.1583
2.1423 3.63 3600 1.9541 1.212 17.1735
2.1423 3.73 3700 1.9504 1.2675 17.113
2.1423 3.83 3800 1.9452 1.3087 17.119
2.1423 3.93 3900 1.9460 1.3126 17.0731
2.1445 4.03 4000 1.9410 1.2955 17.0759
2.1445 4.13 4100 1.9391 1.2635 17.1543
2.1445 4.23 4200 1.9366 1.2737 17.1077
2.1445 4.33 4300 1.9285 1.2516 17.1215
2.1445 4.44 4400 1.9274 1.2881 17.1054
2.1092 4.54 4500 1.9275 1.2693 17.147
2.1092 4.64 4600 1.9193 1.3048 17.113
2.1092 4.74 4700 1.9171 1.2784 17.0648
2.1092 4.84 4800 1.9130 1.2716 17.0792
2.1092 4.94 4900 1.9105 1.2649 17.1394
2.0841 5.04 5000 1.9076 1.3088 17.1069
2.0841 5.14 5100 1.9052 1.343 17.1122
2.0841 5.24 5200 1.9041 1.2905 17.1997
2.0841 5.34 5300 1.9012 1.3532 17.0872
2.0841 5.44 5400 1.8951 1.3142 17.0577
2.0667 5.54 5500 1.8932 1.3118 17.0918
2.0667 5.65 5600 1.8919 1.2924 17.032
2.0667 5.75 5700 1.8902 1.2985 17.0857
2.0667 5.85 5800 1.8878 1.3215 17.064
2.0667 5.95 5900 1.8845 1.3527 17.1079
2.0568 6.05 6000 1.8803 1.3159 17.084
2.0568 6.15 6100 1.8824 1.3597 17.0681
2.0568 6.25 6200 1.8784 1.3658 17.0383
2.0568 6.35 6300 1.8728 1.3394 17.0338
2.0568 6.45 6400 1.8689 1.3449 17.0542
2.0375 6.55 6500 1.8690 1.3396 17.0484
2.0375 6.65 6600 1.8663 1.365 17.064
2.0375 6.75 6700 1.8624 1.3818 17.0272
2.0375 6.85 6800 1.8596 1.3753 17.0451
2.0375 6.96 6900 1.8601 1.3729 17.0386
2.0146 7.06 7000 1.8578 1.3698 17.0691
2.0146 7.16 7100 1.8567 1.379 17.0666
2.0146 7.26 7200 1.8540 1.3879 17.0466
2.0146 7.36 7300 1.8512 1.3935 17.0295
2.0146 7.46 7400 1.8490 1.376 17.0638
2.0007 7.56 7500 1.8458 1.391 17.034
2.0007 7.66 7600 1.8454 1.3952 17.0403
2.0007 7.76 7700 1.8425 1.3835 17.0532
2.0007 7.86 7800 1.8398 1.3824 17.1062
2.0007 7.96 7900 1.8362 1.3773 17.0257
1.9958 8.06 8000 1.8392 1.4047 17.0648
1.9958 8.17 8100 1.8359 1.4128 17.053
1.9958 8.27 8200 1.8352 1.4283 17.0414
1.9958 8.37 8300 1.8339 1.4156 17.033
1.9958 8.47 8400 1.8333 1.4265 17.0514
1.9757 8.57 8500 1.8271 1.4015 17.0368
1.9757 8.67 8600 1.8262 1.4201 17.03
1.9757 8.77 8700 1.8240 1.4229 16.9897
1.9757 8.87 8800 1.8217 1.4076 17.0345
1.9757 8.97 8900 1.8215 1.4097 17.0663
1.9724 9.07 9000 1.8184 1.4134 17.0298
1.9724 9.17 9100 1.8199 1.4336 17.0232
1.9724 9.27 9200 1.8157 1.4273 17.0315
1.9724 9.38 9300 1.8164 1.4237 17.0582
1.9724 9.48 9400 1.8120 1.438 17.0335
1.9576 9.58 9500 1.8110 1.4099 17.0139
1.9576 9.68 9600 1.8072 1.4037 17.0265
1.9576 9.78 9700 1.8100 1.4179 17.0272
1.9576 9.88 9800 1.8104 1.4613 16.9927
1.9576 9.98 9900 1.8029 1.4167 17.0477
1.9489 10.08 10000 1.8082 1.4385 17.0194
1.9489 10.18 10100 1.8037 1.4452 17.0229
1.9489 10.28 10200 1.8023 1.433 17.0043
1.9489 10.38 10300 1.8026 1.4307 17.028
1.9489 10.48 10400 1.7999 1.4571 17.0275
1.9345 10.58 10500 1.7996 1.4477 17.0802
1.9345 10.69 10600 1.7963 1.4575 17.0161
1.9345 10.79 10700 1.7963 1.4435 17.0103
1.9345 10.89 10800 1.7914 1.4397 17.0388
1.9345 10.99 10900 1.7927 1.4422 16.9829
1.9293 11.09 11000 1.7894 1.4422 17.0066
1.9293 11.19 11100 1.7923 1.4843 17.0401
1.9293 11.29 11200 1.7912 1.4638 17.0182
1.9293 11.39 11300 1.7872 1.4528 17.0477
1.9293 11.49 11400 1.7855 1.4406 17.0444
1.9106 11.59 11500 1.7856 1.4566 17.0398
1.9106 11.69 11600 1.7859 1.4779 17.025
1.9106 11.79 11700 1.7828 1.4783 17.0149
1.9106 11.9 11800 1.7819 1.451 17.0325
1.9106 12.0 11900 1.7793 1.4928 17.0391
1.9126 12.1 12000 1.7805 1.4568 16.9945
1.9126 12.2 12100 1.7806 1.4858 16.9783
1.9126 12.3 12200 1.7781 1.4565 16.9912
1.9126 12.4 12300 1.7784 1.474 17.0255
1.9126 12.5 12400 1.7760 1.4754 17.0217
1.9055 12.6 12500 1.7764 1.4778 17.0113
1.9055 12.7 12600 1.7748 1.4778 17.0204
1.9055 12.8 12700 1.7737 1.4919 17.0219
1.9055 12.9 12800 1.7722 1.4691 17.0098
1.9055 13.0 12900 1.7698 1.4749 17.0139
1.9039 13.1 13000 1.7701 1.4694 17.0282
1.9039 13.21 13100 1.7737 1.4957 16.9755
1.9039 13.31 13200 1.7711 1.5004 17.0214
1.9039 13.41 13300 1.7693 1.4821 17.0207
1.9039 13.51 13400 1.7650 1.4707 17.0255
1.8825 13.61 13500 1.7673 1.4961 17.0219
1.8825 13.71 13600 1.7672 1.4643 17.028
1.8825 13.81 13700 1.7647 1.4712 16.9861
1.8825 13.91 13800 1.7627 1.4686 17.0015
1.8825 14.01 13900 1.7608 1.4556 17.0033
1.8863 14.11 14000 1.7621 1.4764 17.0025
1.8863 14.21 14100 1.7614 1.481 17.0207
1.8863 14.31 14200 1.7611 1.4844 17.0166
1.8863 14.42 14300 1.7591 1.4837 16.9622
1.8863 14.52 14400 1.7585 1.4864 17.0111
1.8877 14.62 14500 1.7589 1.4742 17.0353
1.8877 14.72 14600 1.7585 1.474 16.9977
1.8877 14.82 14700 1.7604 1.4952 17.0048
1.8877 14.92 14800 1.7562 1.4678 17.0096
1.8877 15.02 14900 1.7561 1.4883 17.0008
1.8722 15.12 15000 1.7547 1.4768 16.9871
1.8722 15.22 15100 1.7554 1.4822 17.0444
1.8722 15.32 15200 1.7536 1.5027 16.9897
1.8722 15.42 15300 1.7563 1.4845 17.0101
1.8722 15.52 15400 1.7521 1.4844 17.0144
1.8685 15.62 15500 1.7522 1.4963 16.9793
1.8685 15.73 15600 1.7523 1.4978 16.9939
1.8685 15.83 15700 1.7512 1.4761 16.9967
1.8685 15.93 15800 1.7524 1.4903 17.0035
1.8685 16.03 15900 1.7510 1.4934 16.999
1.8759 16.13 16000 1.7526 1.5051 17.0068
1.8759 16.23 16100 1.7494 1.5 17.0156
1.8759 16.33 16200 1.7500 1.5096 17.0262
1.8759 16.43 16300 1.7499 1.4996 17.0121
1.8759 16.53 16400 1.7501 1.5095 16.9899
1.8591 16.63 16500 1.7465 1.4867 17.0197
1.8591 16.73 16600 1.7490 1.5094 16.9836
1.8591 16.83 16700 1.7478 1.499 16.9982
1.8591 16.94 16800 1.7470 1.4973 17.0103
1.8591 17.04 16900 1.7465 1.5013 16.9708
1.8634 17.14 17000 1.7470 1.5025 16.9856
1.8634 17.24 17100 1.7475 1.5054 16.9607
1.8634 17.34 17200 1.7467 1.5119 16.972
1.8634 17.44 17300 1.7460 1.5091 16.9836
1.8634 17.54 17400 1.7441 1.4993 16.9962
1.8599 17.64 17500 1.7434 1.5036 16.9743
1.8599 17.74 17600 1.7440 1.5023 17.001
1.8599 17.84 17700 1.7455 1.5053 16.9972
1.8599 17.94 17800 1.7438 1.5095 17.0141
1.8599 18.04 17900 1.7439 1.5115 16.9894
1.8574 18.15 18000 1.7437 1.5022 16.999
1.8574 18.25 18100 1.7435 1.5055 17.0066
1.8574 18.35 18200 1.7433 1.5102 17.0113
1.8574 18.45 18300 1.7419 1.5027 16.9919
1.8574 18.55 18400 1.7416 1.5019 17.0008
1.8454 18.65 18500 1.7418 1.509 16.9924
1.8454 18.75 18600 1.7415 1.5002 16.9912
1.8454 18.85 18700 1.7414 1.5028 16.9894
1.8454 18.95 18800 1.7417 1.5089 16.9816
1.8454 19.05 18900 1.7416 1.5065 17.0003
1.8574 19.15 19000 1.7419 1.506 16.9909
1.8574 19.25 19100 1.7417 1.5017 16.9987
1.8574 19.35 19200 1.7416 1.504 17.0025
1.8574 19.46 19300 1.7413 1.5001 16.9997
1.8574 19.56 19400 1.7409 1.5015 17.002
1.8435 19.66 19500 1.7410 1.5009 17.005
1.8435 19.76 19600 1.7411 1.5014 17.0018
1.8435 19.86 19700 1.7412 1.5015 17.0005
1.8435 19.96 19800 1.7412 1.5025 17.0003

Framework versions

  • Transformers 4.38.2
  • Pytorch 2.2.1+cu121
  • Datasets 2.19.0
  • Tokenizers 0.15.2