Edit model card

Youtube3kTTSModel

This model is a fine-tuned version of microsoft/speecht5_tts on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4839

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 4
  • eval_batch_size: 2
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 32
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 100
  • training_steps: 5000
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss
0.6301 0.2222 100 0.5639
0.6013 0.4444 200 0.5471
0.5666 0.6667 300 0.5315
0.5632 0.8889 400 0.5254
0.5547 1.1111 500 0.5184
0.5583 1.3333 600 0.5211
0.5527 1.5556 700 0.5150
0.5508 1.7778 800 0.5123
0.5432 2.0 900 0.5135
0.5478 2.2222 1000 0.5077
0.5419 2.4444 1100 0.5073
0.5439 2.6667 1200 0.5083
0.5381 2.8889 1300 0.5108
0.5355 3.1111 1400 0.5075
0.5317 3.3333 1500 0.5053
0.5345 3.5556 1600 0.5022
0.5329 3.7778 1700 0.5006
0.53 4.0 1800 0.4965
0.5261 4.2222 1900 0.4971
0.5272 4.4444 2000 0.4976
0.5272 4.6667 2100 0.4943
0.5282 4.8889 2200 0.4938
0.5188 5.1111 2300 0.4980
0.523 5.3333 2400 0.4894
0.5225 5.5556 2500 0.4915
0.5178 5.7778 2600 0.4960
0.5165 6.0 2700 0.4893
0.5098 6.2222 2800 0.4892
0.512 6.4444 2900 0.4868
0.5177 6.6667 3000 0.4868
0.5128 6.8889 3100 0.4883
0.5062 7.1111 3200 0.4852
0.5104 7.3333 3300 0.4898
0.5126 7.5556 3400 0.4887
0.5093 7.7778 3500 0.4908
0.5075 8.0 3600 0.4828
0.5029 8.2222 3700 0.4842
0.5079 8.4444 3800 0.4850
0.5049 8.6667 3900 0.4853
0.5034 8.8889 4000 0.4849
0.4984 9.1111 4100 0.4833
0.5079 9.3333 4200 0.4863
0.5023 9.5556 4300 0.4830
0.5023 9.7778 4400 0.4833
0.5037 10.0 4500 0.4825
0.5035 10.2222 4600 0.4822
0.5011 10.4444 4700 0.4826
0.4969 10.6667 4800 0.4815
0.4958 10.8889 4900 0.4839
0.4972 11.1111 5000 0.4839

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.1+cu121
  • Datasets 3.0.0
  • Tokenizers 0.19.1
Downloads last month
55
Safetensors
Model size
144M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Mohsen21/Youtube3kTTSModel

Finetuned
(615)
this model