metadata

license: cc-by-nc-4.0
base_model: facebook/mms-tts-vie
tags:
  - generated_from_trainer
datasets:
  - audiofolder
model-index:
  - name: speecht5_finetuned_voxpopuli_nl
    results: []

speecht5_finetuned_voxpopuli_nl

This model is a fine-tuned version of facebook/mms-tts-vie on a dataset loaded with the generic audiofolder loader. It achieves the following results on the evaluation set:

Loss: 4.6235

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 4
eval_batch_size: 2
seed: 42
gradient_accumulation_steps: 8
total_train_batch_size: 32
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 5
training_steps: 100
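
How these values relate to each other can be checked with a short sketch. It assumes a single training device, which this card does not state. It also assumes that the linear scheduler follows the usual rule of linear warmup followed by linear decay to zero. The function names are illustrative and are not taken from the training script.

```python
# Values copied from the hyperparameter list above.
LEARNING_RATE = 1e-05
TRAIN_BATCH_SIZE = 4
GRADIENT_ACCUMULATION_STEPS = 8
WARMUP_STEPS = 5
TRAINING_STEPS = 100

# Assumption: one training device (not stated in this card).
NUM_DEVICES = 1


def effective_batch_size(per_device, accumulation_steps, num_devices):
    """Number of examples that contribute to one optimizer update."""
    return per_device * accumulation_steps * num_devices


def linear_lr(step, base_lr=LEARNING_RATE, warmup=WARMUP_STEPS, total=TRAINING_STEPS):
    """Learning rate after `step` updates: linear warmup, then linear decay to zero."""
    if step < warmup:
        return base_lr * step / max(1, warmup)
    return base_lr * max(0.0, (total - step) / max(1, total - warmup))


total_train_batch_size = effective_batch_size(
    TRAIN_BATCH_SIZE, GRADIENT_ACCUMULATION_STEPS, NUM_DEVICES
)
print("total_train_batch_size:", total_train_batch_size)

# Learning rate at the start, at the end of warmup, mid-run, and at the final step.
for step in (0, 5, 50, 100):
    print(step, linear_lr(step))
```

Under these assumptions the product of the per-device batch size and the accumulation steps matches the reported total_train_batch_size of 32, and the learning rate peaks at 1e-05 after the 5 warmup steps before decaying over the remaining 95 steps.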

Training results

Training Loss	Epoch	Step	Validation Loss
No log	10.0	10	5.4309
No log	20.0	20	5.1375
3.1586	30.0	30	4.9770
3.1586	40.0	40	4.8637
2.9299	50.0	50	4.7869
2.9299	60.0	60	4.7277
2.9299	70.0	70	4.6799
2.8457	80.0	80	4.6510
2.8457	90.0	90	4.6388
2.8227	100.0	100	4.6235
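
The trend in the table can be summarized with a small sketch that uses only the validation losses listed above. The variable names are illustrative, and nothing here is part of the original training code.

```python
# (step, validation loss) pairs copied from the training results table.
validation_loss = [
    (10, 5.4309),
    (20, 5.1375),
    (30, 4.9770),
    (40, 4.8637),
    (50, 4.7869),
    (60, 4.7277),
    (70, 4.6799),
    (80, 4.6510),
    (90, 4.6388),
    (100, 4.6235),
]

first_loss = validation_loss[0][1]
final_loss = validation_loss[-1][1]

# Absolute and relative reduction between the first and last evaluation.
total_drop = first_loss - final_loss
relative_drop = total_drop / first_loss

# Change in validation loss between consecutive evaluations (negative means improvement).
deltas = [
    round(later[1] - earlier[1], 4)
    for earlier, later in zip(validation_loss, validation_loss[1:])
]

print(f"total drop: {total_drop:.4f} ({relative_drop:.1%} of the first value)")
print("per-evaluation change:", deltas)
```

Every consecutive change is negative, so the validation loss decreases at each evaluation, although the later steps improve far less than the early ones.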

Framework versions

Transformers 4.35.2
PyTorch 2.1.0+cu121
Datasets 2.16.1
Tokenizers 0.15.0