metadata

language:
  - fy
base_model: distil-small.en
tags:
  - generated_from_trainer
datasets:
  - mozilla-foundation/common_voice_6_1
metrics:
  - wer
model-index:
  - name: DistilFT-Frisian-1h
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: mozilla-foundation/common_voice_6_fy_NL
          type: mozilla-foundation/common_voice_6_1
          args: 'config: fy-NL, split: train-1h'
        metrics:
          - name: Wer
            type: wer
            value: 54.30048119764748

DistilFT-Frisian-1h

This model is a fine-tuned version of distil-small.en on the mozilla-foundation/common_voice_6_fy_NL dataset (Common Voice 6.1, fy-NL configuration). At the final checkpoint (step 3000), it achieves the following results on the evaluation set:

Loss: 1.9212
WER: 54.3005%
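
For context on the metric: word error rate is the word-level edit distance (substitutions, deletions, insertions) between hypothesis and reference, divided by the number of reference words, reported here as a percentage. The following is a minimal pure-Python sketch, assuming whitespace tokenization and no text normalization. The function name `wer_percent` and the placeholder strings are illustrative only; this is not the evaluation code used for this model.

```python
def wer_percent(reference: str, hypothesis: str) -> float:
    """Word error rate in percent: 100 * edit_distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# Placeholder tokens: one substitution (b -> x) and one deletion (d) over 4 reference words.
print(wer_percent("a b c d", "a x c"))
```

A corpus-level score such as the one reported above is normally obtained by summing errors and reference words over all utterances, not by averaging per-utterance values.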

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.0001
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 200
training_steps: 3000
mixed_precision_training: Native AMP
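
To make the schedule concrete, here is a small sketch of how the learning rate would evolve under these settings. It assumes the usual shape of a linear schedule with warmup: a linear ramp from 0 to the peak rate over the warmup steps, followed by a linear decay to 0 at the final training step. The function name `linear_lr` is illustrative, and the code only mirrors the listed values; it is not taken from the training script.

```python
PEAK_LR = 1e-4       # learning_rate
WARMUP_STEPS = 200   # lr_scheduler_warmup_steps
TOTAL_STEPS = 3000   # training_steps


def linear_lr(step: int) -> float:
    """Learning rate at a given optimizer step under linear warmup and linear decay."""
    if step < WARMUP_STEPS:
        # Warmup: ramp linearly from 0 up to the peak rate.
        return PEAK_LR * step / WARMUP_STEPS
    # Decay: fall linearly from the peak rate to 0 at TOTAL_STEPS.
    remaining = max(0, TOTAL_STEPS - step)
    return PEAK_LR * remaining / (TOTAL_STEPS - WARMUP_STEPS)


# Learning rate at the evaluation steps logged during training.
for step in (0, 100, 200, 500, 1000, 1500, 2000, 2500, 3000):
    print(step, linear_lr(step))
```

Under this assumption the rate peaks at step 200 and is back to half the peak at step 1600, the midpoint of the decay phase.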

Training results

Training Loss	Epoch	Step	Validation Loss	WER (%)
0.2482	5.6180	500	1.8089	66.9720
0.1076	11.2360	1000	1.8466	62.2349
0.0448	16.8539	1500	1.9436	59.3548
0.0062	22.4719	2000	1.8986	56.5960
0.0016	28.0899	2500	1.9025	54.4324
0.0001	33.7079	3000	1.9212	54.3005
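
A few quantities can be derived from the table above. The sketch below estimates steps per epoch from the Step and Epoch columns, turns that into an approximate training-set size, and computes the relative WER reduction between the first and last evaluations. The training-set estimate assumes a single device and no gradient accumulation (neither is listed in the hyperparameters), so it should be read as an upper bound, not a reported figure. All variable names are illustrative.

```python
# (step, epoch, validation_loss, wer) copied from the training results table.
rows = [
    (500, 5.6180, 1.8089, 66.9720),
    (1000, 11.2360, 1.8466, 62.2349),
    (1500, 16.8539, 1.9436, 59.3548),
    (2000, 22.4719, 1.8986, 56.5960),
    (2500, 28.0899, 1.9025, 54.4324),
    (3000, 33.7079, 1.9212, 54.3005),
]
TRAIN_BATCH_SIZE = 8  # train_batch_size from the hyperparameters

# Steps per epoch, inferred from the first row (step / epoch).
steps_per_epoch = round(rows[0][0] / rows[0][1])

# Upper bound on training examples per epoch; the last batch may be partial.
approx_examples = steps_per_epoch * TRAIN_BATCH_SIZE

# Relative WER reduction between the first and last logged evaluations, in percent.
first_wer, last_wer = rows[0][3], rows[-1][3]
relative_wer_reduction = 100.0 * (first_wer - last_wer) / first_wer

# Steps at which validation loss and WER are lowest.
best_loss_step = min(rows, key=lambda r: r[2])[0]
best_wer_step = min(rows, key=lambda r: r[3])[0]

print(steps_per_epoch, approx_examples)
print(round(relative_wer_reduction, 2))
print(best_loss_step, best_wer_step)
```

The lowest validation loss and the lowest WER occur at different steps, while the training loss approaches zero. This pattern is consistent with overfitting in terms of loss on a small training set, so the choice of checkpoint may depend on whether loss or WER is used as the selection criterion.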

Framework versions

Transformers 4.41.0.dev0
PyTorch 2.3.0+cu121
Datasets 2.19.1
Tokenizers 0.19.1