metadata

language:
  - 'no'
license: apache-2.0
tags:
  - audio
  - asr
  - automatic-speech-recognition
  - hf-asr-leaderboard
model-index:
  - name: scream_tertius_dropout_replicate_test7a
    results: []

scream_tertius_dropout_replicate_test7a

This model is a fine-tuned version of openai/whisper-small on the NbAiLab/NCC_speech_all_v5 dataset.

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
lr_scheduler_type: linear
per_device_train_batch_size: 32
total_train_batch_size_per_node: 128
total_train_batch_size: 1024
total_optimization_steps: 20,000
starting_optimization_step: None
finishing_optimization_step: 20,000
num_train_dataset_workers: 32
num_hosts: 8
total_num_training_examples: 20,480,000
steps_per_epoch: 1314
num_beams: 5
dropout: True
dropout_probability: 0.1

Training results

step	eval_loss	train_loss	eval_wer	eval_cer
0	1.3582	7.9231	169.1230	127.5435
1000	0.9203	0.9748	24.0256	9.2618
2000	0.9951	0.6747	18.7576	7.4326
3000	1.1073	0.5495	16.7479	7.1000
4000	1.1093	0.4612	14.4336	6.4147
5000	1.1719	0.4326	14.1900	6.2837
6000	1.2627	0.3998	12.8197	5.9814
7000	1.2785	0.3765	12.7893	6.1476
8000	1.1395	0.3869	12.5152	6.0519
9000	1.2327	0.3616	12.7893	6.1829

Framework versions

Transformers 4.29.0.dev0
Datasets 2.12.0
Tokenizers 0.13.3