metadata
language:
  - 'no'
license: apache-2.0
tags:
  - audio
  - asr
  - automatic-speech-recognition
  - hf-asr-leaderboard
model-index:
  - name: scream_tertius_dropout_replicate_test7a
    results: []

scream_tertius_dropout_replicate_test7a

This model is a fine-tuned version of openai/whisper-small on the NbAiLab/NCC_speech_all_v5 dataset.

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • lr_scheduler_type: linear
  • per_device_train_batch_size: 32
  • total_train_batch_size_per_node: 128
  • total_train_batch_size: 1024
  • total_optimization_steps: 20,000
  • starting_optimization_step: None
  • finishing_optimization_step: 20,000
  • num_train_dataset_workers: 32
  • num_hosts: 8
  • total_num_training_examples: 20,480,000
  • steps_per_epoch: 1314
  • num_beams: 5
  • dropout: True
  • dropout_probability: 0.1
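The batch-size and step figures above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that a "node" and a "host" refer to the same machine. The derived values (devices per host, approximate examples per epoch, approximate number of epochs) are inferred from the listed numbers and are not stated in the card.

```python
# Values copied from the hyperparameter list above.
per_device_train_batch_size = 32
total_train_batch_size_per_node = 128
total_train_batch_size = 1024
total_optimization_steps = 20_000
num_hosts = 8
steps_per_epoch = 1314
total_num_training_examples = 20_480_000

# Derived quantities (inferred, assuming one node == one host).
devices_per_host = total_train_batch_size_per_node // per_device_train_batch_size
global_batch = total_train_batch_size_per_node * num_hosts
examples_seen = total_train_batch_size * total_optimization_steps

# Rough dataset size and epoch count implied by steps_per_epoch.
approx_examples_per_epoch = steps_per_epoch * total_train_batch_size
approx_epochs = total_optimization_steps / steps_per_epoch

print(devices_per_host)           # 4
print(global_batch)               # 1024
print(examples_seen)              # 20480000
print(approx_examples_per_epoch)  # 1345536
print(round(approx_epochs, 2))    # 15.22
```

Under these assumptions, the full 20,000-step run would pass over the training data roughly 15 times.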

Training results

step   eval_loss   train_loss   eval_wer   eval_cer
0      1.3582      7.9231       169.1230   127.5435
1000   0.9203      0.9748       24.0256    9.2618
2000   0.9951      0.6747       18.7576    7.4326
3000   1.1073      0.5495       16.7479    7.1000
4000   1.1093      0.4612       14.4336    6.4147
5000   1.1719      0.4326       14.1900    6.2837
6000   1.2627      0.3998       12.8197    5.9814
7000   1.2785      0.3765       12.7893    6.1476
8000   1.1395      0.3869       12.5152    6.0519
9000   1.2327      0.3616       12.7893    6.1829
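To see which logged checkpoint scores best on each metric, the table can be scanned with a short script. This is a minimal sketch: the rows are copied from the table above, and the `Row` type and variable names are illustrative only. It does not reflect how the training code itself selects checkpoints.

```python
from collections import namedtuple

# One record per evaluation step, copied from the results table.
Row = namedtuple("Row", "step eval_loss train_loss eval_wer eval_cer")

rows = [
    Row(0, 1.3582, 7.9231, 169.1230, 127.5435),
    Row(1000, 0.9203, 0.9748, 24.0256, 9.2618),
    Row(2000, 0.9951, 0.6747, 18.7576, 7.4326),
    Row(3000, 1.1073, 0.5495, 16.7479, 7.1000),
    Row(4000, 1.1093, 0.4612, 14.4336, 6.4147),
    Row(5000, 1.1719, 0.4326, 14.1900, 6.2837),
    Row(6000, 1.2627, 0.3998, 12.8197, 5.9814),
    Row(7000, 1.2785, 0.3765, 12.7893, 6.1476),
    Row(8000, 1.1395, 0.3869, 12.5152, 6.0519),
    Row(9000, 1.2327, 0.3616, 12.7893, 6.1829),
]

# Lower is better for all three evaluation metrics.
best_wer = min(rows, key=lambda r: r.eval_wer)
best_cer = min(rows, key=lambda r: r.eval_cer)
best_loss = min(rows, key=lambda r: r.eval_loss)

print(best_wer.step, best_wer.eval_wer)     # 8000 12.5152
print(best_cer.step, best_cer.eval_cer)     # 6000 5.9814
print(best_loss.step, best_loss.eval_loss)  # 1000 0.9203
```

The three metrics peak at different steps: eval_loss is lowest at step 1000, while eval_wer and eval_cer keep improving well beyond that point. For this run, eval_loss alone would therefore be a poor guide to choosing a checkpoint.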

Framework versions

  • Transformers 4.29.0.dev0
  • Datasets 2.12.0
  • Tokenizers 0.13.3