metadata
language:
  - 'no'
license: apache-2.0
tags:
  - audio
  - asr
  - automatic-speech-recognition
  - hf-asr-leaderboard
model-index:
  - name: nb-whisper-small-publicbeta-25k
    results: []

nb-whisper-small-publicbeta-25k

This model is a fine-tuned version of openai/whisper-small on the NbAiLab/ncc_speech2 dataset.
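As a rough illustration of how a fine-tuned Whisper checkpoint like this one is typically loaded, here is a minimal sketch using the Transformers `pipeline` API. The repository id `NbAiLab/nb-whisper-small-publicbeta-25k` is an assumption (the card gives the model name but not where it is hosted), and the helper names are hypothetical. The language code `no` comes from the card metadata and `num_beams=5` from the hyperparameter list; whether the exported generation config accepts these arguments unchanged is not confirmed by the card.

```python
# Hypothetical repo id: the card names the model, not the hosting organisation.
MODEL_ID = "NbAiLab/nb-whisper-small-publicbeta-25k"


def build_generate_kwargs(num_beams: int = 5) -> dict:
    """Generation options: Norwegian transcription with beam search.

    num_beams=5 mirrors the value listed in the card; it is an assumption
    that the same setting is appropriate at inference time.
    """
    return {"task": "transcribe", "language": "no", "num_beams": num_beams}


def transcribe(audio_path: str) -> str:
    """Transcribe one audio file and return the recognised text."""
    # Imported lazily so the helpers above can be used without transformers.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    result = asr(audio_path, generate_kwargs=build_generate_kwargs())
    return result["text"]
```

Calling `transcribe("sample.wav")` downloads the checkpoint on first use, so it needs network access and a local audio file; `sample.wav` is a placeholder name.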

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • lr_scheduler_type: linear
  • per_device_train_batch_size: 32
  • total_train_batch_size_per_node: 128
  • total_train_batch_size: 1024
  • total_optimization_steps: 25,000
  • starting_optimization_step: None
  • finishing_optimization_step: 25,000
  • num_train_dataset_workers: 32
  • num_hosts: 8
  • total_num_training_examples: 25,600,000
  • steps_per_epoch: 7313
  • num_beams: 5
  • weight_decay: 0.01
  • adam_beta1: 0.9
  • adam_beta2: 0.98
  • adam_epsilon: 1e-06
  • dropout: True
  • bpe_dropout_probability: 0.1
  • activation_dropout_probability: 0.1
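The batch-size and step figures above are mutually consistent, which the short sketch below checks. Two things in it are assumptions rather than statements from the card: that there is no gradient accumulation (so the per-node batch is simply devices times per-device batch), and that the linear schedule decays from the peak learning rate to zero with no warmup, since no warmup setting is listed.

```python
# Values copied from the hyperparameter list.
per_device_train_batch_size = 32
total_train_batch_size_per_node = 128
num_hosts = 8
total_optimization_steps = 25_000
steps_per_epoch = 7313
learning_rate = 5e-05

# Inferred, not stated in the card: 128 / 32 = 4 devices per node.
devices_per_node = total_train_batch_size_per_node // per_device_train_batch_size

# 128 examples per node * 8 hosts = 1024 examples per optimization step.
total_train_batch_size = total_train_batch_size_per_node * num_hosts

# 1024 examples per step * 25,000 steps = 25,600,000 examples seen.
total_num_training_examples = total_train_batch_size * total_optimization_steps

# Approximate passes over the data, assuming steps_per_epoch counts
# optimization steps at the full batch size.
approx_epochs = total_optimization_steps / steps_per_epoch


def linear_lr(step: int) -> float:
    """Linear decay from the peak rate to 0 (ASSUMPTION: no warmup)."""
    remaining = max(0.0, 1.0 - step / total_optimization_steps)
    return learning_rate * remaining


print(devices_per_node, total_train_batch_size, total_num_training_examples)
print(round(approx_epochs, 2))
```

Under these assumptions the run covers roughly 3.4 epochs of the training set.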

Training results

| step | validation_fleurs_loss | train_loss | validation_fleurs_wer | validation_fleurs_cer | validation_fleurs_exact_wer | validation_fleurs_exact_cer | validation_stortinget_loss | validation_stortinget_wer | validation_stortinget_cer | validation_stortinget_exact_wer | validation_stortinget_exact_cer |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.2013 | 3.1115 | 218.8876 | 174.4279 | 388.7694 | 278.9901 | 1.4191 | 71.3727 | 46.4810 | 76.7531 | 49.0057 |
| 1000 | 0.5627 | 1.1938 | 16.3593 | 6.2586 | 20.0717 | 7.2820 | 0.4640 | 20.7725 | 11.8840 | 24.4401 | 12.5992 |
| 2000 | 0.3961 | 0.9944 | 11.7192 | 4.0146 | 15.4719 | 4.9384 | 0.3737 | 16.5674 | 10.1748 | 20.0976 | 10.8109 |
| 3000 | 0.3696 | 0.9185 | 10.8269 | 4.1576 | 14.7551 | 5.1220 | 0.3426 | 14.9167 | 9.5103 | 18.3471 | 10.1061 |
| 4000 | 0.3467 | 0.8298 | 9.7858 | 4.2513 | 13.6201 | 5.1558 | 0.3251 | 14.3438 | 9.2267 | 17.7666 | 9.8219 |
| 5000 | 0.3266 | 0.8400 | 10.0833 | 4.2711 | 13.8889 | 5.2138 | 0.3110 | 13.9022 | 9.1039 | 17.2299 | 9.6697 |
| 6000 | 0.3280 | 0.7875 | 8.7745 | 3.3636 | 12.6344 | 4.3295 | 0.3058 | 13.5598 | 8.8853 | 16.9561 | 9.4543 |
| 7000 | 0.3177 | 0.7937 | 8.5961 | 3.7581 | 12.7539 | 4.6775 | 0.2991 | 13.1425 | 8.6226 | 16.4905 | 9.1878 |
| 8000 | 0.3383 | 0.7872 | 8.8935 | 3.8666 | 12.9630 | 4.7934 | 0.2917 | 13.0831 | 8.6552 | 16.4486 | 9.2255 |
| 9000 | 0.3320 | 0.7526 | 9.1612 | 4.0738 | 13.0526 | 5.0495 | 0.2899 | 12.8380 | 8.4996 | 16.1350 | 9.0495 |
| 10000 | 0.3267 | 0.7547 | 9.5181 | 4.1280 | 13.3513 | 5.1462 | 0.2894 | 12.7106 | 8.4593 | 16.0502 | 9.0189 |
| 11000 | 0.3358 | 0.7120 | 9.0125 | 4.1379 | 13.4409 | 5.1703 | 0.2889 | 12.8828 | 8.5885 | 16.1915 | 9.1459 |
| 12000 | 0.3179 | 0.7387 | 9.1910 | 4.2563 | 13.5006 | 5.2331 | 0.2825 | 12.6795 | 8.4383 | 16.0152 | 8.9950 |
| 13000 | 0.3152 | 0.7295 | 8.7448 | 4.0541 | 12.7539 | 4.9529 | 0.2832 | 12.5267 | 8.4567 | 15.8700 | 9.0105 |
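To make the table above easier to read at a glance, the sketch below copies two of its columns (`validation_fleurs_wer` and `validation_stortinget_wer`) and reports the logged step with the lowest word error rate on each set. The helper name `best_step` is hypothetical, and the result only reflects the checkpoints logged here (steps 0 to 13000), not the full 25,000-step run.

```python
# (step, validation_fleurs_wer, validation_stortinget_wer), copied from the table.
ROWS = [
    (0, 218.8876, 71.3727),
    (1000, 16.3593, 20.7725),
    (2000, 11.7192, 16.5674),
    (3000, 10.8269, 14.9167),
    (4000, 9.7858, 14.3438),
    (5000, 10.0833, 13.9022),
    (6000, 8.7745, 13.5598),
    (7000, 8.5961, 13.1425),
    (8000, 8.8935, 13.0831),
    (9000, 9.1612, 12.8380),
    (10000, 9.5181, 12.7106),
    (11000, 9.0125, 12.8828),
    (12000, 9.1910, 12.6795),
    (13000, 8.7448, 12.5267),
]


def best_step(rows, column):
    """Return (step, value) for the row with the lowest value in `column`."""
    row = min(rows, key=lambda r: r[column])
    return row[0], row[column]


fleurs_best = best_step(ROWS, 1)      # column 1: validation_fleurs_wer
stortinget_best = best_step(ROWS, 2)  # column 2: validation_stortinget_wer

print("FLEURS:", fleurs_best)
print("Stortinget:", stortinget_best)
```

Among the logged steps, the FLEURS WER is lowest at step 7000 while the Stortinget WER is still improving at step 13000, so the preferred checkpoint depends on which validation set matters more for a given use.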

Framework versions

  • Transformers 4.31.0.dev0
  • Datasets 2.13.0
  • Tokenizers 0.13.3