NbAiLabArchive
/

scream_tertius_dropout_replicate_test7a

Automatic Speech Recognition

hf-asr-leaderboard

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

scream_tertius_dropout_replicate_test7a / README.md

pere's picture

Saving weights and logs of step 17000 - epoch 12

c55c158 over 1 year ago

|

2.53 kB

	---
	language:
	- 'no'
	license: apache-2.0
	tags:
	- audio
	- asr
	- automatic-speech-recognition
	- hf-asr-leaderboard
	model-index:
	- name: scream_tertius_dropout_replicate_test7a
	results: []
	---

	<!-- This model card has been generated automatically according to the information Keras had access to. You should
	probably proofread and complete it, then remove this comment. -->

	# scream_tertius_dropout_replicate_test7a

	This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the NbAiLab/NCC_speech_all_v5 dataset.

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 2e-05
	- lr_scheduler_type: linear
	- per_device_train_batch_size: 32
	- total_train_batch_size_per_node: 128
	- total_train_batch_size: 1024
	- total_optimization_steps: 20,000
	- starting_optimization_step: None
	- finishing_optimization_step: 20,000
	- num_train_dataset_workers: 32
	- num_hosts: 8
	- total_num_training_examples: 20,480,000
	- steps_per_epoch: 1314
	- num_beams: 5
	- dropout: True
	- dropout_probability: 0.1

	### Training results

	\| step \| eval_loss \| train_loss \| eval_wer \| eval_cer \|
	\|:-----:\|:---------:\|:----------:\|:--------:\|:--------:\|
	\| 0 \| 1.3582 \| 7.9231 \| 169.1230 \| 127.5435 \|
	\| 1000 \| 0.9203 \| 0.9748 \| 24.0256 \| 9.2618 \|
	\| 2000 \| 0.9951 \| 0.6747 \| 18.7576 \| 7.4326 \|
	\| 3000 \| 1.1073 \| 0.5495 \| 16.7479 \| 7.1000 \|
	\| 4000 \| 1.1093 \| 0.4612 \| 14.4336 \| 6.4147 \|
	\| 5000 \| 1.1719 \| 0.4326 \| 14.1900 \| 6.2837 \|
	\| 6000 \| 1.2627 \| 0.3998 \| 12.8197 \| 5.9814 \|
	\| 7000 \| 1.2785 \| 0.3765 \| 12.7893 \| 6.1476 \|
	\| 8000 \| 1.1395 \| 0.3869 \| 12.5152 \| 6.0519 \|
	\| 9000 \| 1.2327 \| 0.3616 \| 12.7893 \| 6.1829 \|
	\| 10000 \| 1.0855 \| 0.3620 \| 11.4495 \| 5.6790 \|
	\| 11000 \| 1.1018 \| 0.3453 \| 11.7540 \| 5.7848 \|
	\| 12000 \| 0.9953 \| 0.3486 \| 11.7235 \| 5.7294 \|
	\| 13000 \| 1.1321 \| 0.3365 \| 12.0280 \| 6.0015 \|
	\| 14000 \| 1.2654 \| 0.3335 \| 11.6322 \| 5.8050 \|
	\| 15000 \| 1.2149 \| 0.3061 \| 11.8453 \| 5.8503 \|
	\| 16000 \| 1.1539 \| 0.3090 \| 11.9367 \| 5.8503 \|
	\| 17000 \| 1.2530 \| 0.3103 \| 11.7540 \| 5.8251 \|


	### Framework versions

	- Transformers 4.29.0.dev0
	- Datasets 2.12.0
	- Tokenizers 0.13.3