khederwaaOne committed on
Commit
a828605
1 Parent(s): 239ab96

End of training

Files changed (2)
  1. README.md +79 -0
  2. generation_config.json +15 -0
README.md ADDED
@@ -0,0 +1,79 @@
+ ---
+ language:
+ - so
+ license: apache-2.0
+ base_model: steja/whisper-small-somali
+ tags:
+ - generated_from_trainer
+ datasets:
+ - google/fleurs
+ metrics:
+ - wer
+ model-index:
+ - name: Whisper Small So - kheder yusuf
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: google/fleurs
+       type: google/fleurs
+       config: so_so
+       split: None
+       args: 'config: so_so, split: test'
+     metrics:
+     - name: Wer
+       type: wer
+       value: 21.001297668382936
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # Whisper Small So - kheder yusuf
+
+ This model is a fine-tuned version of [steja/whisper-small-somali](https://huggingface.co/steja/whisper-small-somali) on the google/fleurs dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.2840
+ - Wer: 21.0013
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 1e-05
+ - train_batch_size: 16
+ - eval_batch_size: 8
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 500
+ - training_steps: 1000
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Wer     |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|
+ | 0.257         | 3.47  | 1000 | 0.2840          | 21.0013 |
+
+
+ ### Framework versions
+
+ - Transformers 4.37.2
+ - Pytorch 2.2.1+cu121
+ - Datasets 2.19.0
+ - Tokenizers 0.15.2
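
The hyperparameters in the README above (learning_rate 1e-05, lr_scheduler_type linear, 500 warmup steps, 1000 training steps) imply that warmup takes up half of the run. The sketch below shows what such a schedule would look like, assuming "linear" means the usual linear warmup followed by linear decay to zero. The function name `linear_warmup_lr` is hypothetical and is not taken from this commit or from any library.

```python
def linear_warmup_lr(step, peak_lr=1e-5, warmup_steps=500, total_steps=1000):
    """Learning rate at a given optimizer step.

    Assumption: linear ramp from 0 to peak_lr over warmup_steps,
    then linear decay from peak_lr to 0 at total_steps.
    Default values are the ones listed in the README.
    """
    if step < warmup_steps:
        # Warmup phase: scale the peak rate by the fraction of warmup completed.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase: scale the peak rate by the fraction of decay steps remaining.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 250, 500, 750, 1000):
        print(step, linear_warmup_lr(step))
```

Under these assumptions the learning rate reaches its peak of 1e-05 only at step 500 and is back at zero when training ends at step 1000.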
generation_config.json ADDED
@@ -0,0 +1,15 @@
+ {
+   "begin_suppress_tokens": [
+     220,
+     50257
+   ],
+   "bos_token_id": 50257,
+   "decoder_start_token_id": 50258,
+   "eos_token_id": 50257,
+   "language": "somali",
+   "max_length": 448,
+   "pad_token_id": 50257,
+   "task": "transcribe",
+   "transformers_version": "4.37.2",
+   "use_cache": false
+ }
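
A few relationships in generation_config.json are easy to miss when reading the raw values: the bos, eos, and pad ids are all the same token, and that token also appears in begin_suppress_tokens, which should keep generation from ending on the first decoded position. The sketch below checks these properties using only the standard library. The JSON is copied from the diff above, and the helper name `summarize` is hypothetical.

```python
import json

# Contents of generation_config.json as added in this commit.
GENERATION_CONFIG = """
{
  "begin_suppress_tokens": [220, 50257],
  "bos_token_id": 50257,
  "decoder_start_token_id": 50258,
  "eos_token_id": 50257,
  "language": "somali",
  "max_length": 448,
  "pad_token_id": 50257,
  "task": "transcribe",
  "transformers_version": "4.37.2",
  "use_cache": false
}
"""


def summarize(config):
    """Report a few structural facts about a generation config dict."""
    special_ids = {config[key] for key in ("bos_token_id", "eos_token_id", "pad_token_id")}
    return {
        # True when bos, eos, and pad all map to one token id.
        "shared_special_id": len(special_ids) == 1,
        # True when the end-of-sequence token is suppressed at the first position.
        "eos_suppressed_at_start": config["eos_token_id"] in config["begin_suppress_tokens"],
        "language": config["language"],
        "task": config["task"],
        "max_length": config["max_length"],
    }


if __name__ == "__main__":
    print(summarize(json.loads(GENERATION_CONFIG)))
```

The same dictionary could be read from the file on disk with `json.load` once the repository is cloned; the inline string is used here only to keep the sketch self-contained.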