lw2333 commited on
Commit
4e4143d
·
1 Parent(s): a23fcb3

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +84 -0
README.md ADDED
@@ -0,0 +1,84 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - hi
4
+ license: apache-2.0
5
+ base_model: openai/whisper-small
6
+ tags:
7
+ - hf-asr-leaderboard
8
+ - generated_from_trainer
9
+ datasets:
10
+ - mozilla-foundation/common_voice_11_0
11
+ metrics:
12
+ - wer
13
+ model-index:
14
+ - name: Whisper Small Hi - Sanchit Gandhi
15
+ results:
16
+ - task:
17
+ name: Automatic Speech Recognition
18
+ type: automatic-speech-recognition
19
+ dataset:
20
+ name: Common Voice 11.0
21
+ type: mozilla-foundation/common_voice_11_0
22
+ config: hi
23
+ split: test
24
+ args: 'config: hi, split: test'
25
+ metrics:
26
+ - name: Wer
27
+ type: wer
28
+ value: 33.09912807923474
29
+ ---
30
+
31
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
32
+ should probably proofread and complete it, then remove this comment. -->
33
+
34
+ # Whisper Small Hi - Sanchit Gandhi
35
+
36
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
37
+ It achieves the following results on the evaluation set:
38
+ - Loss: 0.4278
39
+ - Wer: 33.0991
40
+
41
+ ## Model description
42
+
43
+ More information needed
44
+
45
+ ## Intended uses & limitations
46
+
47
+ More information needed
48
+
49
+ ## Training and evaluation data
50
+
51
+ More information needed
52
+
53
+ ## Training procedure
54
+
55
+ ### Training hyperparameters
56
+
57
+ The following hyperparameters were used during training:
58
+ - learning_rate: 1e-05
59
+ - train_batch_size: 1
60
+ - eval_batch_size: 8
61
+ - seed: 42
62
+ - gradient_accumulation_steps: 16
63
+ - total_train_batch_size: 16
64
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
65
+ - lr_scheduler_type: linear
66
+ - lr_scheduler_warmup_steps: 500
67
+ - training_steps: 4000
68
+
69
+ ### Training results
70
+
71
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
72
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|
73
+ | 0.0776 | 2.45 | 1000 | 0.3089 | 36.4514 |
74
+ | 0.0207 | 4.89 | 2000 | 0.3399 | 33.1372 |
75
+ | 0.0012 | 7.34 | 3000 | 0.4067 | 33.4081 |
76
+ | 0.0005 | 9.8 | 4000 | 0.4278 | 33.0991 |
77
+
78
+
79
+ ### Framework versions
80
+
81
+ - Transformers 4.31.0
82
+ - Pytorch 2.0.1
83
+ - Datasets 2.14.4
84
+ - Tokenizers 0.13.3