hedronstone committed
Commit 77ad35d • 1 Parent(s): 307e94d
Update README.md
README.md CHANGED
@@ -10,11 +10,23 @@ datasets:
 metrics:
 - WER
 ---
+## Model
+* Name: Whisper Large-v2 Swahili
+* Description: Fine-tuned Whisper weights for speech-to-text task.
+* Dataset:
+- Train and validation splits for Swahili subsets of [Common Voice 11.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0).
+- Train, validation and test splits for Swahili subsets of [Google Fleurs](https://huggingface.co/datasets/google/fleurs/).
+* Performance: **19.887087 WER**
 
+## Weights
+* Date of release: 12.09.2022
+* Size:
+* License: MIT
+
+## Usage
+To use these weights in HuggingFace's `transformers` library, you can do the following:
+```python
+from transformers import WhisperForConditionalGeneration
 Trained on:
 * _"train+validation"_ Swahili subsets of [Common Voice 11.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0)
 * _"train+validation+test"_ Swahili subsets of [Google Fleurs](https://huggingface.co/datasets/google/fleurs/).
-
-|GPU|Train Epochs|Training Loss|Validation Loss|WER|Train Time| Train Batch Size | Validation Batch Size
-|---|----------------|---------------|-------------|----------|---|---|---|
-| 1 x A100 | 1000 | 0.082000 |0.352631| 19.887087 | 11.2 hrs | 32 | 8 |
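
The `## Usage` section added in this commit stops after the import line. As a rough sketch of how fine-tuned Whisper weights are usually loaded with `transformers`, the function below may help. The commit does not name a repository id, so `model_id` is a hypothetical parameter the caller must supply, and the function name `transcribe` and the 16 kHz default are illustrative assumptions, not part of the README.

```python
def transcribe(audio_array, model_id, sampling_rate=16000):
    """Transcribe one mono waveform with a fine-tuned Whisper checkpoint.

    `model_id` is a placeholder: the commit does not state the repository id.
    `audio_array` is assumed to be a 1-D float array sampled at `sampling_rate`.
    """
    # Imported lazily so this module loads even where transformers is absent.
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    # The processor bundles the feature extractor and the tokenizer.
    processor = WhisperProcessor.from_pretrained(model_id)
    model = WhisperForConditionalGeneration.from_pretrained(model_id)

    # Convert the raw waveform into log-Mel input features.
    inputs = processor(audio_array, sampling_rate=sampling_rate, return_tensors="pt")

    # Generate token ids, then decode them to text without special tokens.
    predicted_ids = model.generate(inputs.input_features)
    return processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
```

Calling `transcribe(waveform, "<your-model-id>")` would download the checkpoint on first use, so it needs network access and enough memory for a large-v2 sized model.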
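
The README reports performance as a single WER figure (19.887087). For readers unfamiliar with the metric, here is a minimal pure-Python sketch of word error rate for one reference/hypothesis pair: the word-level edit distance divided by the number of reference words. It assumes the reported figure is a percentage, which the commit does not state, and it splits on whitespace without any text normalisation. The function name `word_error_rate` is illustrative and is not necessarily how the author computed the score.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage for a single reference/hypothesis pair."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    # Assumption: scale to a percentage to match the style of the reported value.
    return 100.0 * prev[-1] / len(ref)
```

Corpus-level WER is normally computed by summing errors and reference lengths over all utterances rather than averaging per-utterance scores, so this helper is only a per-sentence illustration.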