Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,38 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- techiaith/commonvoice_18_0_cy
|
5 |
+
language:
|
6 |
+
- cy
|
7 |
+
base_model:
|
8 |
+
- openai/whisper-base
|
9 |
+
pipeline_tag: automatic-speech-recognition
|
10 |
+
tags:
|
11 |
+
- whispercpp
|
12 |
+
---
|
13 |
+
|
14 |
+
|
15 |
+
# whisper-base-ft-cv-cy-cpp
|
16 |
+
|
17 |
+
This model is a version of the [openai/whisper-base](https://huggingface.co/openai/whisper-base) model, fine-tuned on the
|
18 |
+
[techiaith/commonvoice_18_0_cy](https://huggingface.co/datasets/techiaith/commonvoice_18_0_cy) dataset, and then
|
19 |
+
[converted for use in whisper.cpp](https://github.com/ggerganov/whisper.cpp/tree/master/models#fine-tuned-models). Whispercpp is
|
20 |
+
a C/C++ port of Whisper that provides high performance inference on offline hardware such as desktops, laptops and mobile devices.
|
21 |
+
|
22 |
+
The model is a smaller in size to the corresponding cloud hosted model
|
23 |
+
[techiaith/whisper-large-v3-ft-cv-cy](https://huggingface.co/techiaith/whisper-large-v3-ft-cv-cy).
|
24 |
+
It achieves the following WER results for transcribing:
|
25 |
+
|
26 |
+
- Wer: 42.68
|
27 |
+
- Cer: 14.14
|
28 |
+
|
29 |
+
## Usage
|
30 |
+
|
31 |
+
whispercpp makes it easy to use models in many platforms and applications. See the 'examples' folder
|
32 |
+
in the whispercpp github repo for more information and example code.
|
33 |
+
|
34 |
+
To get quickly started with whispercpp's basic usage however, follow the '[Quick Start](https://github.com/ggerganov/whisper.cpp?tab=readme-ov-file#quick-start)'
|
35 |
+
but download this model with the following command:
|
36 |
+
|
37 |
+
|
38 |
+
`$ wget https://huggingface.co/techiaith/whisper-base-ft-cv-cy-cpp/resolve/main/ggml-model.bin`
|