benderrodriguez committed
Commit 9ab26e5 • 1 Parent(s): dcd6a69

Benchmark description update
src/about.py CHANGED (+3 -3)
@@ -54,17 +54,17 @@ The following datasets are used in our evaluation:
 "SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion" (Sharoni, O., Shenberg, R., Cooper, E. (2023) SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion. Proc. INTERSPEECH 2023,)
 
 ### [google/fleurs/he](https://huggingface.co/datasets/google/fleurs)
-- **Size**:
+- **Size**: 2 hours (test set of the corpus)
 - **Domain**: Read speech covering common topics and phrases in Hebrew
 - **Source**: Created as part of Google's FLEURS project, designed for multilingual speech tasks and evaluation. Data collected through crowdsourcing from Hebrew speakers.
 
 ### [mozilla-foundation/common_voice_17_0/he](https://huggingface.co/datasets/mozilla-foundation/common_voice_17_0)
-- **Size**:
+- **Size**: 2 hours (validated set of the corpus)
 - **Domain**: Read sentences in Hebrew from various texts.
 - **Source**: Collected through Mozilla's Common Voice initiative, where volunteers contribute recordings and validate other speakers' contributions
 
 ### [imvladikon/hebrew_speech_kan](https://huggingface.co/datasets/imvladikon/hebrew_speech_kan)
-- **Size**: 1.7 hours (validation
+- **Size**: 1.7 hours (validation set of the corpus)
 - **Domain**: Varied content types from the Kan (Israeli Public Broadcasting Corporation) youtube channel
 - **Source**: Published by Vladimir Gurevich. Scraped audio and subtitles data from YouTube channel "כאן" (Kan).
 """