Update README.md
README.md
CHANGED
@@ -132,6 +132,7 @@ language:
 2. [Training Data and Code](https://huggingface.co/utter-project/mHuBERT-147#training)
 3. [ML-SUPERB Scores](https://huggingface.co/utter-project/mHuBERT-147#ml-superb-scores)
 4. [Languages and Datasets](https://huggingface.co/utter-project/mHuBERT-147#languages-and-datasets)
+5. [Intermediate Checkpoints](https://huggingface.co/utter-project/mHuBERT-147#intermediate-checkpoints)
 6. [Citing and Funding Information](https://huggingface.co/utter-project/mHuBERT-147#citing-and-funding-information)
 
 # mHuBERT-147 models
@@ -186,6 +187,14 @@ See more information in [our paper](https://arxiv.org/pdf/2406.06371).
 
 **Languages present not indexed by Hugging Face:** Asturian (ast), Basaa (bas), Cebuano (ceb), Central Kurdish/Sorani (ckb), Hakha Chin (cnh), Hawaiian (haw), Upper Sorbian (hsb), Kabyle (kab), Moksha (mdf), Meadow Mari (mhr), Hill Mari (mrj), Erzya (myv), Taiwanese Hokkien (nan-tw), Sursilvan (rm-sursilv), Vallader (rm-vallader), Sakha (sah), Santali (sat), Scots (sco), Saraiki (skr), Tigre (tig), Tok Pisin (tpi), Akuapem Twi (tw-akuapem), Asante Twi (tw-asante), Votic (vot), Waray (war), Cantonese (yue).
 
+# Intermediate Checkpoints
+
+To support research on training dynamics, the intermediate checkpoints for the three training iterations are available under the **CC-BY-NC-SA-4.0** license via a password-protected link.
+
+* **Download page:** https://download.europe.naverlabs.com/mhubert147/
+* **User:** user
+* **Password:** the license name shown in bold above
+
 
 # Citing and Funding Information
 
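
As a rough illustration of how the credentials added in this change might be used, the sketch below builds an authenticated request for the download page. It assumes the server uses HTTP Basic authentication and that the password is the license string exactly as printed in bold (`CC-BY-NC-SA-4.0`); the README confirms neither. The helper names `build_request` and `fetch_listing` are hypothetical and exist only for this example.

```python
import base64
import urllib.request

# URL and user name are copied from the README text.
DOWNLOAD_URL = "https://download.europe.naverlabs.com/mhubert147/"
USER = "user"
# Assumption: the password is the license string as it appears in bold.
PASSWORD = "CC-BY-NC-SA-4.0"


def build_request(url: str, user: str, password: str) -> urllib.request.Request:
    """Return a GET request carrying an HTTP Basic Authorization header.

    Assumption: the download server accepts HTTP Basic authentication.
    """
    credentials = f"{user}:{password}".encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    return urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})


def fetch_listing(url: str = DOWNLOAD_URL) -> str:
    """Fetch the download page and return its body as text (needs network access)."""
    request = build_request(url, USER, PASSWORD)
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")
```

Calling `fetch_listing()` would return the page body if these assumptions hold. The file names offered on that page are not stated in the README, so the sketch stops at retrieving the listing.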