ukrainian-stt-et

Sleeping

App Files Files Community

ukrainian-stt-et / README.md

Yurii Paniv

Fix dtype handling

1040275 about 1 year ago

preview code

raw

history blame contribute delete

2.04 kB

	---
	title: "Ukrainian Speech-to-Text"
	emoji: 🐌
	colorFrom: blue
	colorTo: yellow
	sdk: gradio
	sdk_version: 3.41.2
	app_file: app.py
	pinned: false
	---

	# 🇺🇦🎤 Voice recognition for Ukrainian language
	This is a repository with aim to apply various speech recognition models on Ukrainian language.

	You can see online demo here: https://huggingface.co/spaces/robinhad/ukrainian-stt.
	Github link: https://github.com/robinhad/voice-recognition-ua.
	Source code is in this repository together with auto-deploy pipeline scripts.


	# 🧮 Models
	Model name \| CER \| WER \| License \| Note
	:-------------------------\|:-------------------------\|:-------------------------\|:-------------------------\|:-------------------------
	[Wav2Vec2](https://github.com/robinhad/voice-recognition-ua/releases/tag/release%2Fwav2vec2-v0.1) \| 6,01% \| 27,99% \| MIT \| Common Voice 8 dataset, `test` set used as validation
	[DeepSpeech with Wiki LM](https://github.com/robinhad/voice-recognition-ua/releases/tag/v0.4) \| 12% \| 30,65% \| CC-BY-NC 4.0 \| Common Voice 6 dataset
	[DeepSpeech](https://github.com/robinhad/voice-recognition-ua/releases/tag/v0.4) \| 16% \| 57% \| CC-BY-NC 4.0 \| Common Voice 6 dataset


	Checkout latest releases here: https://github.com/robinhad/voice-recognition-ua/releases/.

	If you'd like to check out different models for Ukrainian language, please visit https://github.com/egorsmkv/speech-recognition-uk.

	# 🤖 Training scripts
	Guides for training are available in corresponding folders for each model.

	# Support
	If you like my work, please support here: https://send.monobank.ua/jar/48iHq4xAXm

	# 🤝 Attribution
	[@robinhad](https://github.com/robinhad) - model training.
	[@egorsmkv](https://github.com/egorsmkv) - organized [Ukrainian Speech recognition community](https://github.com/egorsmkv/speech-recognition-uk).
	[@tarasfrompir](https://github.com/tarasfrompir) - created synthetic 1200h Ukrainian Speech-to-Text dataset.
	[@AlexeyBoiler](https://github.com/AlexeyBoiler) - hosted Ukrainian Speech-to-Text dataset.