YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Khmer Automatic Speech Recognition
Installation
Install from PyPI
pip install sdab
Install from source
# clone repo
git clone https://github.com/MetythornPenn/sdab.git
# install lib from source
pip install -e .
Usage
Download sample audio
wget -O audio.wav https://github.com/MetythornPenn/sdab/blob/main/sample/audio.wav
Python API
from sdab import Sdab
file_path = "audio.wav"
model_name = "metythorn/khmer-asr-openslr" # or local directory path
sdab = Sdab( file_path = file_path, model_name = model_name)
print(sdab.result)
# result : ααααΆαααααα
αααα’αααααΎααα
ααααΈαααααΆααααΆααααααΆαααααααα»αααααααΆααΆα’αΆα
ααααα»ααΆ
file_path
: path of audio filemodel_name
: pretrain model path fromhuggingface
orlocal
device
: should becpu
orcuda
but I usecpu
by defaulttokenized
: show[PAD]
in output,False
by defaultreturn
: Khmer text from ASR
Reference
- Inspired by Techcast
- Khmer word segmentation from SeangHay khmercut | khmersegment
- Wav2Vec2 from Facebook Wav2Vec2
license: apache-2.0 datasets: - openslr/openslr language: - km tags: - asr - khmer asr - khmer speech to text - speech to text
- Downloads last month
- 167
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.