Update README.md
README.md
CHANGED
@@ -45,6 +45,29 @@ To train, fine-tune or play with the model you will need to install [NVIDIA NeMo
```
pip install nemo_toolkit['all']
```

### Automatically instantiate the model

```python
import nemo.collections.asr as nemo_asr
asr_model = nemo_asr.models.EncDecCTCModel.from_pretrained("nvidia/stt_zh_citrinet_1024_gamma_0_25")
```

### Transcribing using Python
First, get an audio sample of spoken Mandarin Chinese in the format described under Input.

Then transcribe it:
```python
asr_model.transcribe(['<Path of audio file(s)>'])
```
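
Depending on the installed NeMo version, `transcribe` may return plain strings or hypothesis objects that expose a `text` attribute. That is an assumption to check against your version, not something this README states. The hypothetical helper `to_texts` below is a minimal sketch that handles both shapes:

```python
from typing import Any, Iterable, List


def to_texts(results: Iterable[Any]) -> List[str]:
    """Return plain transcripts from a transcribe() result list.

    Assumption: each item is either a str or an object with a `.text`
    attribute; anything else is converted with str().
    """
    texts = []
    for item in results:
        if isinstance(item, str):
            texts.append(item)
        else:
            texts.append(str(getattr(item, "text", item)))
    return texts


# Hypothetical usage, with asr_model loaded as in the instantiation step:
# transcripts = to_texts(asr_model.transcribe(["sample.wav"]))
```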

### Transcribing many audio files

```shell
python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py \
  pretrained_name="nvidia/stt_zh_citrinet_1024_gamma_0_25" \
  audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
```
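
If you would rather stay in Python, a similar batch run can be approximated by collecting the file paths yourself and passing them to `transcribe`. The sketch below is not what `transcribe_speech.py` does internally; `list_wav_files` is a hypothetical helper, and it assumes the audio files sit directly in one directory with a `.wav` extension:

```python
from pathlib import Path
from typing import List


def list_wav_files(audio_dir: str) -> List[str]:
    """Return sorted paths of .wav files directly inside audio_dir (non-recursive)."""
    return sorted(
        str(p)
        for p in Path(audio_dir).iterdir()
        if p.is_file() and p.suffix.lower() == ".wav"
    )


# Hypothetical usage, with asr_model loaded as in the instantiation step:
# asr_model.transcribe(list_wav_files("<DIRECTORY CONTAINING AUDIO FILES>"))
```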

### Input

This model accepts 16 kHz (16,000 Hz) mono-channel audio (WAV files) as input.
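
To check a file against this requirement before transcribing, you can read the WAV header with Python's standard `wave` module. This is a minimal sketch: `check_wav` is a hypothetical helper, and `wave` only understands PCM WAV files, so other encodings raise `wave.Error` instead of returning `False`:

```python
import wave


def check_wav(path: str, expected_rate: int = 16000, expected_channels: int = 1) -> bool:
    """Return True if the WAV header reports the expected sample rate and channel count."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == expected_rate
            and wav.getnchannels() == expected_channels
        )
```

Files that fail the check need to be resampled or downmixed with an audio tool of your choice before being passed to the model.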