Update README.md
Browse files
README.md
CHANGED
@@ -152,9 +152,11 @@ It is a "large" versions of Conformer-CTC (around 120M parameters) model.
|
|
152 |
This model can be efficiently (best latency and throughput) deployed with [NVIDIA Riva](https://developer.nvidia.com/riva), a GPU-accelerated speech AI SDK, on-premises, embedded, on the edge or with any cloud provider.
|
153 |
|
154 |
Additionally, with RIVA you get:
|
|
|
|
|
|
|
|
|
155 |
* Streaming speech recognition mode
|
156 |
-
* Ability to boost specific words (e.g. brand and product names)
|
157 |
-
* Conformer checkpoints trained on proprietary data
|
158 |
|
159 |
[Live Riva demo](https://developer.nvidia.com/riva#demos)
|
160 |
|
@@ -189,9 +191,7 @@ asr_model.transcribe(['2086-149220-0033.wav'])
|
|
189 |
### Transcribing many audio files
|
190 |
|
191 |
```shell
|
192 |
-
python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py \
|
193 |
-
pretrained_name="nvidia/stt_en_conformer_ctc_large" \
|
194 |
-
audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
|
195 |
```
|
196 |
|
197 |
### Input
|
|
|
152 |
This model can be efficiently (best latency and throughput) deployed with [NVIDIA Riva](https://developer.nvidia.com/riva), a GPU-accelerated speech AI SDK, on-premises, embedded, on the edge or with any cloud provider.
|
153 |
|
154 |
Additionally, with RIVA you get:
|
155 |
+
* Customization to achieve best WER
|
156 |
+
* Ability to boost specific words (e.g. brand and product names)
|
157 |
+
* External language model, punctuation and captialization
|
158 |
+
* Conformer checkpoints trained on proprietary data
|
159 |
* Streaming speech recognition mode
|
|
|
|
|
160 |
|
161 |
[Live Riva demo](https://developer.nvidia.com/riva#demos)
|
162 |
|
|
|
191 |
### Transcribing many audio files
|
192 |
|
193 |
```shell
|
194 |
+
python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py \n pretrained_name="nvidia/stt_en_conformer_ctc_large" \n audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
|
|
|
|
|
195 |
```
|
196 |
|
197 |
### Input
|