Commit
·
0c6a3c8
1
Parent(s):
9eced64
Update README.md
Browse files
README.md
CHANGED
@@ -149,16 +149,14 @@ It is a "large" versions of Conformer-CTC (around 120M parameters) model.
|
|
149 |
|
150 |
## NVIDIA Riva: Deployment
|
151 |
|
152 |
-
|
153 |
|
154 |
-
Additionally,
|
155 |
-
*
|
156 |
-
|
157 |
-
|
158 |
-
* Conformer checkpoints trained on proprietary data
|
159 |
-
* Streaming speech recognition mode
|
160 |
|
161 |
-
[
|
162 |
|
163 |
## NVIDIA NeMo: Training
|
164 |
|
@@ -191,7 +189,9 @@ asr_model.transcribe(['2086-149220-0033.wav'])
|
|
191 |
### Transcribing many audio files
|
192 |
|
193 |
```shell
|
194 |
-
python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
|
|
|
|
|
195 |
```
|
196 |
|
197 |
### Input
|
|
|
149 |
|
150 |
## NVIDIA Riva: Deployment
|
151 |
|
152 |
+
For the best real-time accuracy, latency, and throughput, deploy the model with [NVIDIA Riva](https://developer.nvidia.com/riva), an accelerated speech AI SDK deployable on-prem, in all clouds, multi-cloud, hybrid, at the edge, and embedded.
|
153 |
|
154 |
+
Additionally, Riva provides:
|
155 |
+
* World-class out-of-the-box accuracy for the most common languages with model checkpoints trained on proprietary data with hundreds of thousands of GPU-compute hours
|
156 |
+
* Best in class accuracy via customization with run-time word boosting (e.g., brand and product names), acoustic model training, language model training, and inverse text normalization customizations
|
157 |
+
* Streaming speech recognition, Kubernetes compatible scaling, and Enterprise-grade support
|
|
|
|
|
158 |
|
159 |
+
[Check out Riva live demo.](https://developer.nvidia.com/riva#demos)
|
160 |
|
161 |
## NVIDIA NeMo: Training
|
162 |
|
|
|
189 |
### Transcribing many audio files
|
190 |
|
191 |
```shell
|
192 |
+
python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py
|
193 |
+
pretrained_name="nvidia/stt_en_conformer_ctc_large"
|
194 |
+
audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"
|
195 |
```
|
196 |
|
197 |
### Input
|