espnet
/

owsm_ctc_v3.1_1B

Automatic Speech Recognition

speech-translation

language-identification

Model card Files Files and versions Community

pyf98 commited on Mar 23, 2024

Commit

437a576

·

verified ·

1 Parent(s): 24122a4

Update README.md

Files changed (1) hide show

README.md +7 -3

README.md CHANGED Viewed

@@ -20,7 +20,8 @@ Currently, the code for OWSM-CTC has not been merged into ESPnet main branch. In
 - Code in my repo: https://github.com/pyf98/espnet/tree/owsm-ctc
 - Current model on HF: https://huggingface.co/pyf98/owsm_ctc_v3.1_1B
-An example script to run short-form ASR/ST:
 ```python
 import soundfile as sf
 import numpy as np
@@ -46,7 +47,8 @@ res = s2t(speech)[0]
 print(res)
 ```
-An example script to run long-form ASR:
 ```python
 import soundfile as sf
 import torch
@@ -77,7 +79,9 @@ if __name__ == "__main__":
     print(text)
 ```
-An example for CTC forced alignment using `ctc-segmentation`. It can be efficiently applied to audio of an arbitrary length.
 For model downloading, please refer to https://github.com/espnet/espnet?tab=readme-ov-file#ctc-segmentation-demo
 ```python

 - Code in my repo: https://github.com/pyf98/espnet/tree/owsm-ctc
 - Current model on HF: https://huggingface.co/pyf98/owsm_ctc_v3.1_1B
+### Example script for short-form ASR/ST
 ```python
 import soundfile as sf
 import numpy as np
 print(res)
 ```
+### Example script for long-form ASR/ST
 ```python
 import soundfile as sf
 import torch
     print(text)
 ```
+### Example for CTC forced alignment using `ctc-segmentation`
+It can be efficiently applied to audio of an arbitrary length.
 For model downloading, please refer to https://github.com/espnet/espnet?tab=readme-ov-file#ctc-segmentation-demo
 ```python