Update README.md
README.md CHANGED

@@ -6,7 +6,26 @@ tags:
 - speech-recognition
 ---
 
-
+Example of how to use with WhisperX (https://github.com/m-bain/whisperX)
+
+```python
+import whisperx
+
+device = "cuda"
+audio_file = "oma_nauhoitus_16Khz.wav"
+batch_size = 16  # reduce if low on GPU mem
+compute_type = "float16"  # change to "int8" if low on GPU mem (may reduce accuracy)
+
+# 1. Transcribe with original whisper (batched)
+model = whisperx.load_model("Finnish-NLP/whisper-large-finnish-v3-ct2", device, compute_type=compute_type)
+
+audio = whisperx.load_audio(audio_file)
+result = model.transcribe(audio, batch_size=batch_size)
+print(result["segments"])  # before alignment
+```
+
+
+How to use in Python with faster-whisper (https://github.com/SYSTRAN/faster-whisper)
 ```python
 import faster_whisper
 model = faster_whisper.WhisperModel("Finnish-NLP/whisper-large-finnish-v3-ct2")
@@ -17,4 +36,5 @@ segments, info = model.transcribe(audio_path, word_timestamps=True, beam_size=5,
 for segment in segments:
     for word in segment.words:
         print("[%.2fs -> %.2fs] %s" % (word.start, word.end, word.word))
-```
+```
+
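The faster-whisper `transcribe` call shown in the second hunk's context line is truncated after `beam_size=5,`. For reference, a minimal end-to-end sketch of that snippet, assuming the remaining argument is just a language hint (`language="fi"`) and that a CUDA device with float16 support is available:

```python
import faster_whisper

# Model repo from the diff; device and compute_type are assumptions for a typical GPU setup.
model = faster_whisper.WhisperModel(
    "Finnish-NLP/whisper-large-finnish-v3-ct2",
    device="cuda",
    compute_type="float16",
)

audio_path = "oma_nauhoitus_16Khz.wav"  # placeholder audio file, as in the WhisperX snippet

# The call in the diff is cut off after beam_size=5; language="fi" is an assumed extra argument.
segments, info = model.transcribe(
    audio_path,
    word_timestamps=True,
    beam_size=5,
    language="fi",
)

print("Detected language '%s' with probability %.2f" % (info.language, info.language_probability))

# segments is a generator; each word carries start/end times because word_timestamps=True.
for segment in segments:
    for word in segment.words:
        print("[%.2fs -> %.2fs] %s" % (word.start, word.end, word.word))
```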
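The WhisperX example added in the first hunk stops at the batched transcription and prints the segments "before alignment". A sketch of the follow-up alignment step that produces word-level timestamps, assuming the installed whisperX version resolves a default Finnish alignment model (otherwise one would be passed explicitly via `model_name=`):

```python
import whisperx

device = "cuda"
audio_file = "oma_nauhoitus_16Khz.wav"  # placeholder, as in the README snippet
batch_size = 16
compute_type = "float16"

# 1. Batched transcription, as in the README snippet
model = whisperx.load_model("Finnish-NLP/whisper-large-finnish-v3-ct2", device, compute_type=compute_type)
audio = whisperx.load_audio(audio_file)
result = model.transcribe(audio, batch_size=batch_size)

# 2. Align the transcript to obtain word-level timestamps.
# Assumption: a default Finnish alignment model is available in this whisperX version;
# if not, pass a wav2vec2 checkpoint via model_name=... to load_align_model.
model_a, metadata = whisperx.load_align_model(language_code=result["language"], device=device)
result = whisperx.align(result["segments"], model_a, metadata, audio, device,
                        return_char_alignments=False)

print(result["segments"])  # after alignment: segments now include per-word timings
```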