jed351/whisper_medium_cantonese_cm_voice

Automatic Speech Recognition

Generated from Trainer


Share the training code

#1

by keithhon - opened Mar 25, 2023

Mar 25, 2023

Hi, would it be possible to share your training code for this model? I wonder how the resulting CER is so low.

keithhon changed discussion status to closed Mar 25, 2023

jed351

Owner Mar 25, 2023

I literally followed the tutorial from the blog

https://huggingface.co/blog/fine-tune-whisper

keithhon changed discussion status to open Mar 25, 2023

Mar 25, 2023

Did you multiply the WER by 100?

keithhon changed discussion status to closed Mar 25, 2023

jed351

Owner Mar 25, 2023

It was a very preliminary study on the prospect of using Whisper to transcribe YouTube videos for language model usage. I didn't modify anything.

The results weren't up to expectations, so I didn't continue pursuing it.

Mar 25, 2023

I see. The tutorial's WER is 3x% and yours is 0.x%, so maybe the fine-tuned model overfitted too much.
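
For anyone comparing numbers like these, it may help to see how the scale of the metric changes the reading. The sketch below is a plain-Python illustration, not the code used for this model: the function names (`edit_distance`, `error_rate`) and the sample sentences are made up for the example, and stripping spaces before computing CER is an assumption. The tutorial's `compute_metrics` reports WER multiplied by 100, so a value from a script that skips that step would look 100 times smaller.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences of tokens."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level error rate as a fraction: total edits / total reference units.

    unit="word" gives WER (split on whitespace).
    unit="char" gives CER (spaces removed first; this is an assumption).
    """
    if unit == "word":
        split = lambda s: s.split()
    else:
        split = lambda s: list(s.replace(" ", ""))
    edits = sum(edit_distance(split(r), split(h))
                for r, h in zip(references, hypotheses))
    total = sum(len(split(r)) for r in references)
    return edits / total


# Hypothetical example sentences, pre-segmented with spaces for the WER case.
refs = ["我 今日 去 街市 買 餸"]
hyps = ["我 今日 去 超市 買 餸"]

wer_fraction = error_rate(refs, hyps, unit="word")  # value between 0 and 1
wer_percent = 100 * wer_fraction                    # same quantity on a 0-100 scale
cer_fraction = error_rate(refs, hyps, unit="char")

print(f"WER as fraction: {wer_fraction:.4f}")
print(f"WER as percent:  {wer_percent:.2f}")
print(f"CER as fraction: {cer_fraction:.4f}")
```

A score of "0.x" only signals an unusually good model if it is already on the percent scale. If it is a raw fraction, it corresponds to the same tens-of-percent range as the tutorial. Note also that CER and WER are not directly comparable, since a single wrong character can make a whole word wrong.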

jed351

Owner Mar 29, 2023

I found the bash scripts I used. The model was trained for 50,000 steps on 2 GPUs, which is 20 times more than the tutorial. I think the model is overfitted.

Mar 29, 2023

I see… thanks

jed351

Owner Mar 29, 2023

Btw, can you contact me on LinkedIn (link on my HF profile) to discuss some other projects related to Cantonese?

Mar 30, 2023

Sure, invitation sent
