Error when loading the tokenizer.
I am using: AutoProcessor.from_pretrained("utter-project/mHuBERT-147")
I receive this error:
OSError: Can't load tokenizer for 'utter-project/mHuBERT-147'. If you were trying to load it from '', make sure you don't have a local directory with the same name. Otherwise, make sure 'utter-project/mHuBERT-147' is the correct path to a directory containing all relevant files for a Wav2Vec2CTCTokenizer tokenizer.
Thanks for the interest in using our model.
There is no tokenizer associated with this release, as mHuBERT-147 is not an ASR model. It is a speech representation model.
You can, however, use it to train an ASR system, if you want. :)
All the best,
Thank you for your answer.
I don't use it as an ASR model.
So my question is:
self.feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained("utter-project/mHuBERT-147")
self.hubert = HubertModel.from_pretrained("utter-project/mHuBERT-147")
Is this the right way to use it?
I'm not sure what you want to do, but the code you sent is correct. It loads the pretrained model correctly!
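For anyone landing here later, this is a minimal sketch of how those two objects could be used together to get frame-level representations. The helper names load_mhubert and encode are made up for illustration, and the 16 kHz sampling rate is an assumption based on the usual HuBERT setup, so please check it against the model card.

```python
import numpy as np
import torch
from transformers import HubertModel, Wav2Vec2FeatureExtractor

MODEL_ID = "utter-project/mHuBERT-147"


def load_mhubert(model_id: str = MODEL_ID):
    """Load the feature extractor and the encoder. No tokenizer is involved."""
    feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained(model_id)
    model = HubertModel.from_pretrained(model_id)
    return feature_extractor, model


def encode(waveform, feature_extractor, model, sampling_rate=16000):
    """Return representations of shape (1, frames, hidden_size).

    `waveform` is a 1-D float array of raw audio samples.
    `sampling_rate=16000` is an assumption; it must match the extractor's rate.
    """
    inputs = feature_extractor(
        waveform, sampling_rate=sampling_rate, return_tensors="pt"
    )
    model.eval()  # inference mode: disables dropout and time masking
    with torch.no_grad():
        outputs = model(inputs.input_values)
    return outputs.last_hidden_state
```

Calling load_mhubert() once and then encode(waveform, feature_extractor, model) on each audio array should be enough for feature extraction. AutoProcessor is not needed here, since it also tries to load a tokenizer, which this release does not ship.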
I think they were trying to follow the generic Hubert class documentation: