Hi Xenova,
Thank you so much for your work. I had a question. Is it possible to use this model in a mobile app? My idea was to build a mobile app where one of the functionality would be a user would click the microphone button and just speak what they want and then get the transcribed text of what they speak. So I was looking for a way where the entire thing can be run locally. Is your solution something that can be run locally like this.