Voice conversion framework based on VITS
Launch a web interface for text generation
Run an image generation application