ID Card Recognition
Identify and verify ID documents
Create a video by syncing spoken audio to an image
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate Vietnamese speech from text and reference audio
Audio Conditioned LipSync with Latent Diffusion Models
Optical illusions and style transfer with FLUX