LLaDA
Large Language Diffusion Models
Unified Framework for Generalized Video Face Restoration
Scalable and Versatile 3D Generation from Images
Find similar images from a collection
Detect and annotate poses in images and videos
FitDiT is a high-fidelity virtual try-on model.
Upgraded to v1.0!
Gaze detection using Moondream
Extract clothing from images using a mask
Create audio from videos or text prompts
Generate images with Switti
Colorize animation sketch sequences
Execute custom code from environment variables
Execute commands from the environment
A demo of Indic Parler-TTS
Generate anime-style multi-view images from text
Create top-quality 3D (.GLB) models from text or images
Optical illusions and style transfer with FLUX