FitDiT is a high-fidelity virtual try-on model.
Gaze detection using Moondream
Scalable and versatile 3D generation from images
Co-speech gesture video generation