Use YOLOv10 to detect objects in real-time
Gemini understands audio and video!
Huggingface space for JanusFlow-1.3B
Gaze detection using Moondream
Create videos with FFMPEG + Qwen2.5-Coder