Sesame CSM
Conversational speech generation
Conversational speech generation
An Agentic Framework with Tools for Complex Reasoning
A leaderboard for LLMs powering smolagents
Image to Compositional 3D Scene Generation
Fast image relighting using Latent Bridge Matching
Enhance image quality by enlarging it without losing details
A demo for exploring and analyzing large-scale model repos
Generate edited images with prompts
Conversational speech generation
Enhance image quality by enlarging it without losing details
Fast image relighting using Latent Bridge Matching
Generate virtual camera views from input images
Scalable and Versatile 3D Generation from images
Wan: Open and Advanced Large-Scale Video Generative Models
Generate edited images with prompts
Try on virtual garments on your uploaded images
Gemini 2.0 native image generation co-doodling
Convert images and text to document formats
MultiImages-to-3D Generation
Execute user-defined code
Send text and get detailed responses
Text-to-3D and Image-to-3D Generation
Embedding Leaderboard
Image to Compositional 3D Scene Generation
The ultimate guide to training LLM on large GPU Clusters
Generate images from text prompts
FLUX Multilingual Text-Driven Image Generation and Editing
Blazingly Fast and Embarrassingly Simple Song Generation
Generate animated videos from images and prompts
Edit and enhance images with custom color and edge modifications
VGGT (CVPR 2025)
Unleashing a limitless torrent of ingenious ideas