Apolinário from multimodal AI art

multimodalart

Posts 5

The first open Stable Diffusion 3-like architecture model is JUST out 💣 - but it is not SD3! 🤔

It is Tencent-Hunyuan/HunyuanDiT by Tencent, a 1.5B-parameter DiT (diffusion transformer) text-to-image model 🖼️✨, trained with multilingual CLIP + multilingual T5 text encoders for English 🤝 Chinese understanding

Try it out yourself here ▶️ https://huggingface.co/spaces/multimodalart/HunyuanDiT
(it's a bit slow, as the model is chunky and the research code isn't super optimized for inference speed yet)

In the paper, they claim it is SOTA among open-source models based on human preference evaluation!