shi-labs/versatile-diffusion

JamesXu committed on Nov 22, 2022

Commit b837d07 · 1 parent: 9b53c6b

Update README.md

Files changed (1)

README.md +10 -1


@@ -1,3 +1,12 @@
-## Versatile Diffusion (v1.0, four-flow)
+---
+license: mit
+tags:
+- vision
+- generation
+datasets:
+- Laion2B-en
+---
+# Versatile Diffusion (v1.0, four-flow)
 We built **Versatile Diffusion (VD), the first unified multi-flow multimodal diffusion framework**, as a step towards **Universal Generative AI**. Versatile Diffusion can natively support image-to-text, image-variation, text-to-image, and text-variation, and can be further extended to other applications such as semantic-style disentanglement, image-text dual-guided generation, latent image-to-text-to-image editing, and more. Future versions will support more modalities such as speech, music, video and 3D.