dataautogpt3/OpenDalle

Text-to-Image

Diffusers

Safetensors

StableDiffusionXLPipeline

Inference Endpoints


multimodalart (HF staff) committed on Dec 20, 2023

Commit a266a14 • 1 Parent(s): 4c1e31f

Update README.md

README.md +33 -16

@@ -2,6 +2,21 @@
 license: mit
 pipeline_tag: text-to-image
 widget:
   - text: "in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro"
     parameters:
       negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolution, extra fingers, blur, blurry, ugly, wrong proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image, embedding:ac_neg1,"
@@ -17,26 +32,28 @@ widget:
       negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
     output:
       url: ComfyUI_00284_.jpeg
-  - text: "panther head coming out of smoke, dark, moody, detailed, shadows"
-    output:
-      url: GBvRG6FXcAEOvcG.jpeg
   - text: "cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy,"
     parameters:
       negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
     output:
       url: ComfyUI_00265_.jpeg
-  - text: ' '
-    output:
-      url: GBvPhMyWoAAp_fT.jpeg
-  - text: ' '
-    output:
-      url: GBvMRyqXMAAX8jj.jpeg
-  - text: "Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi."
-    output:
-      url: ComfyUI_00497_.jpeg
-  - text: ' '
-    output:
-      url: GBuwGoJXUAA89jm.jpeg
 ---
 I'm thrilled to share an update on a recent project of mine. After some dedicated work, I've developed a highly effective text-to-image model. This innovation results from integrating the DPO model from Hugging Face with several advanced counterparts, including Juggernaut7XL, ALBEDOXL, MEARGEHEAVEN, and a model of my own design. The outcome is a unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension. Notably, this model excels in interpreting and adhering to the given prompts, focusing more on semantic accuracy than on ultra-high-fidelity image generation.
-available on ```https://civitai.com/models/238116/opendalle ```

@@ -2,6 +2,21 @@
 license: mit
 pipeline_tag: text-to-image
 widget:
+  - text: "panther head coming out of smoke, dark, moody, detailed, shadows"
+    output:
+      url: GBvRG6FXcAEOvcG.jpeg
+  - text: "Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi."
+    output:
+      url: ComfyUI_00497_.jpeg
+  - text: '-'
+    output:
+      url: GBvPhMyWoAAp_fT.jpeg
+  - text: '-'
+    output:
+      url: GBvMRyqXMAAX8jj.jpeg
+  - text: '-'
+    output:
+      url: GBuwGoJXUAA89jm.jpeg
   - text: "in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro"
     parameters:
       negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolution, extra fingers, blur, blurry, ugly, wrong proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image, embedding:ac_neg1,"
@@ -17,26 +32,28 @@ widget:
       negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
     output:
       url: ComfyUI_00284_.jpeg
   - text: "cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy,"
     parameters:
       negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
     output:
       url: ComfyUI_00265_.jpeg
 ---
+# OpenDalle
+<Gallery />
 I'm thrilled to share an update on a recent project of mine. After some dedicated work, I've developed a highly effective text-to-image model. This innovation results from integrating the DPO model from Hugging Face with several advanced counterparts, including Juggernaut7XL, ALBEDOXL, MEARGEHEAVEN, and a model of my own design. The outcome is a unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension. Notably, this model excels in interpreting and adhering to the given prompts, focusing more on semantic accuracy than on ultra-high-fidelity image generation.
+also available on ```https://civitai.com/models/238116/opendalle ```
+## `*.safetensors` for AUTOMATIC1111, ComfyUI, InvokeAI
+[Download *.safetensors file](https://huggingface.co/dataautogpt3/OpenDalle/resolve/main/OpenDalle.safetensors?download=true)
+## Use it with 🧨 diffusers
+```python
+from diffusers import AutoPipelineForText2Image
+import torch
+pipeline = AutoPipelineForText2Image.from_pretrained('dataautogpt3/OpenDalle', torch_dtype=torch.float16).to('cuda')
+image = pipeline('Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi.').images[0]
+```
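
The diffusers snippet added in this commit passes only a prompt and never writes the image to disk. As a hedged sketch (not part of the commit), the same pipeline could be wrapped so that it also accepts a negative prompt like the ones in the widget metadata, takes an optional seed, and saves the result. The helper names `build_call_kwargs` and `generate`, the output path, and the default step count and guidance scale are illustrative assumptions, not settings recommended by the model author. A CUDA GPU is assumed, as in the README snippet.

```python
from typing import Any, Dict, Optional

MODEL_ID = "dataautogpt3/OpenDalle"  # repo id taken from the README snippet


def build_call_kwargs(
    prompt: str,
    negative_prompt: Optional[str] = None,
    num_inference_steps: int = 30,   # assumed default, not from the model card
    guidance_scale: float = 7.0,     # assumed default, not from the model card
) -> Dict[str, Any]:
    """Collect the keyword arguments passed to the pipeline call (hypothetical helper)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    kwargs: Dict[str, Any] = {
        "prompt": prompt,
        "num_inference_steps": num_inference_steps,
        "guidance_scale": guidance_scale,
    }
    # Only forward a negative prompt when one was actually supplied.
    if negative_prompt:
        kwargs["negative_prompt"] = negative_prompt
    return kwargs


def generate(
    prompt: str,
    negative_prompt: Optional[str] = None,
    seed: Optional[int] = None,
    output_path: str = "opendalle.png",  # hypothetical output location
) -> str:
    """Load the model, render one image, save it, and return the file path."""
    # Heavy imports are deferred so the helper above works without a GPU stack.
    import torch
    from diffusers import AutoPipelineForText2Image

    pipeline = AutoPipelineForText2Image.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16
    ).to("cuda")

    # A seeded generator makes repeated runs reproducible on the same setup.
    generator = None
    if seed is not None:
        generator = torch.Generator(device="cuda").manual_seed(seed)

    call_kwargs = build_call_kwargs(prompt, negative_prompt)
    image = pipeline(generator=generator, **call_kwargs).images[0]
    image.save(output_path)
    return output_path
```

Calling `generate("panther head coming out of smoke, dark, moody, detailed, shadows", negative_prompt="bad quality, blurry, watermark", seed=0)` would download the weights on first use and write `opendalle.png` to the working directory.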