Commit e95fdac
Parent(s): 4c1e31f
Update README.md (#4)
- Update README.md (a266a14be5186f45096af329ffcf1f2f1b05fccd)
Co-authored-by: Apolinário from multimodal AI art <multimodalart@users.noreply.huggingface.co>
README.md
CHANGED
@@ -2,6 +2,21 @@
|
|
2 |
license: mit
|
3 |
pipeline_tag: text-to-image
|
4 |
widget:
|
5 |
- text: "in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro"
|
6 |
parameters:
|
7 |
negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolution, extra fingers, blur, blurry, ugly, wrong proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image, embedding:ac_neg1,"
|
@@ -17,26 +32,28 @@ widget:
|
|
17 |
negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
|
18 |
output:
|
19 |
url: ComfyUI_00284_.jpeg
|
20 |
-
- text: "panther head coming out of smoke, dark, moody, detailed, shadows"
|
21 |
-
output:
|
22 |
-
url: GBvRG6FXcAEOvcG.jpeg
|
23 |
- text: "cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy,"
|
24 |
parameters:
|
25 |
negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
|
26 |
output:
|
27 |
url: ComfyUI_00265_.jpeg
|
28 |
-
- text: ' '
|
29 |
-
output:
|
30 |
-
url: GBvPhMyWoAAp_fT.jpeg
|
31 |
-
- text: ' '
|
32 |
-
output:
|
33 |
-
url: GBvMRyqXMAAX8jj.jpeg
|
34 |
-
- text: "Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi."
|
35 |
-
output:
|
36 |
-
url: ComfyUI_00497_.jpeg
|
37 |
-
- text: ' '
|
38 |
-
output:
|
39 |
-
url: GBuwGoJXUAA89jm.jpeg
|
40 |
---
|
41 |
I'm thrilled to share an update on a recent project of mine. After some dedicated work, I've developed a highly effective text-to-image model. This innovation results from integrating the DPO model from Hugging Face with several advanced counterparts, including Juggernaut7XL, ALBEDOXL, MEARGEHEAVEN, and a model of my own design. The outcome is a unique fusion that showcases exceptional prompt adherence and semantic understanding; it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension. Notably, this model excels in interpreting and adhering to the given prompts, focusing more on semantic accuracy than on ultra-high-fidelity image generation.
|
42 |
-
available on ```https://civitai.com/models/238116/opendalle ```
|
2 |
license: mit
|
3 |
pipeline_tag: text-to-image
|
4 |
widget:
|
5 |
+
- text: "panther head coming out of smoke, dark, moody, detailed, shadows"
|
6 |
+
output:
|
7 |
+
url: GBvRG6FXcAEOvcG.jpeg
|
8 |
+
- text: "Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi."
|
9 |
+
output:
|
10 |
+
url: ComfyUI_00497_.jpeg
|
11 |
+
- text: '-'
|
12 |
+
output:
|
13 |
+
url: GBvPhMyWoAAp_fT.jpeg
|
14 |
+
- text: '-'
|
15 |
+
output:
|
16 |
+
url: GBvMRyqXMAAX8jj.jpeg
|
17 |
+
- text: '-'
|
18 |
+
output:
|
19 |
+
url: GBuwGoJXUAA89jm.jpeg
|
20 |
- text: "in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro"
|
21 |
parameters:
|
22 |
negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolution, extra fingers, blur, blurry, ugly, wrong proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image, embedding:ac_neg1,"
|
|
|
32 |
negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
|
33 |
output:
|
34 |
url: ComfyUI_00284_.jpeg
|
35 |
- text: "cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy,"
|
36 |
parameters:
|
37 |
negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
|
38 |
output:
|
39 |
url: ComfyUI_00265_.jpeg
|
40 |
---
|
41 |
+
|
42 |
+
# OpenDalle
|
43 |
+
|
44 |
+
<Gallery />
|
45 |
+
|
46 |
I'm thrilled to share an update on a recent project of mine. After some dedicated work, I've developed a highly effective text-to-image model. This innovation results from integrating the DPO model from Hugging Face with several advanced counterparts, including Juggernaut7XL, ALBEDOXL, MEARGEHEAVEN, and a model of my own design. The outcome is a unique fusion that showcases exceptional prompt adherence and semantic understanding; it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension. Notably, this model excels in interpreting and adhering to the given prompts, focusing more on semantic accuracy than on ultra-high-fidelity image generation.
|
47 |
+
also available on ```https://civitai.com/models/238116/opendalle ```
|
48 |
+
|
49 |
+
## `*.safetensors` for AUTOMATIC1111, ComfyUI, InvokeAI
|
50 |
+
[Download *.safetensors file](https://huggingface.co/dataautogpt3/OpenDalle/resolve/main/OpenDalle.safetensors?download=true)
|
51 |
+
|
52 |
+
## Use it with 🧨 diffusers
|
53 |
+
```python
|
54 |
+
from diffusers import AutoPipelineForText2Image
|
55 |
+
import torch
|
56 |
+
|
57 |
+
pipeline = AutoPipelineForText2Image.from_pretrained('dataautogpt3/OpenDalle', torch_dtype=torch.float16).to('cuda')
|
58 |
+
image = pipeline('Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi.').images[0]
|
59 |
+
```
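The widget entries in this README pair some prompts with a `negative_prompt`, which the diffusers snippet above does not pass. A minimal sketch of how that could look follows, assuming the checkpoint loads as an SDXL-style pipeline that accepts a `negative_prompt` keyword. The helper names `build_call_kwargs` and `generate` and the output path are hypothetical and not part of the commit.

```python
from typing import Optional

# Repository id taken from the README snippet above.
MODEL_ID = "dataautogpt3/OpenDalle"


def build_call_kwargs(prompt: str, negative_prompt: Optional[str] = None) -> dict:
    """Collect keyword arguments for the pipeline call (hypothetical helper).

    An empty or missing negative prompt is left out entirely.
    """
    kwargs = {"prompt": prompt}
    if negative_prompt:
        kwargs["negative_prompt"] = negative_prompt
    return kwargs


def generate(prompt: str,
             negative_prompt: Optional[str] = None,
             out_path: str = "opendalle.png") -> str:
    """Generate one image and save it to out_path (hypothetical helper).

    Assumes a CUDA device and that torch and diffusers are installed.
    """
    # Imported lazily so build_call_kwargs works without the heavy dependencies.
    import torch
    from diffusers import AutoPipelineForText2Image

    pipeline = AutoPipelineForText2Image.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16
    ).to("cuda")
    image = pipeline(**build_call_kwargs(prompt, negative_prompt)).images[0]
    image.save(out_path)  # the pipeline returns PIL images
    return out_path
```

Calling `generate("panther head coming out of smoke, dark, moody, detailed, shadows", negative_prompt="bad quality, worst quality, low quality")` would then write the result to `opendalle.png`, provided the model downloads and a GPU is available.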
|