dataautogpt3 multimodalart HF staff committed on
Commit
e95fdac
1 Parent(s): 4c1e31f

Update README.md (#4)

- Update README.md (a266a14be5186f45096af329ffcf1f2f1b05fccd)


Co-authored-by: Apolinário from multimodal AI art <multimodalart@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +33 -16
README.md CHANGED
@@ -2,6 +2,21 @@
2
  license: mit
3
  pipeline_tag: text-to-image
4
  widget:
5
  - text: "in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro"
6
  parameters:
7
  negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolution, extra fingers, blur, blurry, ugly, wrong proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image, embedding:ac_neg1,"
@@ -17,26 +32,28 @@ widget:
17
  negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
18
  output:
19
  url: ComfyUI_00284_.jpeg
20
- - text: "panther head coming out of smoke, dark, moody, detailed, shadows"
21
- output:
22
- url: GBvRG6FXcAEOvcG.jpeg
23
  - text: "cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy,"
24
  parameters:
25
  negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
26
  output:
27
  url: ComfyUI_00265_.jpeg
28
- - text: ' '
29
- output:
30
- url: GBvPhMyWoAAp_fT.jpeg
31
- - text: ' '
32
- output:
33
- url: GBvMRyqXMAAX8jj.jpeg
34
- - text: "Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi."
35
- output:
36
- url: ComfyUI_00497_.jpeg
37
- - text: ' '
38
- output:
39
- url: GBuwGoJXUAA89jm.jpeg
40
  ---
41
  I'm thrilled to share an update on a recent project of mine. After some dedicated work, I've developed a highly effective text-to-image model. This innovation results from integrating the DPO model from Hugging Face with several advanced counterparts, including Juggernaut7XL, ALBEDOXL, MEARGEHEAVEN, and a model of my own design. The outcome is a unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension. Notably, this model excels in interpreting and adhering to the given prompts, focusing more on semantic accuracy than on ultra-high-fidelity image generation.
42
- available on ```https://civitai.com/models/238116/opendalle ```

2
  license: mit
3
  pipeline_tag: text-to-image
4
  widget:
5
+ - text: "panther head coming out of smoke, dark, moody, detailed, shadows"
6
+ output:
7
+ url: GBvRG6FXcAEOvcG.jpeg
8
+ - text: "Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi."
9
+ output:
10
+ url: ComfyUI_00497_.jpeg
11
+ - text: '-'
12
+ output:
13
+ url: GBvPhMyWoAAp_fT.jpeg
14
+ - text: '-'
15
+ output:
16
+ url: GBvMRyqXMAAX8jj.jpeg
17
+ - text: '-'
18
+ output:
19
+ url: GBuwGoJXUAA89jm.jpeg
20
  - text: "in the style of artgerm, comic style,3D model, mythical seascape, negative space, space quixotic dreams, temporal hallucination, psychedelic, mystical, intricate details, very bright neon colors, (vantablack background:1.5), pointillism, pareidolia, melting, symbolism, very high contrast, chiaroscuro"
21
  parameters:
22
  negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolution, extra fingers, blur, blurry, ugly, wrong proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image, embedding:ac_neg1,"
 
32
  negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
33
  output:
34
  url: ComfyUI_00284_.jpeg
35
  - text: "cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An Oscar winning movie for Best Cinematography a woman in a kimono standing on a subway train in Japan Kodak Motion Picture Film Style, shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy,"
36
  parameters:
37
  negative_prompt: "bad quality, bad anatomy, worst quality, low quality, low resolutions, extra fingers, blur, blurry, ugly, wrongs proportions, watermark, image artifacts, lowres, ugly, jpeg artifacts, deformed, noisy image"
38
  output:
39
  url: ComfyUI_00265_.jpeg
40
  ---
41
+
42
+ # OpenDalle
43
+
44
+ <Gallery />
45
+
46
  I'm thrilled to share an update on a recent project of mine. After some dedicated work, I've developed a highly effective text-to-image model. This innovation results from integrating the DPO model from Hugging Face with several advanced counterparts, including Juggernaut7XL, ALBEDOXL, MEARGEHEAVEN, and a model of my own design. The outcome is a unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension. Notably, this model excels in interpreting and adhering to the given prompts, focusing more on semantic accuracy than on ultra-high-fidelity image generation.
47
+ also available on ```https://civitai.com/models/238116/opendalle ```
48
+
49
+ ## `*.safetensors` for AUTOMATIC1111, ComfyUI, InvokeAI
50
+ [Download *.safetensors file](https://huggingface.co/dataautogpt3/OpenDalle/resolve/main/OpenDalle.safetensors?download=true)
51
+
52
+ ## Use it with 🧨 diffusers
53
+ ```python
54
+ from diffusers import AutoPipelineForText2Image
55
+ import torch
56
+
57
+ pipeline = AutoPipelineForText2Image.from_pretrained('dataautogpt3/OpenDalle', torch_dtype=torch.float16).to('cuda')
58
+ image = pipeline('Manga from the early 1990s, characterized by its surreal aesthetic. The artwork is depicted in matte colors and created using a digital medium. Notable illustrators include Junji Ito, Yoshiyuki Sadamoto, and Rumiko Takahashi.').images[0]
59
+ ```
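
The snippet added to the README passes only a prompt, while several widget entries in the front matter also carry a `negative_prompt`. Below is a minimal sketch, not part of this commit, of how such a value could be forwarded to the same pipeline and the result saved to disk. The helper names `build_call_kwargs` and `generate`, the default of 30 inference steps, and the output filename are illustrative assumptions, not settings recommended by the model author.

```python
from typing import Any, Dict, Optional

# Repository id taken from the README snippet in this diff.
MODEL_ID = "dataautogpt3/OpenDalle"


def build_call_kwargs(
    prompt: str,
    negative_prompt: Optional[str] = None,
    steps: int = 30,  # assumed default, chosen only for illustration
) -> Dict[str, Any]:
    """Collect keyword arguments for a pipeline call (hypothetical helper)."""
    kwargs: Dict[str, Any] = {"prompt": prompt, "num_inference_steps": steps}
    if negative_prompt:
        # Only forward a negative prompt when one was actually supplied.
        kwargs["negative_prompt"] = negative_prompt
    return kwargs


def generate(
    prompt: str,
    negative_prompt: Optional[str] = None,
    steps: int = 30,
    out_path: str = "opendalle.png",  # assumed filename
) -> str:
    """Load the pipeline, render one image, save it, and return the path."""
    # Heavy imports are deferred so build_call_kwargs works without a GPU.
    import torch
    from diffusers import AutoPipelineForText2Image

    pipeline = AutoPipelineForText2Image.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16
    ).to("cuda")
    image = pipeline(**build_call_kwargs(prompt, negative_prompt, steps)).images[0]
    image.save(out_path)
    return out_path
```

Calling `generate("panther head coming out of smoke, dark, moody, detailed, shadows", negative_prompt="bad quality, blurry, watermark")` on a CUDA machine with `diffusers` and `torch` installed would download the weights on first use and write the image to `opendalle.png`.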