---
license: cc-by-2.0
library_name: diffusers
tags:
- art
- code
- stable-diffusion-xl
- stable-diffusion-xl-diffusers
---
# SDXL LoRA DreamBooth - comic_style_LoRA
<Gallery />
| Image 1 | Image 2 |
|---|---|
| ![example 1](./1.png) | ![example 2](./2.png) |

| Image 3 | Image 4 |
|---|---|
| ![example 3](./3.png) | ![example 4](./4.png) |
## Model description
These are comic_style LoRA adaptation weights for `stabilityai/stable-diffusion-xl-base-1.0`.
The weights were trained using [DreamBooth](https://dreambooth.github.io/).
LoRA for the text encoder was enabled.
Special VAE used for training: `madebyollin/sdxl-vae-fp16-fix`.
Dataset: custom hand-drawn sketches by [ritwikraha](https://www.ritwikraha.com/).
## Trigger words
Include the phrase `a photo in the style of TOK comics` in your prompt to trigger the comic style during image generation.
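
The trigger phrase can be combined with any subject description. Below is a minimal sketch of one way to do that; the helper name `build_prompt` and its parameters are hypothetical (they are not part of diffusers or this repository), and the wording follows the pattern of the example prompt in the Usage section.

```python
# Hypothetical helper for illustration; not part of diffusers or this repository.
TRIGGER_PHRASE = "in the style of TOK comics"


def build_prompt(subject: str, extras: str = "") -> str:
    """Return 'a photo of <subject> in the style of TOK comics[, <extras>]'."""
    prompt = f"a photo of {subject.strip()} {TRIGGER_PHRASE}"
    if extras.strip():
        # Optional quality or style modifiers are appended after a comma.
        prompt += f", {extras.strip()}"
    return prompt


print(build_prompt("18th century London", "8k"))
```

The resulting string can be passed as the `prompt` argument of the pipeline call.
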
## Usage
```python
# In a notebook, install the dependencies first:
# !pip install diffusers accelerate -q
import torch
from diffusers import DiffusionPipeline, AutoencoderKL

# Load the fp16-safe VAE that was used during training.
vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16)

# Load the SDXL base pipeline in half precision with the custom VAE.
pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    vae=vae,
    torch_dtype=torch.float16,
    variant="fp16",
    use_safetensors=True,
)

# Apply the comic-style LoRA weights, then move the pipeline to the GPU.
pipe.load_lora_weights("ritwikraha/comics_style_LoRA")
pipe.to("cuda")

prompt = "a photo of 18th century London in the style of TOK comics, 8k"
negative_prompt = "ugly face, multiple bodies, bad anatomy, disfigured, extra fingers"
image = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    guidance_scale=3,
    num_inference_steps=50,
).images[0]
image  # displays the result in a notebook
```
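
If the comic style comes out too strong or too weak, the LoRA influence can usually be adjusted at inference time. The sketch below is an assumption-laden illustration, not the author's recommended settings: `generation_kwargs` is a hypothetical helper, the default `lora_scale=0.8` is an arbitrary example value, and it relies on the diffusers convention of passing `cross_attention_kwargs={"scale": ...}` to the pipeline call.

```python
# Hypothetical helper for illustration; not part of diffusers or this repository.
def generation_kwargs(prompt, negative_prompt="", lora_scale=0.8,
                      guidance_scale=3, num_inference_steps=50):
    """Collect keyword arguments for a pipeline call with an adjustable LoRA scale."""
    return {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "guidance_scale": guidance_scale,
        "num_inference_steps": num_inference_steps,
        # Assumed diffusers convention for scaling the LoRA contribution.
        "cross_attention_kwargs": {"scale": lora_scale},
    }


kwargs = generation_kwargs(
    "a photo of 18th century London in the style of TOK comics, 8k",
    lora_scale=0.6,
)

# With a loaded pipeline `pipe` on a CUDA GPU, a seeded call might look like:
# import torch
# generator = torch.Generator(device="cuda").manual_seed(0)
# image = pipe(**kwargs, generator=generator).images[0]
# image.save("comic_london.png")
```

In diffusers, a scale of 0 generally corresponds to the base model alone and a scale of 1 to the full LoRA effect; passing a seeded `generator` makes runs easier to compare.
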
## Download model
Weights for this model are available in Safetensors format.
[Download](https://huggingface.co/ritwikraha/comics_style_LoRA/tree/main) them in the Files & versions tab.