File size: 3,084 Bytes

87d101a
 
 
 
 
 
 
 
 
 
 
 
 
7fa487f
 
 
 
 
87d101a
 
 
7b22b9e
87d101a
e0afa42
 
 
 
 
 
 
87d101a
 
 
 
7b22b9e
 
87d101a
 
7b22b9e
87d101a
7b22b9e
 
 
87d101a
7b22b9e
 
 
 
 
 
 
87d101a
 
7b22b9e

---
base_model: stabilityai/stable-diffusion-xl-base-1.0
library_name: diffusers
license: openrail++
tags:
- text-to-image
- text-to-image
- diffusers-training
- diffusers
- lora
- template:sd-lora
- stable-diffusion-xl
- stable-diffusion-xl-diffusers
datasets:
- data-is-better-together/open-image-preferences-v1-binarized
language:
- en
pipeline_tag: text-to-image
---


# Low Rank Adapted Supervised Fine Tuned Stable Diffusion XL

## Comparison

| Prompt | SDXL | Fine Tuned |
| :--: | :--: | :--: |
| a boat in the canals of Venice, painted in gouache with soft, flowing brushstrokes and vibrant, translucent colors, capturing the serene reflection on the water under a misty ambiance, with rich textures and a dynamic perspective | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/CtoZWxDmANYm7d95I3Fcp.png) | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/hAxmaL-robradk1x_KqwQ.png) |
| Grainy shot of a robot cooking in the kitchen, with soft shadows and nostalgic film texture. | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/LTQI1NdaEjJUgeDpqzv7k.png) | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/vAjnMCW0nmbV0zHKT8oCJ.png) |

## Model description

These are ariG23498/open-image-preferences-v1-sdxl-lora LoRA adaption weights for stabilityai/stable-diffusion-xl-base-1.0.

The weights were trained using [DreamBooth](https://github.com/ariG23498/diffusers/blob/aritra/sdxl-lora/examples/dreambooth/train_dreambooth_lora_sdxl.py) using the
[open-image-preferences-v1-binarized](https://huggingface.co/datasets/data-is-better-together/open-image-preferences-v1-binarized) dataset.


## Use with `diffusers`

```py
from diffusers import AutoPipelineForText2Image
import torch

pipeline = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.bfloat16
).to('cuda')
pipeline.load_lora_weights('ariG23498/open-image-preferences-v1-sdxl-lora', weight_name='pytorch_lora_weights.safetensors')
prompt = "ENTER PROMPT"
image = pipeline(prompt).images[0]
```

## Command to train the model

```shell
!accelerate launch examples/dreambooth/train_dreambooth_lora_sdxl.py \
    --pretrained_model_name_or_path "stabilityai/stable-diffusion-xl-base-1.0" \
    --dataset_name "data-is-better-together/open-image-preferences-v1-binarized" \
    --hub_model_id "ariG23498/open-image-preferences-v1-sdxl-lora" \
    --push_to_hub \
    --output_dir "open-image-preferences-v1-sdxl-lora" \
    --image_column "chosen" \
    --caption_column "prompt" \
    --mixed_precision="bf16" \
    --resolution=1024 \
    --train_batch_size=1 \
    --repeats=1 \
    --report_to="wandb"\
    --gradient_accumulation_steps=1 \
    --gradient_checkpointing \
    --learning_rate=1.0 \
    --text_encoder_lr=1.0 \
    --optimizer="prodigy"\
    --lr_scheduler="constant" \
    --lr_warmup_steps=0 \
    --rank=8 \
    --checkpointing_steps=2000 \
    --seed="0"
```