File size: 3,909 Bytes
51ef6f4 695fc35 1265e2f 695fc35 4716210 1265e2f 695fc35 fc5ac5d 3fc281d 695fc35 4716210 695fc35 8eae614 695fc35 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 |
---
license: openrail
language:
- en
base_model:
- stabilityai/stable-diffusion-3.5-medium
pipeline_tag: image-to-image
---
# Stable Diffusion 3.5 Medium Turbo (SD3.5M Turbo)
**"All that has passed is but a prologue; what you hold in your heart will one day echo back."**
## Project Overview
Stable Diffusion 3.5 Medium Turbo (SD3.5M Turbo) is a high-performance text-to-image model distilled from StabilityAI's stable-diffusion-3.5-medium. This model emphasizes stability and efficiency, making it suitable for a wide range of art styles and creative expression scenarios.
<img src="https://github.com/user-attachments/assets/5d3313d6-6087-46f1-8add-5f322097c5be" alt="ComfyUI_temp_psqad_00205_" width="300"/>
## Model Features
- 🚀 **Turbo Performance**: Faster generation speeds, ideal for multitasking and high-demand scenarios.
- 🎨 **Versatile Styles**: Supports a wide range of styles, from photorealistic to abstract art.
- 🖼️ **High-Resolution Outputs**: Produces images with exceptional clarity and intricate details.
- ⚙️ **Easy to Extend**: Integrated with `LoRA` technology, making it easier for users to customize and experiment.
---
## How to Use
1. **Download the Model**
Download the latest versions of the model’s `ckpt` and `LoRA` files from the following links:
- [SD3.5M Checkpoint (Updated on 2024/12/16)](link_placeholder)
- [SD3.5M LoRA (Updated on 2024/12/15)](link_placeholder)
2. **Environment Setup**
Ensure that your environment meets the following requirements:
- Python 3.8+
- PyTorch 2.0+
- Required libraries such as `diffusers` and `transformers`
3. **Model Loading**
Load and use the model following the detailed instructions provided in the repository.
---
## Example Output
```python
import torch
from diffusers import StableDiffusion3Pipeline
pipe = StableDiffusion3Pipeline.from_pretrained("tensorart/stable-diffusion-3.5-medium-turbo", torch_dtype=torch.float16,)
pipe = pipe.to("cuda")
image = pipe(
"A beautiful bald girl with silver and white futuristic metal face jewelry, her full body made of intricately carved liquid glass in the style of Tadashi, the complexity master of cyberpunk, in the style of James Jean and Peter Mohrbacher. This concept design is trending on Artstation, with sharp focus, studio-quality photography, and highly detailed, intricate details.",
num_inference_steps=8,
guidance_scale=1.5,
height=1024,
width=768
).images[0]
image.save("./output_sd3.5m_turbo/test4-2.webp")
```
Using lora:
```python
import torch
from diffusers import StableDiffusion3Pipeline
import numpy as np
from safetensors.torch import load_file
pipe = StableDiffusion3Pipeline.from_pretrained("tensorart/stable-diffusion-3.5-medium-turbo", torch_dtype=torch.float16,)
pipe = pipe.to("cuda")
lora_weights_dir = 'tensorart/stable-diffusion-3.5-medium-turbo/lora_sd3.5m_turbo_8steps.safetensors'
pcm_lora_weight = load_file(lora_weights_dir)
alpha = 1.0
pcm_lora_weight = {
key: value * np.sqrt(alpha) for key, value in pcm_lora_weight.items()
}
pipe.load_lora_weights(lora_weights_dir)
pipe = pipe.to("cuda")
image = pipe(
"A beautiful bald girl with silver and white futuristic metal face jewelry, her full body made of intricately carved liquid glass in the style of Tadashi, the complexity master of cyberpunk, in the style of James Jean and Peter Mohrbacher. This concept design is trending on Artstation, with sharp focus, studio-quality photography, and highly detailed, intricate details.",
num_inference_steps=8,
guidance_scale=1.5,
height=1024,
width=768
).images[0]
image.save("./output_sd3.5m_turbo_lora/test1.webp")
``` |