File size: 3,841 Bytes
51ef6f4
 
 
 
 
 
a4a4b7b
51ef6f4
695fc35
 
d6f6d15
1626450
43f9b5a
695fc35
 
 
 
 
1265e2f
695fc35
 
 
 
 
 
 
 
 
fc5ac5d
3fc281d
695fc35
 
 
 
 
 
 
 
 
 
 
b2891b6
695fc35
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13bc8a5
695fc35
 
 
 
 
 
 
 
 
 
4716210
 
695fc35
 
 
 
 
 
 
 
 
 
 
8eae614
695fc35
 
 
 
 
 
 
13bc8a5
b765d1c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
---
license: openrail
language:
- en
base_model:
- stabilityai/stable-diffusion-3.5-medium
pipeline_tag: text-to-image
---
# Stable Diffusion 3.5 Medium Turbo  (SD3.5M Turbo)

 *"All that has passed is but a prologue,what you hold in your heart will one day echo back."*
 
<img src="https://github.com/user-attachments/assets/5d3313d6-6087-46f1-8add-5f322097c5be" alt="ComfyUI_temp_psqad_00205_" width="400"/>

## Project Overview

Stable Diffusion 3.5 Medium Turbo (SD3.5M Turbo) is a high-performance text-to-image model distilled from StabilityAI's stable-diffusion-3.5-medium. This model emphasizes stability and efficiency, making it suitable for a wide range of art styles and creative expression scenarios.





## Model Features

- 🚀 **Turbo Performance**: Faster generation speeds, ideal for multitasking and high-demand scenarios.
- 🎨 **Versatile Styles**: Supports a wide range of styles, from photorealistic to abstract art.
- 🖼️ **High-Resolution Outputs**: Produces images with exceptional clarity and intricate details.
- ⚙️ **Easy to Extend**: Integrated with `LoRA` technology, making it easier for users to customize and experiment.
---

## How to Use

1. **Download the Model**  
   Download the latest versions of the model’s `ckpt` and `LoRA` files from the following links:
   - [SD3.5M Checkpoint (Updated on 2024/12/16)](link_placeholder)
   - [SD3.5M LoRA (Updated on 2024/12/15)](link_placeholder)

2. **Environment Setup**  
   Ensure that your environment meets the following requirements:
   - Python 3.8+
   - PyTorch 2.0+
   - Required libraries such as `diffusers` 

3. **Model Loading**  
   Load and use the model following the detailed instructions provided in the repository.

---

## Example Output


   ```python
   import torch
   from diffusers import StableDiffusion3Pipeline

   pipe = StableDiffusion3Pipeline.from_pretrained("tensorart/stable-diffusion-3.5-medium-turbo", torch_dtype=torch.float16,)
                                                   
   pipe = pipe.to("cuda")


   image = pipe(
      "A beautiful bald girl with silver and white futuristic metal face jewelry, her full body made of intricately carved liquid glass in the style of Tadashi, the complexity master of cyberpunk, in the style of James Jean and Peter Mohrbacher. This concept design is trending on Artstation, with sharp focus, studio-quality photography, and highly detailed, intricate details.",
      num_inference_steps=8,
      guidance_scale=1.5,
      height=1024,
      width=768 
   ).images[0]

   image.save("./test4-2.webp")
   ```

   Using lora:

   ```python
   import torch
   from diffusers import StableDiffusion3Pipeline
   import numpy as np
   from safetensors.torch import load_file

   pipe = StableDiffusion3Pipeline.from_pretrained("tensorart/stable-diffusion-3.5-medium-turbo", torch_dtype=torch.float16,)
                                                   
   pipe = pipe.to("cuda")

   lora_weights_dir = 'tensorart/stable-diffusion-3.5-medium-turbo/lora_sd3.5m_turbo_8steps.safetensors'
   pcm_lora_weight = load_file(lora_weights_dir)

   alpha = 1.0
   pcm_lora_weight = {
      key: value * np.sqrt(alpha) for key, value in pcm_lora_weight.items()
   }
   pipe.load_lora_weights(lora_weights_dir)
   pipe = pipe.to("cuda")

   image = pipe(
      "A beautiful bald girl with silver and white futuristic metal face jewelry, her full body made of intricately carved liquid glass in the style of Tadashi, the complexity master of cyberpunk, in the style of James Jean and Peter Mohrbacher. This concept design is trending on Artstation, with sharp focus, studio-quality photography, and highly detailed, intricate details.",
      num_inference_steps=8,
      guidance_scale=1.5,
      height=1024,
      width=768 
   ).images[0]
   image.save("./test1.webp")
   ```