Chan-Y commited on
Commit
de28e79
·
verified ·
1 Parent(s): f51e0a8

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -0
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ library_name: diffusers
6
+ ---
7
+ ## Model Details
8
+
9
+ - **Model Name:** Stable-Flash-Lightning
10
+ - **Model Card Authors:** M.Cihan Yalçın
11
+ - **Base Models Merged:**
12
+ - [stabilityai/stable-diffusion-xl-base-1.0](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0)
13
+ - [sd-community/sdxl-flash-lora](https://huggingface.co/sd-community/sdxl-flash-lora)
14
+ - [ByteDance/SDXL-Lightning](https://huggingface.co/ByteDance/SDXL-Lightning)
15
+
16
+ ## Model Description
17
+
18
+ The Stable-Flash-Lightning model is a powerful text-to-image model that leverages the strengths of three distinct diffusion models. By merging `stabilityai/stable-diffusion-xl-base-1.0`, `sd-community/sdxl-flash-lora`, and `ByteDance/SDXL-Lightning`, this model aims to generate highly realistic and detailed images from textual descriptions. The combined capabilities of these models ensure high-quality output with intricate details and vivid realism.
19
+
20
+ ## Example Usage
21
+
22
+ ```python
23
+ import torch
24
+ from diffusers import DiffusionPipeline
25
+
26
+ # Load the pipeline
27
+ pipeline = DiffusionPipeline.from_pretrained("Chan-Y/Stable-Flash-Lightning")
28
+
29
+ # Define the prompt and negative prompt
30
+ prompt = "a ultra-realistic cute little rabbit with big green eyes that wears a hat"
31
+ neg = "low quality, blur"
32
+
33
+ # Set random seed for reproducibility
34
+ torch.manual_seed(1521)
35
+
36
+ # Generate the image
37
+ image = pipeline(prompt,
38
+ negative_prompt=neg,
39
+ cross_attention_kwargs={"scale": 1.0},
40
+ num_inference_steps=50,
41
+ resize={"target_size": [256, 256]}).images[0]
42
+
43
+ # Display the image
44
+ image
45
+ ```
46
+
47
+ ## Model Performance
48
+
49
+ The model performs exceptionally well in generating ultra-realistic images with intricate details. The merged architecture allows it to handle complex prompts and produce images with high fidelity. The negative prompt capability helps in refining the output by avoiding undesirable qualities.
50
+
51
+ ## Merging Process
52
+
53
+ The model was created by merging the safetensors of `sd-community/sdxl-flash-lora` and `ByteDance/SDXL-Lightning` with the base model `stabilityai/stable-diffusion-xl-base-1.0`. No further fine-tuning was performed after the merging process. This approach combines the unique features and strengths of each model, resulting in a versatile and powerful text-to-image generation tool.
54
+
55
+ ## Intended Use
56
+
57
+ The model is intended for creative and artistic purposes, enabling users to generate high-quality images from textual descriptions. It can be used in various applications such as digital art, content creation, and visualization.
58
+
59
+ ## Limitations
60
+
61
+ - The model may not always perfectly capture highly complex or abstract concepts.
62
+ - The quality of the output can be influenced by the specificity and clarity of the prompt.
63
+ - Ethical considerations should be taken into account when generating images to avoid misuse.
64
+
65
+ ## Contact Information
66
+
67
+ For any queries or further information, please contact [Linkedin](https://www.linkedin.com/in/chanyalcin/).