Update README.md
Browse files
README.md
CHANGED
@@ -6,11 +6,110 @@ tags:
|
|
6 |
- diffusers
|
7 |
pipeline_tag: text-to-image
|
8 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
```python
|
10 |
from diffusers import DiffusionPipeline
|
11 |
|
|
|
12 |
pipe = DiffusionPipeline.from_pretrained("miike-ai/skittles-v2")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
|
14 |
-
|
15 |
-
|
16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
6 |
- diffusers
|
7 |
pipeline_tag: text-to-image
|
8 |
---
|
9 |
+
|
10 |
+
Here's a draft for your **Skittles v2** model card on Hugging Face:
|
11 |
+
|
12 |
+
---
|
13 |
+
|
14 |
+
# Skittles v2
|
15 |
+
|
16 |
+
## Model Summary
|
17 |
+
|
18 |
+
**Skittles v2** is a cutting-edge text-to-image generation model, created by merging components from the **FLUX.1 Schnell** architecture. By combining the precision of **FLUX.1 Schnell** with advanced tweaks, **Skittles v2** is designed to offer high-quality image outputs comparable to **FLUX.1 Dev** while maintaining a streamlined configuration for efficient inference.
|
19 |
+
|
20 |
+
- **Type**: Text-to-Image Generation
|
21 |
+
- **Architecture**: Merged FLUX.1 Schnell with CFG capabilities
|
22 |
+
- **Output Quality**: On par with **FLUX.1 Dev**
|
23 |
+
- **Performance**: Optimized for both speed and image fidelity
|
24 |
+
|
25 |
+
---
|
26 |
+
|
27 |
+
## Features
|
28 |
+
|
29 |
+
- **CFG Integration**: Skittles v2 unlocks CFG (Classifier-Free Guidance) capabilities, offering fine-grained control over image generation.
|
30 |
+
- **High Fidelity**: Produces ultra-realistic and detailed images.
|
31 |
+
- **Optimized Performance**: Merged architecture reduces latency during inference while retaining quality.
|
32 |
+
- **Customizable Output**: Supports a wide range of prompts, styles, and configurations.
|
33 |
+
|
34 |
+
---
|
35 |
+
|
36 |
+
## Model Details
|
37 |
+
|
38 |
+
- **Base Model**: FLUX.1 Schnell
|
39 |
+
- **Merge Approach**: The model was combined using a custom merging strategy, blending FLUX.1 Schnell’s architecture with optimized CFG decoding.
|
40 |
+
- **Training Paradigm**: Not retrained, but restructured for improved inference performance.
|
41 |
+
- **Output Size**: Supports resolutions up to 1024x1024 pixels.
|
42 |
+
- **Quantization Compatibility**: Works with `optimum-quanto` for FP8 quantization.
|
43 |
+
|
44 |
+
---
|
45 |
+
|
46 |
+
## Intended Use
|
47 |
+
|
48 |
+
### Applications
|
49 |
+
- Generating ultra-realistic images for creative projects
|
50 |
+
- Creating concept art, visual prototypes, and artistic renderings
|
51 |
+
- Exploration of text-to-image synthesis for research or artistic purposes
|
52 |
+
|
53 |
+
### Examples
|
54 |
+
|
55 |
+
| Prompt | Image Description |
|
56 |
+
|-------|--------------------|
|
57 |
+
| "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" | Hyper-detailed astronaut surrounded by lush, muted jungle tones |
|
58 |
+
| "A futuristic cityscape at sunset, ultra-realistic, cinematic, 4K" | Vibrant, glowing cityscape with dynamic lighting effects |
|
59 |
+
| "A delicious ceviche cheesecake slice" | Highly detailed and realistic rendition of a culinary masterpiece |
|
60 |
+
|
61 |
+
---
|
62 |
+
|
63 |
+
## How to Use
|
64 |
+
|
65 |
```python
|
66 |
from diffusers import DiffusionPipeline
|
67 |
|
68 |
+
# Load the model
|
69 |
pipe = DiffusionPipeline.from_pretrained("miike-ai/skittles-v2")
|
70 |
+
pipe.to("cuda") # Ensure CUDA is available
|
71 |
+
|
72 |
+
# Generate an image
|
73 |
+
prompt = "An ultra-realistic image of a futuristic cityscape."
|
74 |
+
image = pipe(prompt, guidance_scale=3.5, num_inference_steps=28).images[0]
|
75 |
+
|
76 |
+
# Save the result
|
77 |
+
image.save("generated_image.png")
|
78 |
+
```
|
79 |
+
|
80 |
+
---
|
81 |
+
|
82 |
+
## Limitations and Biases
|
83 |
+
|
84 |
+
- The model may produce biased or stereotypical outputs based on the provided text prompts.
|
85 |
+
- Outputs are deterministic but rely heavily on the prompt quality. Results may vary with ambiguous descriptions.
|
86 |
+
- The model is not trained to handle NSFW content or harmful prompts.
|
87 |
+
|
88 |
+
---
|
89 |
+
|
90 |
+
## Acknowledgments
|
91 |
+
|
92 |
+
- Built on top of **FLUX.1 Schnell** by Black Forest Labs
|
93 |
+
- Contributions from **miike-ai**
|
94 |
+
- Integrated with Hugging Face Diffusers for seamless inference
|
95 |
+
|
96 |
+
---
|
97 |
+
|
98 |
+
## License
|
99 |
+
|
100 |
+
This model is released under the **Apache 2.0 License**.
|
101 |
+
|
102 |
+
---
|
103 |
+
|
104 |
+
## Citation
|
105 |
+
|
106 |
+
If you use **Skittles v2** in your work, please cite:
|
107 |
|
108 |
+
```bibtex
|
109 |
+
@misc{miike2024skittlesv2,
|
110 |
+
title={Skittles v2: A Merged Text-to-Image Generation Model},
|
111 |
+
author={miike-ai},
|
112 |
+
year={2024},
|
113 |
+
url={https://huggingface.co/miike-ai/skittles-v2},
|
114 |
+
}
|
115 |
+
```
|