File size: 3,221 Bytes
eeffd4d 9d25ef8 a02fa0b 9d25ef8 d838d28 9d25ef8 e507c73 8868100 9d25ef8 d838d28 9d25ef8 eeffd4d 9d25ef8 d838d28 9d25ef8 eeffd4d 9d25ef8 d838d28 9d25ef8 d838d28 9d25ef8 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 |
---
base_model:
- black-forest-labs/FLUX.1-schnell
tags:
- flux
- diffusers
pipeline_tag: text-to-image
---
# Flux Schnell CFG
## Model Summary
**Flux Schnell CFG** is a cutting-edge text-to-image generation model, created by merging components from the **FLUX.1 Schnell** architecture. By combining the precision of **FLUX.1 Schnell** with advanced tweaks, **Skittles v2** is designed to offer high-quality image outputs.
- **Type**: Text-to-Image Generation
- **Architecture**: Merged FLUX.1 Schnell with CFG capabilities
- **Output Quality**: Seems to be on par with **FLUX.1 Dev**
- **Performance**: Optimized for both image fidelity
---
## Features
- **CFG Integration**: Flux Schnell CFG unlocks CFG (Classifier-Free Guidance) capabilities, offering fine-grained control over image generation.
- **High Fidelity**: Produces ultra-realistic and detailed images.
- **Customizable Output**: Supports a wide range of prompts, styles, and configurations.
---
## Model Details
- **Base Model**: FLUX.1 Schnell
- **Merge Approach**: The model was combined using a custom merging strategy, blending FLUX.1 Schnell’s architecture with optimized CFG decoding.
- **Training Paradigm**: Not retrained, but restructured for improved inference performance.
- **Output Size**: Supports resolutions up to 1024x1024 pixels.
---
## Intended Use
### Applications
- Generating ultra-realistic images for creative projects
- Creating concept art, visual prototypes, and artistic renderings
- Exploration of text-to-image synthesis for research or artistic purposes
### Examples
| Prompt | Image Description |
|-------|--------------------|
| "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" | Hyper-detailed astronaut surrounded by lush, muted jungle tones |
| "A futuristic cityscape at sunset, ultra-realistic, cinematic, 4K" | Vibrant, glowing cityscape with dynamic lighting effects |
| "A delicious ceviche cheesecake slice" | Highly detailed and realistic rendition of a culinary masterpiece |
---
## How to Use
```python
from diffusers import DiffusionPipeline
# Load the model
pipe = DiffusionPipeline.from_pretrained("miike-ai/flux-schnell-cfg")
pipe.to("cuda") # Ensure CUDA is available
# Generate an image
prompt = "An ultra-realistic image of a futuristic cityscape."
image = pipe(prompt, guidance_scale=3.5, num_inference_steps=28).images[0]
# Save the result
image.save("generated_image.png")
```
---
## Limitations and Biases
- The model may produce biased or stereotypical outputs based on the provided text prompts.
- Outputs are deterministic but rely heavily on the prompt quality. Results may vary with ambiguous descriptions.
- The model is not trained to handle NSFW content or harmful prompts.
---
## Acknowledgments
- Built on top of **FLUX.1 Schnell** by Black Forest Labs
- Contributions from **miike-ai**
- Integrated with Hugging Face Diffusers for seamless inference
---
## Citation
If you use **Skittles v2** in your work, please cite:
```bibtex
@misc{miike2024flux-schnell-cfg,
title={Flux Schnell CFG: A Merged Text-to-Image Generation Model},
author={miike-ai},
year={2024},
url={https://huggingface.co/miike-ai/flux-schnell-cfg},
}
```
|