Model card auto-generated by SimpleTuner

README.md +299 -0

README.md ADDED

	@@ -0,0 +1,299 @@

+---
+license: other
+base_model: "stabilityai/stable-diffusion-3.5-medium"
+tags:
+  - sd3
+  - sd3-diffusers
+  - text-to-image
+  - diffusers
+  - simpletuner
+  - not-for-all-audiences
+  - lora
+  - template:sd-lora
+  - lycoris
+inference: true
+widget:
+- text: 'unconditional (blank prompt)'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_0_0.png
+- text: 'a close-up of a young woman in the foreground with windswept red hair, green eyes full of determination, while a soft light highlights her'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_1_0.png
+- text: 'a beautiful woman with flowing hair adorned with flowers and intricate patterns, her expression is confident and mysterious as she holds her hand near her face, she wears an ornate outfit with floral designs, her skin and features are bathed in soft pink tones, the background is filled with decorative swirls and patterns that enhance the elegant and sensual atmosphere, the overall composition is delicate and detailed with a romantic and alluring vibe'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_2_0.png
+- text: 'profile of a dark-skinned woman with long blonde hair and green-yellow eyes, set against a background of large, vibrant leaves in shades of green, yellow, and red, the leaves dominate the foreground, creating a vivid contrast with the woman’s calm and focused expression, as if she is deep in thought, the lighting highlights her face and hair subtly, while the foliage around her adds a sense of motion and energy, the background fades into deeper greens and yellows'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_3_0.png
+- text: 'a woman with long red hair is shown in a curled-up pose, her face tilted toward her knees, gazing intensely at the camera with a look that combines vulnerability and curiosity, her lips are slightly parted, adding a sense of intrigue, while her green eyes, half-shaded by her bangs, convey a mixture of thoughtfulness and quiet intensity, she wears a bright orange headband that contrasts vividly with the green of her outfit and the background, the outfit is made of olive-green lace, with intricate embroidery that stands out against her fair skin, her left arm is bent and rests on her leg, adorned with an ornate gold bracelet featuring white and black details, the background is a muted teal-green, which enhances the bold colors of the image, creating an elegant contrast between warm and cool tones, the overall atmosphere is sophisticated, with a mix of sensuality, mystery, and strength conveyed through her expressive gaze and compact pose'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_4_0.png
+- text: 'a mesmerizing woman with flowing orange-blonde hair cascading down her back her piercing green eyes framed by vibrant blue eyelashes gaze directly at the viewer contrasting beautifully with her bold orange eyebrows her pink lips add a soft yet striking touch she wears a flowing pink cape that glimmers with delicate sparkles catching the light her pose is elegant with her back turned to the camera yet her expression exudes confidence and mystery as she looks over her shoulder'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_5_0.png
+- text: 'a young woman with long brown hair with pink shades and gray eyes, crouching, holding a drink with a straw, looking to the side, wearing a white jacket with red inserts with a hood and tight black trousers and white sneakers with blue laces and red spider print, with one hand under her chin'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_6_0.png
+- text: 'a seductive vampire under a glowing crescent moon her pale porcelain skin gleams with ethereal light while shadows dance across her curves her deep black hair flows dramatically with hints of blue and purple highlights her crimson lips part in a serene expression as she interacts with a bat above her, minimal dark outfit and high collar enhance her elegance surrounded by swirling pink and teal clouds in a moody nocturnal sky'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_7_0.png
+- text: 'a jungle queen with piercing green eyes her long blonde hair is adorned with flowers and feathers partially catching the light her face is dramatically lit with sharp contrasts one side glowing warmly while the other is cast in soft shadow creating depth and mystery the light highlights her vibrant green eyes and glossed lips while the shadow accentuates her cheekbones and jawline the jungle backdrop intensifies the luminous and moody atmosphere surrounding her'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_8_0.png
+- text: 'a confident young queen with flowing blonde hair and striking blue eyes her gaze is bold and captivating her face illuminated by soft yet dynamic lighting emphasizing the gloss on her lips and the glow of her complexion she wears a hood adorned with playful designs a bold pink heart stands out near her eye the intricate necklace of sharp teeth contrasts with her playful yet regal expression a subtle onomatopoeia hinted through her softly puckered lips evokes a gentle kiss sound amplifying her charm'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_9_0.png
+- text: 'two young women stand side by side , one has dark curly hair, wearing a sleeveless top and shorts with a belt, the other has long blonde hair, wearing similar attire, they exchange a subtle, knowing glance, as if sharing an unspoken understanding'
+  parameters:
+    negative_prompt: 'blurry, cropped, ugly'
+  output:
+    url: ./assets/image_10_0.png
+---
+# besch-style-st-sd35m-lokr-8e-5-bs6-v03
+This is a LyCORIS adapter derived from [stabilityai/stable-diffusion-3.5-medium](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium).
+No validation prompt was used during training.
+## Validation settings
+- CFG: `6.0`
+- CFG Rescale: `0.0`
+- Steps: `30`
+- Sampler: `FlowMatchEulerDiscreteScheduler`
+- Seed: `42`
+- Resolution: `832x1216`
+- Skip-layer guidance: `skip_guidance_layers=[7, 8, 9]`
+Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
+You can find some example images in the following gallery:
+<Gallery />
+The text encoder **was not** trained.
+You may reuse the base model text encoder for inference.
+## Training settings
+- Training epochs: 0
+- Training steps: 798
+- Learning rate: 8e-05
+  - Learning rate schedule: polynomial
+  - Warmup steps: 798
+- Max grad norm: 0.01
+- Effective batch size: 6
+  - Micro-batch size: 6
+  - Gradient accumulation steps: 1
+  - Number of GPUs: 1
+- Gradient checkpointing: True
+- Prediction type: flow-matching (extra parameters=['flux_schedule_auto_shift', 'shift=0.0', 'flux_use_uniform_schedule'])
+- Optimizer: adamw_bf16
+- Trainable parameter precision: Pure BF16
+- Caption dropout probability: 25.0%
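The effective batch size listed above follows from the three values nested under it. A minimal sketch of that arithmetic; the variable names are chosen here for illustration and are not SimpleTuner option names, and the sample count assumes every step used a full batch:

```python
# Values copied from the training settings above.
micro_batch_size = 6
gradient_accumulation_steps = 1
num_gpus = 1
training_steps = 798

# Samples contributing to each optimizer step.
effective_batch_size = micro_batch_size * gradient_accumulation_steps * num_gpus

# Upper bound on samples processed over the run (assumes full batches throughout).
samples_processed = training_steps * effective_batch_size

print(effective_batch_size)  # 6
print(samples_processed)     # 4788
```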
+### LyCORIS Config:
+```json
+{
+    "bypass_mode": true,
+    "algo": "lokr",
+    "multiplier": 1.0,
+    "full_matrix": true,
+    "linear_dim": 10000,
+    "linear_alpha": 1,
+    "factor": 4,
+    "apply_preset": {
+        "target_module": [
+            "Attention",
+            "FeedForward"
+        ],
+        "module_algo_map": {
+            "FeedForward": {
+                "factor": 4
+            },
+            "Attention": {
+                "factor": 2
+            }
+        }
+    }
+}
+```
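With `"algo": "lokr"`, the adapter expresses each weight update as a Kronecker product of two smaller matrices, and `factor` influences how a layer's dimensions are split between them. The sketch below illustrates only that idea in plain Python. The `kron` and `lokr_param_count` helpers, the 1536x1536 layer shape, and the simple `factor` x `dim // factor` split are assumptions made for illustration, not the LyCORIS implementation; it also assumes, as `"full_matrix": true` suggests, that neither factor is decomposed further.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    rows_b, cols_b = len(b), len(b[0])
    return [
        [a_val * b[i][j] for a_val in a_row for j in range(cols_b)]
        for a_row in a
        for i in range(rows_b)
    ]


def lokr_param_count(out_dim, in_dim, factor):
    """Parameters in the two Kronecker factors under the assumed split."""
    small = factor * factor                           # factor x factor matrix
    large = (out_dim // factor) * (in_dim // factor)  # remaining block
    return small + large


# Tiny example: a 2x2 factor and a 2x3 factor give a 4x6 update.
w1 = [[1, 2], [3, 4]]
w2 = [[0, 1, 0], [1, 0, 1]]
delta = kron(w1, w2)
print(len(delta), len(delta[0]))  # 4 6

# Hypothetical 1536x1536 linear layer, using the two factors from the config.
full = 1536 * 1536
print(full)                             # 2359296
print(lokr_param_count(1536, 1536, 4))  # 147472
print(lokr_param_count(1536, 1536, 2))  # 589828
```

Under these assumptions, a smaller `factor` leaves a larger second matrix, so the modules configured with `factor: 2` would carry more trainable parameters per layer than those with `factor: 4`.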
+## Datasets
+### BESCH-CROP-SD35M-V03-512
+- Repeats: 1
+- Total number of images: 101
+- Total number of aspect buckets: 1
+- Resolution: 0.262144 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-CROP-SD35M-V03-768
+- Repeats: 1
+- Total number of images: 101
+- Total number of aspect buckets: 1
+- Resolution: 0.589824 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-CROP-SD35M-V03-1024
+- Repeats: 1
+- Total number of images: 101
+- Total number of aspect buckets: 1
+- Resolution: 1.048576 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-MIX-SD35M-V03-512
+- Repeats: 3
+- Total number of images: 202
+- Total number of aspect buckets: 2
+- Resolution: 0.262144 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-MIX-SD35M-V03-768
+- Repeats: 3
+- Total number of images: 202
+- Total number of aspect buckets: 12
+- Resolution: 0.589824 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-MIX-SD35M-V03-1024
+- Repeats: 3
+- Total number of images: 201
+- Total number of aspect buckets: 2
+- Resolution: 1.048576 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-MIX-SD35M-V03-1280
+- Repeats: 3
+- Total number of images: 199
+- Total number of aspect buckets: 14
+- Resolution: 1.6384 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-ORIGINAL-SD35M-V03-512
+- Repeats: 3
+- Total number of images: 68
+- Total number of aspect buckets: 3
+- Resolution: 0.262144 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-ORIGINAL-SD35M-V03-768
+- Repeats: 3
+- Total number of images: 68
+- Total number of aspect buckets: 4
+- Resolution: 0.589824 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
+### BESCH-ORIGINAL-SD35M-V03-1024
+- Repeats: 3
+- Total number of images: 68
+- Total number of aspect buckets: 1
+- Resolution: 1.048576 megapixels
+- Cropped: True
+- Crop style: random
+- Crop aspect: closest
+- Used for regularisation data: No
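The resolutions listed for each dataset are pixel areas in megapixels. Assuming square targets, they line up with the edge lengths in the dataset names (512, 768, 1024, 1280), as this short check illustrates; the `megapixels` helper is defined here only for the example:

```python
def megapixels(width, height):
    """Pixel area expressed in millions of pixels."""
    return width * height / 1_000_000


# Edge lengths taken from the dataset names above.
areas = {edge: megapixels(edge, edge) for edge in (512, 768, 1024, 1280)}
for edge, area in areas.items():
    print(f"{edge}x{edge}: {area} MP")

# The 832x1216 validation resolution, for comparison.
validation_mp = megapixels(832, 1216)
print(f"832x1216: {validation_mp} MP")
```

The validation resolution works out to roughly 1.01 megapixels, close to the 1024 datasets.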
+## Inference
+```python
+import torch
+from diffusers import DiffusionPipeline
+from lycoris import create_lycoris_from_weights
+def download_adapter(repo_id: str):
+    import os
+    from huggingface_hub import hf_hub_download
+    adapter_filename = "pytorch_lora_weights.safetensors"
+    cache_dir = os.environ.get('HF_PATH', os.path.expanduser('~/.cache/huggingface/hub/models'))
+    cleaned_adapter_path = repo_id.replace("/", "_").replace("\\", "_").replace(":", "_")
+    path_to_adapter = os.path.join(cache_dir, cleaned_adapter_path)
+    path_to_adapter_file = os.path.join(path_to_adapter, adapter_filename)
+    os.makedirs(path_to_adapter, exist_ok=True)
+    hf_hub_download(
+        repo_id=repo_id, filename=adapter_filename, local_dir=path_to_adapter
+    )
+    return path_to_adapter_file
+model_id = 'stabilityai/stable-diffusion-3.5-medium'
+adapter_repo_id = 'gattaplayer/besch-style-st-sd35m-lokr-8e-5-bs6-v03'
+# download_adapter() already knows the adapter filename, so only the repo id is needed here.
+adapter_file_path = download_adapter(repo_id=adapter_repo_id)
+pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
+lora_scale = 1.0
+# Wrap the transformer with the LyCORIS weights, then merge them into the base weights.
+wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_file_path, pipeline.transformer)
+wrapper.merge_to()
+prompt = "An astronaut is riding a horse through the jungles of Thailand."
+negative_prompt = 'blurry, cropped, ugly'
+# Optional: quantise the model to save on VRAM.
+# Note: The model was quantised during training, and so it is recommended to do the same during inference time.
+from optimum.quanto import quantize, freeze, qint8
+quantize(pipeline.transformer, weights=qint8)
+freeze(pipeline.transformer)
+# Pick the best available device once and reuse it for the pipeline and the generator.
+device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
+pipeline.to(device) # the pipeline is already in its target precision level
+image = pipeline(
+    prompt=prompt,
+    negative_prompt=negative_prompt,
+    num_inference_steps=30,
+    generator=torch.Generator(device=device).manual_seed(42),
+    width=832,
+    height=1216,
+    guidance_scale=6.0,
+    skip_guidance_layers=[7, 8, 9],
+).images[0]
+image.save("output.png", format="PNG")
+```