multimodalart HF staff committed on
Commit
7f6d837
1 Parent(s): d279db2

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,9 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ image-0.png filter=lfs diff=lfs merge=lfs -text
37
+ image-1.png filter=lfs diff=lfs merge=lfs -text
38
+ image-2.png filter=lfs diff=lfs merge=lfs -text
39
+ image-3.png filter=lfs diff=lfs merge=lfs -text
40
+ image-4.png filter=lfs diff=lfs merge=lfs -text
41
+ image-5.png filter=lfs diff=lfs merge=lfs -text
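The six added `.gitattributes` entries route each sample image through Git LFS. As a rough illustration of which paths the patterns visible in this hunk would cover, here is a minimal Python sketch. It approximates gitattributes matching by testing each pattern against the file's basename with `fnmatchcase`, which is an assumption that suits simple slash-free patterns like these but is not a full implementation of Git's matching rules. The helper name `is_lfs_tracked` is hypothetical.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Only the patterns visible in this hunk; the real file has more entries above line 33.
LFS_PATTERNS = [
    "*.zip", "*.zst", "*tfevents*",
    "image-0.png", "image-1.png", "image-2.png",
    "image-3.png", "image-4.png", "image-5.png",
]

def is_lfs_tracked(path, patterns=LFS_PATTERNS):
    """Approximate check: match every pattern against the basename only."""
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in patterns)

for candidate in ("image-3.png", "README.md", "logs/run/events.out.tfevents.1704210123"):
    print(candidate, is_lfs_tracked(candidate))
```

Because the images are listed one by one rather than with a `*.png` glob, a PNG with any other name would not be picked up by these entries.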
README.md ADDED
@@ -0,0 +1,92 @@
1
+ ---
2
+ tags:
3
+ - stable-diffusion-xl
4
+ - stable-diffusion-xl-diffusers
5
+ - text-to-image
6
+ - diffusers
7
+ - lora
8
+ - template:sd-lora
9
+ widget:
10
+ - text: A photo of <s0><s1> man wearing headphones and a blue shirt
11
+ output:
12
+ url: image-0.png
13
+ - text: A photo of <s0><s1> man with glasses and a beard smiles
14
+ output:
15
+ url: image-1.png
16
+ - text: A photo of <s0><s1> bald man with glasses and a colorful shirt
17
+ output:
18
+ url: image-2.png
19
+ - text: A photo of <s0><s1> man with glasses and a hat wearing an orange cap
20
+ output:
21
+ url: image-3.png
22
+ - text: A photo of <s0><s1> man wearing glasses and a yellow hat taking a selfie
23
+ output:
24
+ url: image-4.png
25
+ - text: A photo of <s0><s1> man wearing a yellow hat and glasses
26
+ output:
27
+ url: image-5.png
28
+ base_model: stabilityai/stable-diffusion-xl-base-1.0
29
+ instance_prompt: A photo of <s0><s1>
30
+ license: openrail++
31
+ ---
32
+
33
+ # SDXL LoRA DreamBooth - multimodalart/apolinario-face-final
34
+
35
+ <Gallery />
36
+
37
+ ## Model description
38
+
39
+ ### These are multimodalart/apolinario-face-final LoRA adaptation weights for stabilityai/stable-diffusion-xl-base-1.0.
40
+
41
+ ## Download model
42
+
43
+ ### Use it with UIs such as AUTOMATIC1111, ComfyUI, SD.Next, Invoke
44
+
45
+ - **LoRA**: download **[`apolinario-face-final.safetensors` here 💾](/multimodalart/apolinario-face-final/blob/main/apolinario-face-final.safetensors)**.
46
+ - Place it in your `models/Lora` folder.
47
+ - In AUTOMATIC1111, load the LoRA by adding `<lora:apolinario-face-final:1>` to your prompt. In ComfyUI, [load it as a regular LoRA](https://comfyanonymous.github.io/ComfyUI_examples/lora/).
48
+ - **Embeddings**: download **[`apolinario-face-final_emb.safetensors` here 💾](/multimodalart/apolinario-face-final/blob/main/apolinario-face-final_emb.safetensors)**.
49
+ - Place it in your `embeddings` folder.
50
+ - Use it by adding `apolinario-face-final_emb` to your prompt. For example, `A photo of apolinario-face-final_emb`
51
+ (You need both the LoRA and the embeddings, as they were trained together.)
52
+
53
+
54
+ ## Use it with the [🧨 diffusers library](https://github.com/huggingface/diffusers)
55
+
56
+ ```py
57
+ from diffusers import AutoPipelineForText2Image
58
+ import torch
59
+ from huggingface_hub import hf_hub_download
60
+ from safetensors.torch import load_file
61
+
62
+ pipeline = AutoPipelineForText2Image.from_pretrained('stabilityai/stable-diffusion-xl-base-1.0', torch_dtype=torch.float16).to('cuda')
63
+ pipeline.load_lora_weights('multimodalart/apolinario-face-final', weight_name='pytorch_lora_weights.safetensors')
64
+ embedding_path = hf_hub_download(repo_id='multimodalart/apolinario-face-final', filename='apolinario-face-final_emb.safetensors', repo_type="model")
65
+ state_dict = load_file(embedding_path)
66
+ pipeline.load_textual_inversion(state_dict["clip_l"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder, tokenizer=pipeline.tokenizer)
67
+ pipeline.load_textual_inversion(state_dict["clip_g"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder_2, tokenizer=pipeline.tokenizer_2)
68
+
69
+ image = pipeline('A photo of <s0><s1>').images[0]
70
+ ```
71
+
72
+ For more details, including weighting, merging and fusing LoRAs, check the [documentation on loading LoRAs in diffusers](https://huggingface.co/docs/diffusers/main/en/using-diffusers/loading_adapters).
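As one illustration of LoRA weighting, the sketch below runs the same prompt at several LoRA strengths so the results can be compared side by side. It assumes the pipeline accepts `cross_attention_kwargs={"scale": ...}`, the mechanism SDXL pipelines in diffusers expose for scaling a loaded LoRA. The helper name `sweep_lora_scales` and the scale values are illustrative and do not come from the model card.

```python
def sweep_lora_scales(pipeline, prompt, scales):
    """Generate one image per LoRA scale and return them keyed by scale.

    `pipeline` is expected to be a diffusers pipeline with the LoRA weights
    and both textual-inversion embeddings already loaded.
    """
    images = {}
    for scale in scales:
        # Assumption: the pipeline reads the LoRA strength from this kwarg.
        result = pipeline(prompt, cross_attention_kwargs={"scale": scale})
        images[scale] = result.images[0]
    return images

# Hypothetical usage with the pipeline built in the model card's snippet:
# images = sweep_lora_scales(pipeline, "A photo of <s0><s1>", [0.6, 0.8, 1.0])
# images[0.8].save("scale-0.8.png")
```

Lower scales tend to keep more of the base model's behaviour, so a small sweep like this is a cheap way to pick a value before generating in bulk.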
73
+
74
+ ## Trigger words
75
+
76
+ To trigger image generation of the trained concept (or concepts), replace each concept identifier in your prompt with the newly inserted tokens:
77
+
78
+ to trigger concept `TOK` → use `<s0><s1>` in your prompt
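The substitution can be sketched in a few lines of Python. This assumes the new tokens follow the `<s0>`, `<s1>`, ... naming seen in this card and that there are two tokens per concept, matching `token_abstraction: TOK` and `num_new_tokens_per_abstraction: 2` in the logged hyperparameters. The helper names are hypothetical.

```python
def new_tokens(num_tokens=2, start=0):
    """Build the inserted-token string, e.g. '<s0><s1>' for two tokens."""
    return "".join(f"<s{i}>" for i in range(start, start + num_tokens))

def expand_prompt(prompt, abstraction="TOK", num_tokens=2):
    """Replace the concept identifier with the inserted tokens.

    Uses a plain substring replace, so an identifier that also occurs
    inside another word would be replaced as well.
    """
    return prompt.replace(abstraction, new_tokens(num_tokens))

print(expand_prompt("A photo of TOK man wearing a yellow hat"))
# prints: A photo of <s0><s1> man wearing a yellow hat
```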
79
+
80
+
81
+
82
+ ## Details
83
+ All [Files & versions](/multimodalart/apolinario-face-final/tree/main).
84
+
85
+ The weights were trained using [🧨 diffusers Advanced Dreambooth Training Script](https://github.com/huggingface/diffusers/blob/main/examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py).
86
+
87
+ LoRA for the text encoder was enabled: False.
88
+
89
+ Pivotal tuning was enabled: True.
90
+
91
+ Special VAE used for training: madebyollin/sdxl-vae-fp16-fix.
92
+
apolinario-face-final.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c24c76139fbb3bcbee3eb957237cb91409d068375d5aa94cd79fd734a5bf991d
3
+ size 186046568
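The three added lines are a Git LFS pointer: the repository stores this small text stub, while the actual weights live in LFS storage. A minimal Python sketch of reading such a pointer and checking a downloaded file against it follows. The function names are hypothetical, and the parser assumes only the three `key value` lines shown here.

```python
import hashlib

def parse_lfs_pointer(text):
    """Split a pointer file into version, hash algorithm, digest and byte size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}

def matches_pointer(path, pointer):
    """Check a local file's size and hash against a parsed pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]

POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:c24c76139fbb3bcbee3eb957237cb91409d068375d5aa94cd79fd734a5bf991d
size 186046568
"""
info = parse_lfs_pointer(POINTER)
print(info["algo"], info["size"])
```

For the pointer above, the recorded size is 186046568 bytes, which is roughly 186 MB.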
apolinario-face-final_emb.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4b6903c7e01ef2894e322ac85bc4c4e371e2cedaacfa1f9d84dc9c3c644bd15
3
+ size 8344
image-0.png ADDED

Git LFS Details

  • SHA256: 9c6fda65028afb3426cfde7499318f0c8a90aab98c365d6739d5ee917285f1d7
  • Pointer size: 132 Bytes
  • Size of remote file: 1.4 MB
image-1.png ADDED

Git LFS Details

  • SHA256: 73d47b1f31963f62869967a9b22a9095718d05ae94bee49bac1f465797b7b37d
  • Pointer size: 132 Bytes
  • Size of remote file: 1.44 MB
image-2.png ADDED

Git LFS Details

  • SHA256: 8aeed521a1518ad8ebd3faebee68724331dbfd9532892c986ab3b72bada059ff
  • Pointer size: 132 Bytes
  • Size of remote file: 1.56 MB
image-3.png ADDED

Git LFS Details

  • SHA256: ac93f4b182fa923c67ea61f16a6196fa8a7fbf5556ee7e396ad64d8dcde77a9b
  • Pointer size: 132 Bytes
  • Size of remote file: 1.46 MB
image-4.png ADDED

Git LFS Details

  • SHA256: 4a09b3fe01725cea1b6edd61505fda6c8101807c79eb5ae91a8d93e1632e6e54
  • Pointer size: 132 Bytes
  • Size of remote file: 1.44 MB
image-5.png ADDED

Git LFS Details

  • SHA256: 8f475c0443c504741dcdec1d42294fb2ea3b6684a3a39b96bbbb1d5fb7390531
  • Pointer size: 132 Bytes
  • Size of remote file: 1.45 MB
logs/dreambooth-lora-sd-xl/1704210123.5538557/events.out.tfevents.1704210123.r-multimodalart-autotrain-apolinario-face-final-dirsn-8954c8dd8.214.1 ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e714a4c7e1e3d7ecbf6491b02fccd4b4d74770dd19e52165a596443a427a0666
3
+ size 3669
logs/dreambooth-lora-sd-xl/1704210123.5558538/hparams.yml ADDED
@@ -0,0 +1,74 @@
1
+ adam_beta1: 0.9
2
+ adam_beta2: 0.999
3
+ adam_epsilon: 1.0e-08
4
+ adam_weight_decay: 0.0001
5
+ adam_weight_decay_text_encoder: null
6
+ allow_tf32: false
7
+ cache_dir: null
8
+ cache_latents: true
9
+ caption_column: prompt
10
+ center_crop: false
11
+ checkpointing_steps: 5000
12
+ checkpoints_total_limit: null
13
+ class_data_dir: 0275ca88-08a5-492b-a8f4-4444f247e0f5
14
+ class_prompt: a photo of a person
15
+ crops_coords_top_left_h: 0
16
+ crops_coords_top_left_w: 0
17
+ dataloader_num_workers: 0
18
+ dataset_config_name: null
19
+ dataset_name: ./6875eaf9-781a-4e2f-9aa1-e7f5c47a3d77
20
+ enable_xformers_memory_efficient_attention: false
21
+ gradient_accumulation_steps: 1
22
+ gradient_checkpointing: true
23
+ hub_model_id: null
24
+ hub_token: null
25
+ image_column: image
26
+ instance_data_dir: null
27
+ instance_prompt: A photo of <s0><s1>
28
+ learning_rate: 1.0
29
+ local_rank: -1
30
+ logging_dir: logs
31
+ lr_num_cycles: 1
32
+ lr_power: 1.0
33
+ lr_scheduler: constant
34
+ lr_warmup_steps: 0
35
+ max_grad_norm: 1.0
36
+ max_train_steps: 500
37
+ mixed_precision: bf16
38
+ num_class_images: 150
39
+ num_new_tokens_per_abstraction: 2
40
+ num_train_epochs: 7
41
+ num_validation_images: 4
42
+ optimizer: prodigy
43
+ output_dir: apolinario-face-final
44
+ pretrained_model_name_or_path: stabilityai/stable-diffusion-xl-base-1.0
45
+ pretrained_vae_model_name_or_path: madebyollin/sdxl-vae-fp16-fix
46
+ prior_generation_precision: null
47
+ prior_loss_weight: 1.0
48
+ prodigy_beta3: null
49
+ prodigy_decouple: true
50
+ prodigy_safeguard_warmup: true
51
+ prodigy_use_bias_correction: true
52
+ push_to_hub: false
53
+ rank: 32
54
+ repeats: 3
55
+ report_to: tensorboard
56
+ resolution: 1024
57
+ resume_from_checkpoint: null
58
+ revision: null
59
+ sample_batch_size: 4
60
+ scale_lr: false
61
+ seed: 42
62
+ snr_gamma: null
63
+ text_encoder_lr: 1.0
64
+ token_abstraction: TOK
65
+ train_batch_size: 2
66
+ train_text_encoder: false
67
+ train_text_encoder_frac: 1.0
68
+ train_text_encoder_ti: true
69
+ train_text_encoder_ti_frac: 0.5
70
+ use_8bit_adam: false
71
+ validation_epochs: 50
72
+ validation_prompt: null
73
+ variant: null
74
+ with_prior_preservation: true
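A few derived quantities make these hyperparameters easier to read. The sketch below computes them from the logged values. It makes two assumptions that the log does not confirm: that training ran on a single process, and that the pivotal-tuning cutoff is the `train_text_encoder_ti_frac` share of `num_train_epochs` truncated to an integer.

```python
# Values copied from hparams.yml above.
hparams = {
    "train_batch_size": 2,
    "gradient_accumulation_steps": 1,
    "max_train_steps": 500,
    "num_train_epochs": 7,
    "train_text_encoder_ti_frac": 0.5,
}

num_processes = 1  # assumption: single-process run, not stated in the log

# Examples consumed per optimizer step.
effective_batch = (hparams["train_batch_size"]
                   * hparams["gradient_accumulation_steps"]
                   * num_processes)

# Total examples drawn over the whole run (instance and class images combined).
samples_seen = effective_batch * hparams["max_train_steps"]

# Assumed cutoff: epochs during which the new token embeddings are still trained.
ti_epochs = int(hparams["train_text_encoder_ti_frac"] * hparams["num_train_epochs"])

print(effective_batch, samples_seen, ti_epochs)
# prints: 2 1000 3
```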
logs/dreambooth-lora-sd-xl/events.out.tfevents.1704210123.r-multimodalart-autotrain-apolinario-face-final-dirsn-8954c8dd8.214.0 ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94abb94029b98fc5bf352ef8fd3f159adf5d21d87226b3e1e8f125b17fff9ade
3
+ size 41834
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:346e8b9071d965c297693abaa3e8b6b8f9eca99a435fc78c9d347c09bd44ada4
3
+ size 185963768