bghira commited on
Commit
ae46311
1 Parent(s): 3547081

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +263 -0
README.md ADDED
@@ -0,0 +1,263 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "black-forest-labs/FLUX.1-dev"
4
+ tags:
5
+ - flux
6
+ - flux-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - lora
11
+ - template:sd-lora
12
+ inference: true
13
+ widget:
14
+ - text: 'unconditional (blank prompt)'
15
+ parameters:
16
+ negative_prompt: 'blurry, cropped, ugly'
17
+ output:
18
+ url: ./assets/image_0_0.png
19
+ - text: 'unconditional (blank prompt)'
20
+ parameters:
21
+ negative_prompt: 'blurry, cropped, ugly'
22
+ output:
23
+ url: ./assets/image_1_1.png
24
+ - text: 'unconditional (blank prompt)'
25
+ parameters:
26
+ negative_prompt: 'blurry, cropped, ugly'
27
+ output:
28
+ url: ./assets/image_2_2.png
29
+ - text: 'moonman riding a motorcycle to the sun'
30
+ parameters:
31
+ negative_prompt: 'blurry, cropped, ugly'
32
+ output:
33
+ url: ./assets/image_3_0.png
34
+ - text: 'moonman riding a motorcycle to the sun'
35
+ parameters:
36
+ negative_prompt: 'blurry, cropped, ugly'
37
+ output:
38
+ url: ./assets/image_4_1.png
39
+ - text: 'moonman riding a motorcycle to the sun'
40
+ parameters:
41
+ negative_prompt: 'blurry, cropped, ugly'
42
+ output:
43
+ url: ./assets/image_5_2.png
44
+ - text: 'moonman on the theatre broadway doing a musical. the banner overhead reads TED TALK by MOONMAN'
45
+ parameters:
46
+ negative_prompt: 'blurry, cropped, ugly'
47
+ output:
48
+ url: ./assets/image_6_0.png
49
+ - text: 'moonman on the theatre broadway doing a musical. the banner overhead reads TED TALK by MOONMAN'
50
+ parameters:
51
+ negative_prompt: 'blurry, cropped, ugly'
52
+ output:
53
+ url: ./assets/image_7_1.png
54
+ - text: 'moonman on the theatre broadway doing a musical. the banner overhead reads TED TALK by MOONMAN'
55
+ parameters:
56
+ negative_prompt: 'blurry, cropped, ugly'
57
+ output:
58
+ url: ./assets/image_8_2.png
59
+ - text: 'moonman the DJ is performing at a club in ibitha in the year 1944, there is old timey rave feeling'
60
+ parameters:
61
+ negative_prompt: 'blurry, cropped, ugly'
62
+ output:
63
+ url: ./assets/image_9_0.png
64
+ - text: 'moonman the DJ is performing at a club in ibitha in the year 1944, there is old timey rave feeling'
65
+ parameters:
66
+ negative_prompt: 'blurry, cropped, ugly'
67
+ output:
68
+ url: ./assets/image_10_1.png
69
+ - text: 'moonman the DJ is performing at a club in ibitha in the year 1944, there is old timey rave feeling'
70
+ parameters:
71
+ negative_prompt: 'blurry, cropped, ugly'
72
+ output:
73
+ url: ./assets/image_11_2.png
74
+ - text: 'moonman the anime character fighting with an anthropomorphic tree'
75
+ parameters:
76
+ negative_prompt: 'blurry, cropped, ugly'
77
+ output:
78
+ url: ./assets/image_12_0.png
79
+ - text: 'moonman the anime character fighting with an anthropomorphic tree'
80
+ parameters:
81
+ negative_prompt: 'blurry, cropped, ugly'
82
+ output:
83
+ url: ./assets/image_13_1.png
84
+ - text: 'moonman the anime character fighting with an anthropomorphic tree'
85
+ parameters:
86
+ negative_prompt: 'blurry, cropped, ugly'
87
+ output:
88
+ url: ./assets/image_14_2.png
89
+ - text: 'moonman a rollercoaster mechanic working hard on the underlying gears and structures of the famous rollercoaster'
90
+ parameters:
91
+ negative_prompt: 'blurry, cropped, ugly'
92
+ output:
93
+ url: ./assets/image_15_0.png
94
+ - text: 'moonman a rollercoaster mechanic working hard on the underlying gears and structures of the famous rollercoaster'
95
+ parameters:
96
+ negative_prompt: 'blurry, cropped, ugly'
97
+ output:
98
+ url: ./assets/image_16_1.png
99
+ - text: 'moonman a rollercoaster mechanic working hard on the underlying gears and structures of the famous rollercoaster'
100
+ parameters:
101
+ negative_prompt: 'blurry, cropped, ugly'
102
+ output:
103
+ url: ./assets/image_17_2.png
104
+ - text: 'moonman the musician on stage performing to a crowd of people'
105
+ parameters:
106
+ negative_prompt: 'blurry, cropped, ugly'
107
+ output:
108
+ url: ./assets/image_18_0.png
109
+ - text: 'moonman the musician on stage performing to a crowd of people'
110
+ parameters:
111
+ negative_prompt: 'blurry, cropped, ugly'
112
+ output:
113
+ url: ./assets/image_19_1.png
114
+ - text: 'moonman the musician on stage performing to a crowd of people'
115
+ parameters:
116
+ negative_prompt: 'blurry, cropped, ugly'
117
+ output:
118
+ url: ./assets/image_20_2.png
119
+ - text: 'A photo-realistic image of a moonman cat'
120
+ parameters:
121
+ negative_prompt: 'blurry, cropped, ugly'
122
+ output:
123
+ url: ./assets/image_21_0.png
124
+ - text: 'A photo-realistic image of a moonman cat'
125
+ parameters:
126
+ negative_prompt: 'blurry, cropped, ugly'
127
+ output:
128
+ url: ./assets/image_22_1.png
129
+ - text: 'A photo-realistic image of a moonman cat'
130
+ parameters:
131
+ negative_prompt: 'blurry, cropped, ugly'
132
+ output:
133
+ url: ./assets/image_23_2.png
134
+ ---
135
+
136
+ # simpletuner-lokr-moonman
137
+
138
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
139
+
140
+
141
+ The main validation prompt used during training was:
142
+
143
+
144
+
145
+ ```
146
+ A photo-realistic image of a moonman cat
147
+ ```
148
+
149
+ ## Validation settings
150
+ - CFG: `3.0`
151
+ - CFG Rescale: `0.0`
152
+ - Steps: `20`
153
+ - Sampler: `None`
154
+ - Seed: `42`
155
+ - Resolutions: `512x512,1024x1024,1280x768`
156
+
157
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
158
+
159
+ You can find some example images in the following gallery:
160
+
161
+
162
+ <Gallery />
163
+
164
+ The text encoder **was not** trained.
165
+ You may reuse the base model text encoder for inference.
166
+
167
+
168
+ ## Training settings
169
+
170
+ - Training epochs: 6
171
+ - Training steps: 500
172
+ - Learning rate: 0.001
173
+ - Effective batch size: 1
174
+ - Micro-batch size: 1
175
+ - Gradient accumulation steps: 1
176
+ - Number of GPUs: 1
177
+ - Prediction type: flow-matching
178
+ - Rescaled betas zero SNR: False
179
+ - Optimizer: optimi-stableadamw
180
+ - Precision: bf16
181
+ - Quantised: Yes: int8-quanto
182
+ - Xformers: Not used
183
+ - LyCORIS Config:
184
+ ```json
185
+ {
186
+ "algo": "lokr",
187
+ "multiplier": 1.0,
188
+ "linear_dim": 10000,
189
+ "linear_alpha": 1,
190
+ "factor": 16,
191
+ "apply_preset": {
192
+ "target_module": [
193
+ "Attention",
194
+ "FeedForward"
195
+ ],
196
+ "module_algo_map": {
197
+ "Attention": {
198
+ "factor": 16
199
+ },
200
+ "FeedForward": {
201
+ "factor": 8
202
+ }
203
+ }
204
+ }
205
+ }
206
+ ```
207
+
208
+ ## Datasets
209
+
210
+ ### moonman1024
211
+ - Repeats: 0
212
+ - Total number of images: 26
213
+ - Total number of aspect buckets: 6
214
+ - Resolution: 1.048576 megapixels
215
+ - Cropped: False
216
+ - Crop style: None
217
+ - Crop aspect: None
218
+ ### moonman512
219
+ - Repeats: 0
220
+ - Total number of images: 26
221
+ - Total number of aspect buckets: 7
222
+ - Resolution: 0.262144 megapixels
223
+ - Cropped: False
224
+ - Crop style: None
225
+ - Crop aspect: None
226
+ ### moonman768
227
+ - Repeats: 0
228
+ - Total number of images: 26
229
+ - Total number of aspect buckets: 6
230
+ - Resolution: 0.589824 megapixels
231
+ - Cropped: False
232
+ - Crop style: None
233
+ - Crop aspect: None
234
+
235
+
236
+ ## Inference
237
+
238
+
239
+ ```python
240
+ import torch
241
+ from diffusers import DiffusionPipeline
242
+ from lycoris import create_lycoris_from_weights
243
+
244
+ model_id = 'black-forest-labs/FLUX.1-dev'
245
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
246
+ lora_scale = 1.0
247
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
248
+ wrapper.merge_to()
249
+
250
+ prompt = "A photo-realistic image of a moonman cat"
251
+
252
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
253
+ image = pipeline(
254
+ prompt=prompt,
255
+ num_inference_steps=20,
256
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
257
+ width=512,
258
+ height=512,
259
+ guidance_scale=3.0,
260
+ ).images[0]
261
+ image.save("output.png", format="PNG")
262
+ ```
263
+