PseudoTerminal X
committed on
Trained for 0 epochs and 5800 steps.
Trained with datasets ['text-embeds-sdxl', 'photocb-clip-embeds', 'celebrities', 'movieposters', 'normalnudes', 'propagandaposters', 'guys', 'pixel-art', 'signs', 'moviecollection', 'bookcovers', 'nijijourney', 'experimental', 'ethnic', 'sports', 'gay', 'architecture', 'shutterstock', 'cinemamix-1mp', 'nsfw-1024', 'anatomy', 'bg20k-1024', 'yoga', 'photo-aesthetics', 'text-1mp', 'photo-concept-bucket']
Learning rate 1e-06, batch size 8, and 4 gradient accumulation steps.
Used DDPM noise scheduler for training with v_prediction prediction type and rescaled_betas_zero_snr=True
Using 'trailing' timestep spacing.
Base model: ptx0/terminus-xl-velocity-v1
VAE: madebyollin/sdxl-vae-fp16-fix
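The 'trailing' timestep spacing named in the commit message can be illustrated with a small sketch. It assumes 1000 training timesteps and 10 sampling steps, neither of which is stated in this commit, and the function name `trailing_timesteps` is hypothetical. The formula mirrors the one commonly used by diffusion schedulers for this spacing mode; it is not taken from the training code itself.

```python
def trailing_timesteps(num_train_timesteps: int, num_inference_steps: int) -> list:
    """Return descending timesteps that begin at the final training timestep."""
    step_ratio = num_train_timesteps / num_inference_steps
    # Count down from the end of the schedule, then shift to 0-based indexing.
    return [
        round(num_train_timesteps - i * step_ratio) - 1
        for i in range(num_inference_steps)
    ]


# Assumed values for illustration only.
timesteps = trailing_timesteps(1000, 10)
print(timesteps)
```

Schedules rescaled to zero terminal SNR are usually sampled starting from the last training timestep, which is what this spacing provides.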
- README.md +3 -3
- optimizer.bin +1 -1
- random_states_0.pkl +1 -1
- scheduler.bin +1 -1
- training_state-anatomy.json +0 -0
- training_state-bg20k-1024.json +2 -2
- training_state-nsfw-1024.json +0 -0
- training_state-photo-aesthetics.json +0 -0
- training_state-photo-concept-bucket.json +2 -2
- training_state-shutterstock.json +0 -0
- training_state-text-1mp.json +0 -0
- training_state.json +1 -1
- unet/config.json +1 -1
- unet/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -44,7 +44,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings
 
 - Training epochs: 0
-- Training steps:
+- Training steps: 5800
 - Learning rate: 1e-06
 - Effective batch size: 32
 - Micro-batch size: 8
@@ -213,7 +213,7 @@ You may reuse the base model text encoder for inference.
 ### bg20k-1024
 - Repeats: 0
 - Total number of images: 89250
-- Total number of aspect buckets:
+- Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: True
 - Crop style: random
@@ -235,7 +235,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### text-1mp
-- Repeats:
+- Repeats: 25
 - Total number of images: 13123
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
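The effective batch size of 32 in the README settings follows from the micro-batch size of 8 and the 4 gradient accumulation steps given in the commit message. The sketch below shows that arithmetic. It assumes a single training process, which the commit does not state, and it assumes each counted training step consumes one effective batch.

```python
micro_batch_size = 8              # from the commit message
gradient_accumulation_steps = 4   # from the commit message
num_processes = 1                 # assumption: process count is not stated

effective_batch_size = (
    micro_batch_size * gradient_accumulation_steps * num_processes
)

training_steps = 5800             # from the README diff
# Assumption: every counted step consumes one effective batch.
samples_seen = training_steps * effective_batch_size

print(effective_batch_size, samples_seen)
```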
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:b2955a06ec624e837ad3705e23ce727473341d891fd0b9a8bdc2211e648e145a
 size 15406336826
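The binary files in this commit are stored as Git LFS pointers: short text files made of `key value` lines giving the spec version, the object's SHA-256, and its size in bytes. A minimal sketch for reading one is shown here, using the new `optimizer.bin` pointer from this diff. The helper name `parse_lfs_pointer` is hypothetical.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:b2955a06ec624e837ad3705e23ce727473341d891fd0b9a8bdc2211e648e145a
size 15406336826
"""

info = parse_lfs_pointer(pointer)
size_gib = info["size"] / 2**30
print(info["oid"], round(size_gib, 2))
```

The size field is unchanged between the old and new pointers, so only the object hash differs: the optimizer state was rewritten at the same byte size, a little over 14 GiB.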
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:946b5e8032258dd0529d94abcd17e9115d4b1baad482f372052f9f3938b62c1d
 size 14344
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:bf540ec46f1645bf1bd0591d26680b80d10dfc410b377b20c2ff7e114588fba6
 size 1000
training_state-anatomy.json
CHANGED
The diff for this file is too large to render.
training_state-bg20k-1024.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:66e67db2db7881eee6610fc3ba7f3b16d038d63ceb11f5776d9828c5cdc20998
+size 16310449
training_state-nsfw-1024.json
CHANGED
The diff for this file is too large to render.
training_state-photo-aesthetics.json
CHANGED
The diff for this file is too large to render.
training_state-photo-concept-bucket.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:ef9906eb25f3120400a4a2205156f8a0526f516b9eeba6ed92a1831e5d92d4c3
+size 15501702
training_state-shutterstock.json
CHANGED
The diff for this file is too large to render.
training_state-text-1mp.json
CHANGED
The diff for this file is too large to render.
training_state.json
CHANGED
@@ -1 +1 @@
-{"global_step":
+{"global_step": 5800, "epoch_step": 160, "epoch": 1, "exhausted_backends": ["guys", "signs", "nijijourney", "propagandaposters", "bookcovers", "pixel-art", "normalnudes", "celebrities", "sports", "movieposters", "moviecollection", "gay", "ethnic", "experimental", "yoga", "architecture", "cinemamix-1mp"], "repeats": {"guys": 0, "signs": 0, "nijijourney": 0, "propagandaposters": 0, "bookcovers": 0, "pixel-art": 0, "normalnudes": 0, "celebrities": 0, "sports": 0, "movieposters": 0, "moviecollection": 0, "gay": 0, "ethnic": 0, "experimental": 0, "yoga": 0, "architecture": 0, "cinemamix-1mp": 0}}
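The new `training_state.json` lists the dataset backends marked as exhausted at step 5800. The sketch below compares that list against the datasets named in the commit message to see which ones are not yet exhausted. The JSON string is a trimmed copy of the line above (the `repeats` map is omitted for brevity), and reading `exhausted_backends` as "fully consumed for the current epoch" is an assumption about the trainer's semantics.

```python
import json

# Trimmed copy of the new training_state.json ("repeats" omitted for brevity).
state_json = (
    '{"global_step": 5800, "epoch_step": 160, "epoch": 1, '
    '"exhausted_backends": ["guys", "signs", "nijijourney", '
    '"propagandaposters", "bookcovers", "pixel-art", "normalnudes", '
    '"celebrities", "sports", "movieposters", "moviecollection", "gay", '
    '"ethnic", "experimental", "yoga", "architecture", "cinemamix-1mp"]}'
)

# Dataset list copied from the commit message.
datasets = [
    "text-embeds-sdxl", "photocb-clip-embeds", "celebrities", "movieposters",
    "normalnudes", "propagandaposters", "guys", "pixel-art", "signs",
    "moviecollection", "bookcovers", "nijijourney", "experimental", "ethnic",
    "sports", "gay", "architecture", "shutterstock", "cinemamix-1mp",
    "nsfw-1024", "anatomy", "bg20k-1024", "yoga", "photo-aesthetics",
    "text-1mp", "photo-concept-bucket",
]

state = json.loads(state_json)
exhausted = set(state["exhausted_backends"])

# Datasets from the commit message that are not marked as exhausted.
remaining = [name for name in datasets if name not in exhausted]
print(state["global_step"], remaining)
```

Apart from the two embedding caches, the remaining names match the per-dataset `training_state-*.json` files changed in this commit.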
unet/config.json
CHANGED
@@ -1,7 +1,7 @@
 {
 "_class_name": "UNet2DConditionModel",
 "_diffusers_version": "0.27.2",
-"_name_or_path": "/notebooks/datasets/models/checkpoint-
+"_name_or_path": "/notebooks/datasets/models/checkpoint-5700",
 "act_fn": "silu",
 "addition_embed_type": "text_time",
 "addition_embed_type_num_heads": 64,
unet/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:0457bc97dc56c993dfc8e256df6303a79dc9c0cd9d3a3fa482243beac513076a
 size 5135151440