PseudoTerminal X
committed on
Trained for 0 epochs and 5600 steps.
Trained with datasets ['text-embeds-sdxl', 'photocb-clip-embeds', 'celebrities', 'movieposters', 'normalnudes', 'propagandaposters', 'guys', 'pixel-art', 'signs', 'moviecollection', 'bookcovers', 'nijijourney', 'experimental', 'ethnic', 'sports', 'gay', 'architecture', 'shutterstock', 'cinemamix-1mp', 'nsfw-1024', 'anatomy', 'bg20k-1024', 'yoga', 'photo-aesthetics', 'text-1mp', 'photo-concept-bucket']
Learning rate 1e-06, batch size 8, and 4 gradient accumulation steps.
Used DDPM noise scheduler for training with v_prediction prediction type and rescaled_betas_zero_snr=True
Using 'trailing' timestep spacing.
Base model: ptx0/terminus-xl-velocity-v1
VAE: madebyollin/sdxl-vae-fp16-fix
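
The noise-schedule settings named in this commit message (zero terminal SNR rescaling and 'trailing' timestep spacing) can be illustrated with a minimal, dependency-free sketch. It is not the trainer's code. The function names are hypothetical, and the `scaled_linear` beta range (0.00085 to 0.012 over 1000 steps) is an assumption based on common SDXL defaults, not something stated in this commit.

```python
import math


def scaled_linear_betas(n=1000, beta_start=0.00085, beta_end=0.012):
    # Assumed SDXL-style schedule: linear in sqrt(beta), then squared.
    lo, hi = math.sqrt(beta_start), math.sqrt(beta_end)
    return [(lo + (hi - lo) * i / (n - 1)) ** 2 for i in range(n)]


def rescale_zero_terminal_snr(betas):
    """Rescale betas so the final timestep carries no signal (SNR = 0)."""
    # sqrt of the cumulative product of alphas = 1 - beta.
    alphas_bar_sqrt = []
    running = 1.0
    for beta in betas:
        running *= 1.0 - beta
        alphas_bar_sqrt.append(math.sqrt(running))
    first, last = alphas_bar_sqrt[0], alphas_bar_sqrt[-1]
    # Shift so the last value is exactly zero, then scale so the first
    # value stays (numerically almost) unchanged.
    scaled = [(v - last) * first / (first - last) for v in alphas_bar_sqrt]
    alphas_bar = [v * v for v in scaled]
    # Recover per-step alphas from the cumulative products, then betas.
    alphas = [alphas_bar[0]] + [
        alphas_bar[i] / alphas_bar[i - 1] for i in range(1, len(alphas_bar))
    ]
    return [1.0 - a for a in alphas]


def trailing_timesteps(num_train_timesteps, num_inference_steps):
    """'trailing' spacing: count down from the last training timestep."""
    step = num_train_timesteps / num_inference_steps
    return [
        round(num_train_timesteps - i * step) - 1
        for i in range(num_inference_steps)
    ]


betas = rescale_zero_terminal_snr(scaled_linear_betas())
timesteps = trailing_timesteps(1000, 10)
```

With trailing spacing the first sampled timestep is the final training timestep (999 of 1000), which is the one that the rescaling turns into pure noise. That is why the two settings are usually used together.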
- README.md +4 -4
- optimizer.bin +1 -1
- random_states_0.pkl +2 -2
- scheduler.bin +1 -1
- training_state-anatomy.json +0 -0
- training_state-bg20k-1024.json +2 -2
- training_state-nsfw-1024.json +0 -0
- training_state-photo-aesthetics.json +0 -0
- training_state-photo-concept-bucket.json +2 -2
- training_state-shutterstock.json +0 -0
- training_state-text-1mp.json +0 -0
- training_state.json +1 -1
- unet/config.json +1 -1
- unet/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -44,7 +44,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings
 
 - Training epochs: 0
-- Training steps:
+- Training steps: 5600
 - Learning rate: 1e-06
 - Effective batch size: 32
 - Micro-batch size: 8
@@ -67,7 +67,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### movieposters
-- Repeats:
+- Repeats: 25
 - Total number of images: 1728
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -107,7 +107,7 @@ You may reuse the base model text encoder for inference.
 - Crop style: random
 - Crop aspect: random
 ### signs
-- Repeats:
+- Repeats: 25
 - Total number of images: 352
 - Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
@@ -213,7 +213,7 @@ You may reuse the base model text encoder for inference.
 ### bg20k-1024
 - Repeats: 0
 - Total number of images: 89250
-- Total number of aspect buckets:
+- Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: True
 - Crop style: random
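
The effective batch size of 32 in the README is consistent with the micro-batch size of 8 and the 4 gradient accumulation steps from the commit message, assuming a single device (the device count is not stated on this page). A short arithmetic sketch, with a hypothetical helper name:

```python
def effective_batch_size(micro_batch, grad_accum_steps, num_devices=1):
    # num_devices=1 is an assumption; it is what makes 8 * 4 equal 32.
    return micro_batch * grad_accum_steps * num_devices


effective = effective_batch_size(micro_batch=8, grad_accum_steps=4)

# Samples consumed after the 5600 optimizer steps reported in the README.
samples_seen = effective * 5600
```

Under that assumption, the run had consumed 179,200 samples by this checkpoint.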
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:39aac7d3ef92c6c46baa04d9c50dbcd04f96fd15bb210fd5c0e9adf8a3f598b0
 size 15406336826
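
The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object hash, and the size in bytes. A minimal sketch of reading one, using the new `optimizer.bin` pointer from this diff; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling:

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:39aac7d3ef92c6c46baa04d9c50dbcd04f96fd15bb210fd5c0e9adf8a3f598b0\n"
    "size 15406336826\n"
)
```

Only the `oid` changes for `optimizer.bin`; the size stays the same, as expected for an optimizer state whose tensor shapes do not change between checkpoints.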
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:1bdafbf7bec3f162698104cd2d02020cd492ce580bf7e3d197dc5b98d0e74800
+size 14280
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:b6813e50b115c99d61ffcf61ce231df3f75282c5a4ea0ce645961bd0ef935048
 size 1000
training_state-anatomy.json
CHANGED
The diff for this file is too large to render.
training_state-bg20k-1024.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:91ec1218433718ad538871edf3103079633bf2c65e11a0c0ff0c22caa96dd196
+size 16135315
training_state-nsfw-1024.json
CHANGED
The diff for this file is too large to render.
training_state-photo-aesthetics.json
CHANGED
The diff for this file is too large to render.
training_state-photo-concept-bucket.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:ec5a0c8b7efa20fc10ac0c09cbfa7752e3b395f3405ff5a7474474308eac8f3a
+size 15466180
training_state-shutterstock.json
CHANGED
The diff for this file is too large to render.
training_state-text-1mp.json
CHANGED
The diff for this file is too large to render.
training_state.json
CHANGED
@@ -1 +1 @@
-{"global_step":
+{"global_step": 5600, "epoch_step": 141, "epoch": 1, "exhausted_backends": ["guys", "signs", "nijijourney", "propagandaposters", "bookcovers", "pixel-art", "normalnudes", "celebrities", "sports", "movieposters", "moviecollection", "gay", "ethnic", "experimental", "yoga", "architecture", "cinemamix-1mp"], "repeats": {"guys": 0, "signs": 0, "nijijourney": 0, "propagandaposters": 0, "bookcovers": 0, "pixel-art": 0, "normalnudes": 0, "celebrities": 0, "sports": 0, "movieposters": 0, "moviecollection": 0, "gay": 0, "ethnic": 0, "experimental": 0, "yoga": 0, "architecture": 0, "cinemamix-1mp": 0}}
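
The state file records the global step and lists the dataset backends under `exhausted_backends`, which presumably means those with no samples left in the current epoch. A small sketch of finding which backends are still active; the JSON below is an abbreviated stand-in for the real file, and `summarize_state` is a hypothetical helper:

```python
import json

# Abbreviated stand-in for training_state.json (same keys, fewer backends).
STATE_JSON = (
    '{"global_step": 5600, "epoch_step": 141, "epoch": 1, '
    '"exhausted_backends": ["guys", "signs", "yoga"], '
    '"repeats": {"guys": 0, "signs": 0, "yoga": 0}}'
)


def summarize_state(raw, all_backends):
    """Split the configured backends into exhausted and still-active ones."""
    state = json.loads(raw)
    exhausted = set(state["exhausted_backends"])
    remaining = [name for name in all_backends if name not in exhausted]
    return {
        "global_step": state["global_step"],
        "epoch": state["epoch"],
        "exhausted": sorted(exhausted),
        "remaining": remaining,
    }


summary = summarize_state(
    STATE_JSON, ["guys", "signs", "yoga", "bg20k-1024", "text-1mp"]
)
```

Applied to the full file in this commit, the same check leaves the large image datasets (for example `bg20k-1024` and `photo-concept-bucket`) as not yet exhausted, which matches the per-dataset `training_state-*.json` files updated here.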
unet/config.json
CHANGED
@@ -1,7 +1,7 @@
 {
   "_class_name": "UNet2DConditionModel",
   "_diffusers_version": "0.27.2",
-  "_name_or_path": "/notebooks/datasets/models/checkpoint-
+  "_name_or_path": "/notebooks/datasets/models/checkpoint-5500",
   "act_fn": "silu",
   "addition_embed_type": "text_time",
   "addition_embed_type_num_heads": 64,
unet/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:f18f1033e3bf14393b5d6f0ce51bb4a6eda072ca906902beeff95fb3f50985fb
 size 5135151440