PseudoTerminal X
committed on
Trained for 0 epochs and 2000 steps.
Trained with datasets ['text-embeds-sd3-nofilter', 'photo-concept-bucket', 'dalle3']
Learning rate 1e-06, batch size 6, and 2 gradient accumulation steps.
Used DDPM noise scheduler for training with epsilon prediction type and rescaled_betas_zero_snr=False
Using 'trailing' timestep spacing.
Base model: stabilityai/stable-diffusion-3-medium-diffusers
VAE: None
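The commit message names 'trailing' timestep spacing without defining it. The sketch below shows one common definition, in which the selected timesteps are counted back from the last training timestep so that the schedule always includes it. The function name `trailing_timesteps` and the example values (1000 training timesteps, 4 sampled steps) are illustrative assumptions; this commit does not show the scheduler code the trainer actually ran.

```python
def trailing_timesteps(num_train_timesteps: int, num_steps: int) -> list[int]:
    """Return descending timesteps under 'trailing' spacing.

    Hypothetical helper: timesteps are spaced evenly, counting back from
    the final training timestep, then shifted to be zero-based.
    """
    if not 0 < num_steps <= num_train_timesteps:
        raise ValueError("num_steps must be in 1..num_train_timesteps")
    step_ratio = num_train_timesteps / num_steps
    # Start at num_train_timesteps and walk down in equal strides;
    # subtracting 1 converts to zero-based timestep indices.
    return [round(num_train_timesteps - i * step_ratio) - 1 for i in range(num_steps)]


print(trailing_timesteps(1000, 4))  # [999, 749, 499, 249]
```

The first entry is always the last training timestep (999 here), which is the property that distinguishes this spacing from schemes that start counting from timestep 0.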
- README.md +2 -2
- optimizer.bin +1 -1
- random_states_0.pkl +1 -1
- scheduler.bin +1 -1
- training_state-dalle3.json +0 -0
- training_state-photo-concept-bucket.json +0 -0
- training_state.json +1 -1
- transformer/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -1164,7 +1164,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings

 - Training epochs: 0
-- Training steps:
+- Training steps: 2000
 - Learning rate: 1e-06
 - Effective batch size: 96
 - Micro-batch size: 6
@@ -1190,7 +1190,7 @@ You may reuse the base model text encoder for inference.
 ### dalle3
 - Repeats: 0
 - Total number of images: ~984960
-- Total number of aspect buckets:
+- Total number of aspect buckets: 1
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
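The README settings can be cross-checked with a little arithmetic. The sketch below assumes the usual relation effective batch = micro-batch × gradient accumulation steps × number of data-parallel processes. The process count is not stated anywhere in this commit, so the value derived here is an inference, not a reported figure. The comparison against the dalle3 image count is likewise only illustrative, since training drew on more than one dataset.

```python
# Values taken from the commit message and the README diff.
MICRO_BATCH_SIZE = 6
GRAD_ACCUM_STEPS = 2
EFFECTIVE_BATCH_SIZE = 96
TRAINING_STEPS = 2000
DALLE3_IMAGES = 984_960  # listed as approximate (~) in the README


def implied_process_count(effective: int, micro: int, accum: int) -> int:
    """Assumption: effective = micro * accum * processes."""
    per_process = micro * accum
    if effective % per_process != 0:
        raise ValueError("effective batch size is not a multiple of micro * accum")
    return effective // per_process


num_processes = implied_process_count(EFFECTIVE_BATCH_SIZE, MICRO_BATCH_SIZE, GRAD_ACCUM_STEPS)
samples_seen = TRAINING_STEPS * EFFECTIVE_BATCH_SIZE
fraction_of_dalle3 = samples_seen / DALLE3_IMAGES

print(num_processes)                 # 8
print(samples_seen)                  # 192000
print(round(fraction_of_dalle3, 3))  # 0.195
```

Under these assumptions, 2000 steps cover well under one pass over the dalle3 set alone, which is consistent with the reported "Training epochs: 0".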
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:29a9d120d49bb408b234d6d2caf6d7a72efa842a3ef8d7ffe66d865ad379d102
 size 12170595712
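The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object ID (hash algorithm and digest), and the size in bytes. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is the new `optimizer.bin` pointer from the diff above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Check a local file's size and hash against a parsed pointer."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so multi-gigabyte files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:29a9d120d49bb408b234d6d2caf6d7a72efa842a3ef8d7ffe66d865ad379d102
size 12170595712
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])  # sha256 12170595712
```

Comparing the size first is a cheap way to reject a mismatched file before hashing roughly 12 GB of data.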
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:20a54d01c6efef4ee7ca5075c74040c21c60a27b9c684bc4820d25c3efcbb136
 size 16100
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:d03caeadb11306d05a947b42cfe2042c3eec52f84ae904f74fd6888491aa4389
 size 1128
training_state-dalle3.json
CHANGED
The diff for this file is too large to render.
training_state-photo-concept-bucket.json
CHANGED
The diff for this file is too large to render.
training_state.json
CHANGED
@@ -1 +1 @@
-{"global_step":
+{"global_step": 2000, "epoch_step": 2000, "epoch": 1, "exhausted_backends": [], "repeats": {}}
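The new `training_state.json` records `"epoch": 1`, while the commit message reports 0 epochs. One reading that reconciles the two is that `epoch` is a 1-based index of the epoch in progress. That reading is an assumption and is not confirmed anywhere in this commit. The sketch below loads the state under that assumption; `summarize_state` is a hypothetical helper.

```python
import json

# New contents of training_state.json, copied from the diff.
RAW_STATE = (
    '{"global_step": 2000, "epoch_step": 2000, "epoch": 1, '
    '"exhausted_backends": [], "repeats": {}}'
)


def summarize_state(raw: str) -> dict:
    """Summarize a training state; assumes 'epoch' is 1-based."""
    state = json.loads(raw)
    return {
        "global_step": state["global_step"],
        "epoch_step": state["epoch_step"],
        # Assumption: epoch N in progress means N - 1 epochs completed.
        "completed_epochs": max(state["epoch"] - 1, 0),
        "any_backend_exhausted": bool(state["exhausted_backends"]),
    }


summary = summarize_state(RAW_STATE)
print(summary["global_step"], summary["completed_epochs"])  # 2000 0
```

With `global_step` equal to `epoch_step` and no exhausted backends, the state is consistent with a run that is still inside its first epoch.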
transformer/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:405bcfd98738eb73030209f260595357e4daa5bed3eecc8d933e6dfba1da5f6f
 size 4169982088