PseudoTerminal X committed
Commit f55267c
Parent: c2d0d2d
Trained for 1 epoch and 29500 steps.
Trained with datasets ['text-embeds-pixart-filter', 'photo-concept-bucket', 'moviecollection', 'experimental', 'ethnic', 'sports', 'architecture', 'shutterstock', 'cinemamix-1mp', 'nsfw-1024', 'anatomy', 'bg20k-1024', 'yoga', 'photo-aesthetics', 'text-1mp', 'movieposters', 'normalnudes', 'pixel-art', 'signs', 'midjourney-v6-520k-raw', 'sfwbooru', 'nijijourney-v6-520k-raw', 'dalle3']
Learning rate 1e-06, batch size 24, and 1 gradient accumulation step.
Used DDPM noise scheduler for training with epsilon prediction type and rescaled_betas_zero_snr=False
Using 'linspace' timestep spacing.
Base model: ptx0/pixart-900m-1024-ft-large
VAE: madebyollin/sdxl-vae-fp16-fix
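The 'linspace' timestep spacing named in the commit message can be illustrated with a small pure-Python sketch. It assumes the convention, as used by the diffusers schedulers to my understanding, that sampling timesteps are spaced evenly from 0 to `num_train_timesteps - 1` inclusive, rounded, and visited in descending order. The default of 1000 training timesteps and the step count in the usage line are assumptions for illustration, not values taken from this commit, and `linspace_timesteps` is a hypothetical helper name.

```python
def linspace_timesteps(num_inference_steps: int,
                       num_train_timesteps: int = 1000) -> list[int]:
    """Return evenly spaced timesteps in descending order.

    num_train_timesteps=1000 is an assumed default, not a value
    reported in the commit.
    """
    if num_inference_steps < 1:
        raise ValueError("num_inference_steps must be at least 1")
    if num_inference_steps == 1:
        # A single sample of the interval starts at its lower end.
        return [0]
    last = num_train_timesteps - 1
    step = last / (num_inference_steps - 1)
    # Include both endpoints (0 and last), then reverse so that
    # sampling starts from the noisiest timestep.
    ascending = [round(i * step) for i in range(num_inference_steps)]
    return ascending[::-1]


print(linspace_timesteps(4))  # [999, 666, 333, 0]
```

Both endpoints are included, which distinguishes this spacing from schemes that start at a fixed stride and never reach the final training timestep.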
- README.md +7 -7
- optimizer.bin +1 -1
- random_states_0.pkl +1 -1
- scheduler.bin +1 -1
- training_state-anatomy.json +0 -0
- training_state-bg20k-1024.json +0 -0
- training_state-dalle3.json +2 -2
- training_state-midjourney-v6-520k-raw.json +2 -2
- training_state-nijijourney-v6-520k-raw.json +2 -2
- training_state-nsfw-1024.json +0 -0
- training_state-photo-aesthetics.json +0 -0
- training_state-photo-concept-bucket.json +2 -2
- training_state-sfwbooru.json +0 -0
- training_state-text-1mp.json +0 -0
- training_state.json +1 -1
- transformer/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -62,7 +62,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings
 
 - Training epochs: 1
-- Training steps:
+- Training steps: 29500
 - Learning rate: 1e-06
 - Effective batch size: 192
 - Micro-batch size: 24
@@ -80,7 +80,7 @@ You may reuse the base model text encoder for inference.
 ### photo-concept-bucket
 - Repeats: 0
 - Total number of images: ~564672
-- Total number of aspect buckets:
+- Total number of aspect buckets: 13
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -176,7 +176,7 @@ You may reuse the base model text encoder for inference.
 ### photo-aesthetics
 - Repeats: 0
 - Total number of images: ~28608
-- Total number of aspect buckets:
+- Total number of aspect buckets: 17
 - Resolution: 1.0 megapixels
 - Cropped: True
 - Crop style: random
@@ -224,7 +224,7 @@ You may reuse the base model text encoder for inference.
 ### midjourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~513792
-- Total number of aspect buckets:
+- Total number of aspect buckets: 15
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -232,7 +232,7 @@ You may reuse the base model text encoder for inference.
 ### sfwbooru
 - Repeats: 0
 - Total number of images: ~271488
-- Total number of aspect buckets:
+- Total number of aspect buckets: 35
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -240,7 +240,7 @@ You may reuse the base model text encoder for inference.
 ### nijijourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~516288
-- Total number of aspect buckets:
+- Total number of aspect buckets: 11
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -248,7 +248,7 @@ You may reuse the base model text encoder for inference.
 ### dalle3
 - Repeats: 0
 - Total number of images: ~1119168
-- Total number of aspect buckets:
+- Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
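The training settings in the README relate through the usual identity: effective batch size = micro-batch size × gradient accumulation steps × number of data-parallel devices. The device count is not stated anywhere in this commit, so the sketch below only infers it from the reported 192 and 24 together with the 1 gradient accumulation step from the commit message. The samples-seen figure further assumes that all 29500 steps ran at the same effective batch size, which may not hold if training was resumed with different settings. Both function names are hypothetical.

```python
def implied_device_count(effective_batch: int, micro_batch: int,
                         grad_accum_steps: int) -> int:
    """Infer the data-parallel device count from the batch settings."""
    per_device = micro_batch * grad_accum_steps
    if effective_batch % per_device != 0:
        raise ValueError("effective batch is not a multiple of the per-device batch")
    return effective_batch // per_device


def samples_seen(steps: int, effective_batch: int) -> int:
    """Upper-level estimate: assumes a constant effective batch size."""
    return steps * effective_batch


# Values reported in this commit: 192 effective, 24 micro-batch,
# 1 gradient accumulation step, 29500 training steps.
print(implied_device_count(192, 24, 1))  # 8
print(samples_seen(29500, 192))          # 5664000
```

Under these assumptions the run would have used 8 devices and drawn roughly 5.66 million samples, counting repeated images each time they are drawn.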
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:ae0eb6b041a0028b3a24d48874d631286d19e8b33e968ed0643a9b4659d2dd51
 size 5451415117
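Each binary file in this commit is stored as a Git LFS pointer: a small text file with a `version` line, an `oid sha256:` line, and a `size` line in bytes. The following is a minimal sketch of reading such a pointer, using the new optimizer.bin values from the diff above. The function name `parse_lfs_pointer` is hypothetical, and the parser only handles these three keys.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:ae0eb6b041a0028b3a24d48874d631286d19e8b33e968ed0643a9b4659d2dd51
size 5451415117
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algorithm"], len(info["digest"]))  # sha256 64
print(f"{info['size_bytes'] / 2**30:.2f} GiB")      # 5.08 GiB
```

Because only the digest changes between the old and new pointers while the size stays the same, the diff indicates that the file contents were rewritten without changing their length.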
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:360f68542b7351a488cde608cb684f61ba4d5fe6a08f52fa2651ae63d26f2604
 size 16100
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:b1a9b5b20185dcbb5340513578a6ce11f5f1d73ebc0940e7fb0827717d4e77b4
 size 1000
training_state-anatomy.json
CHANGED
The diff for this file is too large to render.
training_state-bg20k-1024.json
CHANGED
The diff for this file is too large to render.
training_state-dalle3.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:73319ac6942a41bd35538d940e32414caf4afcc958bc6755ab2c031db7e2b1f4
+size 9073645
training_state-midjourney-v6-520k-raw.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:82e5d11c71fa63ce38b568d6f1c2056f3ba4236e9547b631c35576169e7eb1f7
+size 6623991
training_state-nijijourney-v6-520k-raw.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:66ee4d1c0ce8716bc3e907d2072ff1ad8a33017cf3625622343f92242a24e88a
+size 7119331
training_state-nsfw-1024.json
CHANGED
The diff for this file is too large to render.
training_state-photo-aesthetics.json
CHANGED
The diff for this file is too large to render.
training_state-photo-concept-bucket.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:b1878fde59c5028a4d28030c43b7c04a4f7d1ba9d099779a8c408f47afc80932
+size 5541572
training_state-sfwbooru.json
CHANGED
The diff for this file is too large to render.
training_state-text-1mp.json
CHANGED
The diff for this file is too large to render.
training_state.json
CHANGED
@@ -1 +1 @@
-{"global_step":
+{"global_step": 29500, "epoch_step": 1, "epoch": 2, "exhausted_backends": ["pixel-art", "signs", "sports", "ethnic", "experimental", "movieposters", "normalnudes", "yoga", "cinemamix-1mp", "architecture", "moviecollection", "shutterstock", "nsfw-1024", "photo-aesthetics"], "repeats": {"bookcovers": 0, "signs": 0, "normalnudes": 0, "nijijourney": 0, "movieposters": 0, "celebrities": 0, "pixel-art": 0, "propagandaposters": 0, "sports": 0, "moviecollection": 0, "gay": 0, "experimental": 0, "yoga": 0, "ethnic": 0, "cinemamix-1mp": 0, "architecture": 0, "mj-60": 0, "text-1mp": 2, "shutterstock": 0, "nsfw-1024": 0, "photo-aesthetics": 0, "anatomy": 1, "bg20k-1024": 0, "sfwbooru": 0, "midjourney-v6-520k-raw": 0, "nijijourney-v6-520k-raw": 0, "photo-concept-bucket": 0, "dalle3": 0}}
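The new training_state.json can be inspected with a few lines of Python. The sketch below works on an abridged copy of the state: the full `exhausted_backends` list is kept, but only four of the `repeats` entries are reproduced. The meaning of each field is inferred from its name rather than from any documentation in this commit, and `summarize_state` is a hypothetical helper.

```python
import json

# Abridged copy of the new training_state.json: "exhausted_backends" is
# complete, "repeats" keeps only four of the original entries.
STATE_JSON = """
{"global_step": 29500, "epoch_step": 1, "epoch": 2,
 "exhausted_backends": ["pixel-art", "signs", "sports", "ethnic",
   "experimental", "movieposters", "normalnudes", "yoga", "cinemamix-1mp",
   "architecture", "moviecollection", "shutterstock", "nsfw-1024",
   "photo-aesthetics"],
 "repeats": {"text-1mp": 2, "anatomy": 1, "photo-aesthetics": 0, "dalle3": 0}}
"""


def summarize_state(state: dict) -> dict:
    """Collect the step counter, repeated datasets, and active datasets."""
    exhausted = set(state["exhausted_backends"])
    return {
        "global_step": state["global_step"],
        # Datasets whose repeat counter is above zero.
        "repeated": {name: count
                     for name, count in state["repeats"].items() if count > 0},
        # Datasets with a repeat counter that are not listed as exhausted.
        "not_exhausted": sorted(name for name in state["repeats"]
                                if name not in exhausted),
    }


summary = summarize_state(json.loads(STATE_JSON))
print(summary["repeated"])       # {'text-1mp': 2, 'anatomy': 1}
print(summary["not_exhausted"])  # ['anatomy', 'dalle3', 'text-1mp']
```

Run against the full file, the same function would list every dataset in `repeats`, including names such as `bookcovers` or `mj-60` that do not appear in the commit message's dataset list.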
transformer/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:d7d059800121590ddd35a4824a1a6734045ebd62667a773eef9ae94a8f0e6b0a
 size 1816969728