typo
README.md CHANGED
@@ -22,7 +22,7 @@ Why is the Model even existing? There are loads of Stable Diffusion model out th
 Well, are there any models trained with a base resolution (`base_res`) of 768 or even 1024 before? I don't think so.
 Here it is: the BPModel, a Stable Diffusion model you may love or hate.
 Trained on 5k high-quality images that suit my taste (not necessarily yours, unfortunately) from [Sankaku Complex](https://chan.sankakucomplex.com) with annotations.
-The dataset is public in [Crosstyan/BPDataset](https://huggingface.co/datasets/Crosstyan/BPDataset) for full disclosure.
+The dataset is public in [Crosstyan/BPDataset](https://huggingface.co/datasets/Crosstyan/BPDataset) for the sake of full disclosure.
 A pure combination of tags may not be the optimal way to describe an image,
 but I don't have to do extra work.
 And no, I won't feed any AI-generated image
@@ -97,7 +97,7 @@ better than some artist style DreamBooth model which only train with a few
 hundred images or even less. I also oppose changing style by merging models, since you
 could apply a different style by training with proper captions and prompting.

-Besides some of images in my dataset
+Besides, some of the images in my dataset have the artist name in the caption; however, some artist names will
 be misinterpreted by CLIP when tokenizing. For example, *as109* will be tokenized as `[as, 1, 0, 9]` and
 *fuzichoco* will become `[fu, z, ic, hoco]`. Romanized Japanese suffers from this problem a lot, and
 I don't have a good solution to fix it other than changing the artist name in the caption, which is