|
# Tagging methodology for Kazusa (blue archive) |
|
|
|
## README / Intro |
|
Since I've seen a few people share this already I'll provide this disclaimer. |
|
|
|
This is not really intended to be a guide, it's just a log/checklist of my process, for my own benefit, since I repeat this for a lot of LoRAs and I got tired of winging it every single time. I've put only the slightest amount of effort into making it accessible to others. |
|
|
|
I don't claim that any or all of these optimal, nor can I confidently put them forth as recommendations. They're literally just a record of the steps I follow while tagging, gradually developed after ~16 characters using some version of the below process. |
|
|
|
Still, I can at least point to my pre-Koharu LoRAs (which used pure WD1.4 tags) and the ones that came after (where I started heavily editing tags) and see a steady progression in quality and prompting flexibility despite using mostly the same training settings for each one. |
|
|
|
Yes, it takes forever to do all of this shit. No, I don't recommend it unless you're extremely autistic; raw WD1.4 tags are probably good enough for most people. If you intend to do this for more than a few characters, I strongly recommend learning [Hydrus](https://hydrusnetwork.github.io/hydrus/introduction.html) it makes all of this way, way less tedious compared to doing it with crappier tools. |
|
|
|
--- |
|
|
|
## Prep |
|
|
|
- Scraped `1girl kazusa_(blue_archive) order:popularity` from sancom, curated for quality, then exported from Hydrus to feed into WD1.4 Tagger. |
|
- Kazusa has a shitload of good art so I had to be very picky to get down to 280 images, which is still a lot. In hindsight I think huge datasets aren't really a problem; they let you train for longer without overfitting. |
|
- Gelbooru is probably fine too. Danbooru sucks for ロリ unless you have Gold. |
|
- I also got a few newer images from pixiv, don't remember which ones. |
|
- Exported final images from Hydrus to feed into WD1.4 Tagger |
|
- Auto-tagged with WD1.4 Swinv2 at 0.25 confidence |
|
- Reimported images+tags into Hydrus using the .txt sidecar feature. I strongly recommend putting WD1.4 tags in a separate tag domain so they aren't mixed in with shit scraped from boorus. |
|
|
|
## Tagging |
|
|
|
- Tag unique features |
|
- `halo` / `demon horns` / `low wings` |
|
- Remove when not present or out of view. WD1.4 likes putting `halo` even on images where no halo is visible. |
|
- **Kazusa**: `halo` / `animal ears` |
|
- Pruned `extra ears` as it seems redundant and intrinsic to the character. |
|
- Tag outfit variants with a single master tag |
|
- **Kazusa**: |
|
- Uniform: `school uniform` / `black jacket` |
|
- Sometimes the jacket appears without anything else, which was not tagged `school uniform` |
|
- Non-canon costumes |
|
- Add `alternate costume` |
|
- Nudity (WD1.4 usually does this accurately) |
|
- `nude` / `completely nude` |
|
- Prune eye colors |
|
- Keep tags which describe unusual eye features (`multicolored eyes`, `heterochromia`, `slit pupils`) as they can otherwise be too subtle and inconsistently drawn for the AI to notice |
|
- Prune hair colors |
|
- This includes `two-toned hair`, `gradiant hair`, etc. The AI learns all of these very consistently without the tags, likely because artists tend to draw them consistently |
|
- Partially prune hair styles |
|
- Leave key, defining style tags like `twintails`, `ponytail`, `short hair with long locks`, `twin braids`, etc. |
|
- Prune exceedingly common tags like `bangs` / `sidelocks` / `eyebrows visible through hair` / `hair between eyes`, etc. |
|
- Somewhat arbitrary, but I just don't think there's much value in them because they're ubiquitous and caption space is limited |
|
- Prune length, except for images which differ from the character's usual length |
|
- If you don't do this, it's more likely to get the hair length wrong when not prompted, which isn't a huge deal. |
|
- Add `alternate hairstyle` and/or `alternate hair length` on applicable images, which can be used to more easily change styles while prompting |
|
- **Kazusa**: `short hair, colored inner hair` -- while I would usually prune these, they're really her only defining hairstyle traits |
|
- Fixup hair ornaments |
|
- Prune generic `hair ornament` in favor of more specificity |
|
- `hairclip` / `black headband` / `hair flower` / `hair ribbon`, etc. |
|
- Consolidate tags that have color variants (`headband` >> `black headband`) |
|
- **Kazusa**: `hairclip` |
|
- Consolidate outfits |
|
- Only tag an item when it is actually visible. If it is only barely visible along the edge of an image, keep in mind it may be cropped during bucketing. |
|
- Danbooru's wiki entry for a character often provides a good list of tags for a character's entire outfit. |
|
- **Kazusa outfits**: |
|
- School Uniform |
|
- `black choker` |
|
- `hooded jacket` |
|
- `black jacket` |
|
- `green sailor collar` |
|
- `pink neckerchief` |
|
- `miniskirt` |
|
- `pleated skirt` |
|
- `white skirt` |
|
- `black pantyhose` |
|
- `sneakers` |
|
- Fixup sleeves |
|
- ie. `long sleeves` / `puffy long sleeves` / `detached sleeves` |
|
- You only need one, but pick one and be consistent. If sleeves aren't tagged the AI tends to add them inappropriately (such as when prompting for sleeveless outfits or nudity) |
|
- Fixup collars |
|
- ie. `detached collar` / `collared shirt` / `choker` / etc. |
|
- Same deal as sleeves, they tend to appear when unwanted if not consistently tagged according to actual visibility |
|
- Fixup clothing state |
|
- ie. `open jacket` / `open shirt` / `partially undressed` / `off shoulder` |
|
- The tagger is generally good at this but it can help to double-check for weird outfits |
|
- Tag expressions |
|
- This is tedious and the autotagger doesn't help you out much, but tagging these can really help the AI nail multiple iconic expressions for a character |
|
- Start by searching for images without one of these, and add them. |
|
- `open mouth` |
|
- `closed mouth` |
|
- `parted lips` |
|
- Sometimes applies with `open mouth` |
|
- Then proceed through each image and add one of these |
|
- `smile` / `light smile` / `:d` / `grin` (exposed teeth only) |
|
- `:o` / `:<` / `expressionless` / `serious` |
|
- `wavy mouth` / `embarrassed` |
|
- `pout` / `:t` / `tsundere` |
|
- `nervous` / `nervous smile` |
|
- `flustered` / `swirly eyes` / `@_@` |
|
- `surprised` / `o_o` / `wide-eyed` |
|
- `upset` / `annoyed` / `frustrated` / `v-shaped eyebrows` |
|
- `naughty face` / `seductive smile` |
|
- `smug` / `:3` / `smirk` |
|
- `yelling` / `frown` |
|
- `eyes closed` / `one eye closed` |
|
- WD1.4 almost always gets these two |
|
- Tag camera angles/composition |
|
- Most of these aren't very high value, but `from x` can be helpful. |
|
- `cowboy shot` |
|
- `upper body` |
|
- `full body` |
|
- `portrait` |
|
- `feet out of frame` |
|
- `cropped torso` / `cropped legs` |
|
- `from side` / `from above` / `from below` / `from behind` |
|
- Tag iconic poses, actions, or props |
|
- Props need to show up often in training data for this to be worth it. |
|
- `v` / `peace sign` / `standing on one leg` |
|
- `holding dango` / `weapon case` / `fashion magazine` |
|
- **Kazusa** |
|
- `mouth hold` |
|
- `eating` |
|
- `macaron` |
|
- Flip through each image and use Hydrus's "related tags" feature to quickly identify important tags that might be missing. |
|
- This feature looks at other images with similar tags to provide suggestions. Good for spotting things you or the tagger might have missed. |
|
|