---
license: apache-2.0
library_name: timm
---
# IdolSankaku EVA02-Large Tagger v1
Supports ratings, characters and general tags.
Trained using https://github.com/SmilingWolf/JAX-CV.
TPUs used for training kindly provided by the [TRC program](https://sites.research.google/trc/about/).
## Dataset
Trained on a human-annotated dataset of real-world photos.
## Validation results
`v1.0: P=R: threshold = 0.4985, F1 = 0.6017`
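The threshold above is the point where precision equals recall on the validation set. As a minimal sketch of how it might be applied to per-tag probabilities, assuming the scores are already available as a plain dictionary (the tag names, the scores, and the `select_tags` helper are made up for illustration):

```python
# Validation threshold reported above (point where precision equals recall).
THRESHOLD = 0.4985


def select_tags(scores, threshold=THRESHOLD):
    """Return (tag, score) pairs strictly above the threshold, highest score first."""
    kept = [(tag, score) for tag, score in scores.items() if score > threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


# Hypothetical per-tag probabilities, for illustration only.
example_scores = {"tag_a": 0.91, "tag_b": 0.4985, "tag_c": 0.12, "tag_d": 0.55}
print(select_tags(example_scores))
```

Raising the threshold trades recall for precision, so a different value may suit a given use case better.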
## What's new
Model v1.0/Dataset v1:
- First version of the dataset, tags updated on 2024-08-31.
- `timm` compatible! Load it up and give it a spin using the canonical one-liner!
- ONNX model is compatible with code developed for the v3 series of WD tagger models.
- The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
- Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.
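
For readers unfamiliar with the metric: Macro-F1 is the unweighted mean of per-tag F1 scores, so rare tags weigh as much as frequent ones, whereas Micro-F1 pools the counts across all tags. The sketch below illustrates the difference with made-up counts. It is not the evaluation code used for this model, and the helper names are hypothetical.

```python
def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0.0 when there are no true positives."""
    if tp == 0:
        return 0.0
    return 2 * tp / (2 * tp + fp + fn)


def macro_f1(counts):
    """Unweighted mean of the per-tag F1 scores."""
    return sum(f1(*c) for c in counts.values()) / len(counts)


def micro_f1(counts):
    """F1 computed on counts pooled over all tags."""
    tp = sum(c[0] for c in counts.values())
    fp = sum(c[1] for c in counts.values())
    fn = sum(c[2] for c in counts.values())
    return f1(tp, fp, fn)


# Hypothetical (tp, fp, fn) counts: one frequent tag and one rare tag.
counts = {"frequent_tag": (90, 10, 10), "rare_tag": (1, 3, 3)}
print(f"macro-F1 = {macro_f1(counts):.3f}, micro-F1 = {micro_f1(counts):.3f}")
```

With these counts the poorly predicted rare tag pulls Macro-F1 well below Micro-F1, which is why the two metrics can tell different stories on long-tailed tag vocabularies.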
## Runtime deps
ONNX model requires `onnxruntime >= 1.17.0`
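
A minimal sketch of checking this requirement at startup, using only the standard library. The helper names are hypothetical, and the version parsing is deliberately simplistic: it only compares the leading numeric components. Note that it looks up the `onnxruntime` distribution only; other builds such as `onnxruntime-gpu` are installed under a different name and would need their own lookup.

```python
from importlib import metadata

# Minimum version stated above.
MIN_ONNXRUNTIME = (1, 17, 0)


def parse_version(text):
    """Turn '1.17.0' or '1.18.0rc1' into a 3-tuple of leading integers."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def onnxruntime_is_recent_enough(installed=None):
    """Compare the installed onnxruntime (or a given version string) to the minimum."""
    if installed is None:
        try:
            installed = metadata.version("onnxruntime")
        except metadata.PackageNotFoundError:
            return False
    return parse_version(installed) >= MIN_ONNXRUNTIME


print(onnxruntime_is_recent_enough())
```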
## Inference code examples
For timm: https://github.com/neggles/wdv3-timm
For ONNX: https://huggingface.co/spaces/SmilingWolf/wd-tagger
For JAX: https://github.com/SmilingWolf/wdv3-jax
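
For a quick start with `timm`, loading from the Hugging Face Hub might look like the sketch below. The repository id is an assumption inferred from the model name and is not stated in this card, so replace it with the actual id. The helper names are hypothetical. Preprocessing and tag decoding are left out; the linked repositories cover those.

```python
# ASSUMPTION: repository id inferred from the model name, not taken from this card.
REPO_ID = "SmilingWolf/idolsankaku-eva02-large-tagger-v1"


def hub_model_name(repo_id=REPO_ID):
    """Build the 'hf-hub:' model name timm uses for Hugging Face Hub checkpoints."""
    return f"hf-hub:{repo_id}"


def load_tagger(repo_id=REPO_ID):
    """Download the weights and return the model in eval mode.

    Needs timm installed and network access.
    """
    import timm  # imported lazily so hub_model_name() works without timm

    model = timm.create_model(hub_model_name(repo_id), pretrained=True)
    return model.eval()


print(hub_model_name())
```

Calling `load_tagger()` downloads the checkpoint on first use, so expect a sizeable download for a Large model.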
## Final words
Subject to change and updates.
Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.
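
One way to follow this advice is to pass an explicit `revision` when downloading with `huggingface_hub`. In the sketch below the repository id, the file name, and the tag name are all illustrative assumptions rather than values taken from this card, and the helper names are hypothetical. Check the repository for the tags and files that actually exist.

```python
# ASSUMPTIONS: repo id, file name, and tag name are illustrative only.
REPO_ID = "SmilingWolf/idolsankaku-eva02-large-tagger-v1"
MODEL_FILE = "model.onnx"
REVISION = "v1.0"  # hypothetical tag; a commit hash also works


def download_kwargs(revision=REVISION):
    """Keyword arguments for hf_hub_download with an explicit, fixed revision."""
    if revision in (None, "", "main"):
        raise ValueError("pin a tag or commit hash instead of the moving 'main' branch")
    return {"repo_id": REPO_ID, "filename": MODEL_FILE, "revision": revision}


def fetch_pinned_model(revision=REVISION):
    """Download the model file at the pinned revision and return its local path.

    Needs huggingface_hub installed and network access.
    """
    from huggingface_hub import hf_hub_download  # lazy import

    return hf_hub_download(**download_kwargs(revision))


print(download_kwargs())
```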
## Thanks
Thanks to the whole DeepGHS team for data gathering and encouraging me to push the models much further than they had any reason to attempt to reach, much less succeed.