|
--- |
|
license: apache-2.0 |
|
library_name: timm |
|
--- |
|
# IdolSankaku EVA02-Large Tagger v1 |
|
|
|
Supports ratings, characters and general tags. |
|
|
|
Trained using https://github.com/SmilingWolf/JAX-CV. |
|
TPUs used for training kindly provided by the [TRC program](https://sites.research.google/trc/about/). |
|
|
|
## Dataset |
|
Trained on a human annotated dataset of real world photos. |
|
|
|
## Validation results |
|
`v1.0: P=R: threshold = 0.4985, F1 = 0.6017` |
|
|
|
## What's new |
|
Model v1.0/Dataset v1: |
|
First version of the dataset, tags updated on 2024-08-31. |
|
`timm` compatible! Load it up and give it a spin using the canonical one-liner! |
|
ONNX model is compatible with code developed for the v3 series of WD tagger models. |
|
The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference. |
|
Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress. |
|
|
|
# Runtime deps |
|
ONNX model requires `onnxruntime >= 1.17.0` |
|
|
|
# Inference code examples |
|
For timm: https://github.com/neggles/wdv3-timm |
|
For ONNX: https://huggingface.co/spaces/SmilingWolf/wd-tagger |
|
For JAX: https://github.com/SmilingWolf/wdv3-jax |
|
|
|
## Final words |
|
Subject to change and updates. |
|
Downstream users are encouraged to use tagged releases rather than relying on the head of the repo. |
|
|
|
## Thanks |
|
Thanks to the whole DeepGHS team for data gathering and encouraging me to push the models much further than they had any reason to attempt to reach, much less succeed. |
|
|