metadata
license: apache-2.0
base_model: facebook/convnextv2-nano-22k-384
tags:
- image-classification
- vision
- boulderspot
- climbing
- aerial imagery
- remote sensing
- bouldering
metrics:
- accuracy
- f1
- precision
- recall
- matthews_correlation
datasets:
- pszemraj/boulderspot
convnextv2-nano-22k-384-boulderspot
This is a model fine-tuned to classify whether an aerial/satellite image contains a climbing area or not.
You can find some images to test inference with in this old repo from the original project
Model description
This model is a fine-tuned version of facebook/convnextv2-nano-22k-384 on the pszemraj/boulderspot dataset. It achieves the following results on the evaluation set:
- Loss: 0.0340
- Accuracy: 0.9883
- F1: 0.9883
- Precision: 0.9883
- Recall: 0.9883
- Matthews Correlation: 0.8962
example usage
import requests
from PIL import Image
from transformers import pipeline
pipe = pipeline(
"image-classification",
model="pszemraj/convnextv2-nano-22k-384-boulderspot",
)
url = "https://huggingface.co/pszemraj/convnextv2-nano-22k-384-boulderspot/resolve/main/test_img_magic_wood.png?download=true"
image = Image.open(requests.get(url, stream=True).raw)
result = pipe(image)[0]
print(result)
# image.show()
Intended uses & limitations
Classification of aerial/satellite imagery, ideally with spacial resolution 10-25 cm (i.e. for 10 cm, each pixel in the image corresonds to approx. 10 cm x 10 cm area on the ground). It may be suitable outside of that, but should be validated as other resolutions were not present in the training data.
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 7890
- gradient_accumulation_steps: 4
- total_train_batch_size: 64
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_ratio: 0.05
- num_epochs: 5.0
Training results
Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall | Matthews Correlation |
---|---|---|---|---|---|---|---|---|
0.1102 | 1.0 | 203 | 0.0431 | 0.9839 | 0.9840 | 0.9841 | 0.9839 | 0.8590 |
0.0559 | 2.0 | 406 | 0.0476 | 0.9839 | 0.9845 | 0.9858 | 0.9839 | 0.8709 |
0.0402 | 3.0 | 609 | 0.0464 | 0.9810 | 0.9817 | 0.9831 | 0.9810 | 0.8468 |
0.0334 | 4.0 | 813 | 0.0348 | 0.9868 | 0.9869 | 0.9870 | 0.9868 | 0.8846 |
0.0445 | 4.99 | 1015 | 0.0340 | 0.9883 | 0.9883 | 0.9883 | 0.9883 | 0.8962 |
Framework versions
- Transformers 4.39.2
- Pytorch 2.4.0.dev20240328+cu121
- Datasets 2.18.0
- Tokenizers 0.15.2