Vision Transformer (ViT) for Music Genre Classification

Model Overview

The model, ghermoso/vit-eGTZANplus, achieves the following results on the evaluation set:

  • Loss: 0.8358
  • Accuracy: 0.7460
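
For readers who want to see how figures like these are typically derived, here is a minimal sketch. It assumes the reported loss is the mean cross-entropy over the evaluation examples and that accuracy is the fraction of examples whose highest-scoring class matches the label; the card itself does not state this. The function name `cross_entropy_and_accuracy` and the toy logits are hypothetical and are not outputs of this model.

```python
import math


def cross_entropy_and_accuracy(logits, labels):
    """Return (mean cross-entropy loss, accuracy) for a batch of logits.

    logits: list of per-example score lists, one score per class.
    labels: list of integer class indices, one per example.
    """
    total_loss = 0.0
    correct = 0
    for row, label in zip(logits, labels):
        # Numerically stable log-sum-exp: subtract the row maximum first.
        row_max = max(row)
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        # Cross-entropy for one example is -log softmax(row)[label].
        total_loss += log_sum_exp - row[label]
        # The prediction is the index of the largest logit.
        if row.index(row_max) == label:
            correct += 1
    n = len(labels)
    return total_loss / n, correct / n


# Hypothetical toy batch: 3 clips, 3 genres (assumed values for illustration).
toy_logits = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3], [0.9, 0.1, 0.4]]
toy_labels = [0, 1, 2]
loss, acc = cross_entropy_and_accuracy(toy_logits, toy_labels)
print(f"loss={loss:.4f} accuracy={acc:.4f}")
```

Under these assumptions, an accuracy of 0.7460 means roughly three out of four evaluation clips were assigned the correct genre.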
