KaushalB committed on
Commit
faba7b5
1 Parent(s): b41ee08

This is an implementation of Google's Vision Transformer (large, patch size 32) used to classify music into genres. The dataset used is the GTZAN dataset, which contains mel spectrograms of many songs.

  1. README.md +13 -3
README.md CHANGED
@@ -1,3 +1,13 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ datasets:
+ - ghermoso/egtzan_plus
+ metrics:
+ - accuracy
+ library_name: transformers
+ pipeline_tag: image-classification
+ tags:
+ - ViT
+ - music
+ - CV
+ ---
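
As a rough illustration of what the added front matter encodes, the Python sketch below parses it with the standard library only. The names `FRONT_MATTER` and `parse_front_matter` are hypothetical and introduced here for illustration; this is not how the Hugging Face Hub reads model cards, and it handles only the flat `key: value` and `- item` forms that appear in this diff. Real front matter should be read with a proper YAML parser.

```python
# Minimal sketch (assumption: front matter contains only top-level
# "key: value" pairs and "- item" lists, as in the diff above).
# This is NOT a general YAML parser.

FRONT_MATTER = """\
---
license: apache-2.0
datasets:
- ghermoso/egtzan_plus
metrics:
- accuracy
library_name: transformers
pipeline_tag: image-classification
tags:
- ViT
- music
- CV
---
"""


def parse_front_matter(text):
    """Return a dict mapping each key to a string or a list of strings."""
    lines = text.strip().splitlines()
    if len(lines) < 2 or lines[0] != "---" or lines[-1] != "---":
        raise ValueError("front matter must be delimited by '---' lines")

    meta = {}
    current_list_key = None  # key whose list items are being collected
    for line in lines[1:-1]:
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith("- "):
            # A list item belongs to the most recent key that had no value.
            if current_list_key is None:
                raise ValueError(f"list item without a key: {line!r}")
            meta[current_list_key].append(stripped[2:].strip())
        else:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                meta[key] = value        # scalar entry
                current_list_key = None
            else:
                meta[key] = []           # list entry; items follow
                current_list_key = key
    return meta


metadata = parse_front_matter(FRONT_MATTER)
for key, value in metadata.items():
    print(f"{key}: {value}")
```

Parsed this way, scalar fields such as `pipeline_tag` come back as strings, while `datasets`, `metrics`, and `tags` come back as lists.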