admin
commited on
Commit
·
5afd5b6
1
Parent(s):
59b6383
upd md
Browse files
README.md
CHANGED
@@ -2,64 +2,62 @@
|
|
2 |
license: mit
|
3 |
---
|
4 |
|
5 |
-
# Intro
|
6 |
The Guzheng Performance Technique Recognition Model is trained on the GZ_IsoTech Dataset, which consists of 2,824 audio clips that showcase various Guzheng playing techniques. Of these, 2,328 clips are from a virtual sound library, and 496 clips are performed by a highly skilled professional Guzheng artist, covering the full tonal range inherent to the Guzheng instrument. The audio clips are categorized into eight different playing techniques based on the unique performance practices of the Guzheng: Vibrato (chanyin), Slide-up (shanghuayin), Slide-down (xiahuayin), Return Slide (huihuayin), Glissando (guazou, huazhi, etc.), Thumb Plucking (yaozhi), Harmonics (fanyin), and Plucking Techniques (gou, da, mo, tuo, etc.). The model utilizes feature extraction, time-domain and frequency-domain analysis, and pattern recognition to accurately identify these distinct Guzheng playing techniques. The application of this model provides strong support for the automatic recognition, digital analysis, and educational research of Guzheng performance techniques, promoting the preservation and innovation of Guzheng art.
|
7 |
|
8 |
-
|
9 |
-
|
10 |
-
## Demo 在线演示
|
11 |
<https://huggingface.co/spaces/ccmusic-database/GZ_IsoTech>
|
12 |
|
13 |
-
## Usage
|
14 |
```python
|
15 |
from modelscope import snapshot_download
|
16 |
model_dir = snapshot_download("ccmusic-database/GZ_IsoTech")
|
17 |
```
|
18 |
|
19 |
-
## Maintenance
|
20 |
```bash
|
21 |
git clone git@hf.co:ccmusic-database/GZ_IsoTech
|
22 |
cd GZ_IsoTech
|
23 |
```
|
24 |
|
25 |
-
## Results
|
26 |
-
| Backbone | Size(M) |
|
27 |
-
| :----------------: | :-----: |
|
28 |
-
| vit_l_16 | 304.3 | [**_0.855_**](#best-result
|
29 |
-
| maxvit_t | 30.9 |
|
30 |
-
| | |
|
31 |
-
| resnext101_64x4d | 83.5 |
|
32 |
-
| resnet101 | 44.5 |
|
33 |
-
| regnet_y_8gf | 39.4 |
|
34 |
-
| shufflenet_v2_x2_0 | 7.4 |
|
35 |
-
| mobilenet_v3_large | 5.5 |
|
36 |
|
37 |
-
### Best result
|
38 |
<table>
|
39 |
<tr>
|
40 |
<th>Loss curve</th>
|
41 |
-
<td><img src="https://www.modelscope.cn/
|
42 |
</tr>
|
43 |
<tr>
|
44 |
<th>Training and validation accuracy</th>
|
45 |
-
<td><img src="https://www.modelscope.cn/
|
46 |
</tr>
|
47 |
<tr>
|
48 |
<th>Confusion matrix</th>
|
49 |
-
<td><img src="https://www.modelscope.cn/
|
50 |
</tr>
|
51 |
</table>
|
52 |
|
53 |
-
## Dataset
|
54 |
<https://huggingface.co/datasets/ccmusic-database/GZ_IsoTech>
|
55 |
|
56 |
-
## Mirror
|
57 |
<https://www.modelscope.cn/models/ccmusic-database/GZ_IsoTech>
|
58 |
|
59 |
-
## Evaluation
|
60 |
<https://github.com/monetjoe/ccmusic_eval>
|
61 |
|
62 |
-
## Cite
|
63 |
```bibtex
|
64 |
@dataset{zhaorui_liu_2021_5676893,
|
65 |
author = {Monan Zhou, Shenyang Xu, Zhaorui Liu, Zhaowen Wang, Feng Yu, Wei Li and Baoqiang Han},
|
|
|
2 |
license: mit
|
3 |
---
|
4 |
|
5 |
+
# Intro
|
6 |
The Guzheng Performance Technique Recognition Model is trained on the GZ_IsoTech Dataset, which consists of 2,824 audio clips that showcase various Guzheng playing techniques. Of these, 2,328 clips are from a virtual sound library, and 496 clips are performed by a highly skilled professional Guzheng artist, covering the full tonal range inherent to the Guzheng instrument. The audio clips are categorized into eight different playing techniques based on the unique performance practices of the Guzheng: Vibrato (chanyin), Slide-up (shanghuayin), Slide-down (xiahuayin), Return Slide (huihuayin), Glissando (guazou, huazhi, etc.), Thumb Plucking (yaozhi), Harmonics (fanyin), and Plucking Techniques (gou, da, mo, tuo, etc.). The model utilizes feature extraction, time-domain and frequency-domain analysis, and pattern recognition to accurately identify these distinct Guzheng playing techniques. The application of this model provides strong support for the automatic recognition, digital analysis, and educational research of Guzheng performance techniques, promoting the preservation and innovation of Guzheng art.
|
7 |
|
8 |
+
## Demo
|
|
|
|
|
9 |
<https://huggingface.co/spaces/ccmusic-database/GZ_IsoTech>
|
10 |
|
11 |
+
## Usage
|
12 |
```python
|
13 |
from modelscope import snapshot_download
|
14 |
model_dir = snapshot_download("ccmusic-database/GZ_IsoTech")
|
15 |
```
|
16 |
|
17 |
+
## Maintenance
|
18 |
```bash
|
19 |
git clone git@hf.co:ccmusic-database/GZ_IsoTech
|
20 |
cd GZ_IsoTech
|
21 |
```
|
22 |
|
23 |
+
## Results
|
24 |
+
| Backbone | Size(M) | Mel | CQT | Chroma |
|
25 |
+
| :----------------: | :-----: | :-------------------------: | :---------: | :---------: |
|
26 |
+
| vit_l_16 | 304.3 | [**_0.855_**](#best-result) | **_0.824_** | **_0.770_** |
|
27 |
+
| maxvit_t | 30.9 | 0.763 | 0.776 | 0.642 |
|
28 |
+
| | | | | |
|
29 |
+
| resnext101_64x4d | 83.5 | 0.713 | 0.765 | 0.639 |
|
30 |
+
| resnet101 | 44.5 | 0.731 | 0.798 | **_0.719_** |
|
31 |
+
| regnet_y_8gf | 39.4 | 0.804 | **_0.807_** | 0.716 |
|
32 |
+
| shufflenet_v2_x2_0 | 7.4 | 0.702 | 0.799 | 0.665 |
|
33 |
+
| mobilenet_v3_large | 5.5 | **_0.806_** | 0.798 | 0.657 |
|
34 |
|
35 |
+
### Best result
|
36 |
<table>
|
37 |
<tr>
|
38 |
<th>Loss curve</th>
|
39 |
+
<td><img src="https://www.modelscope.cn/models/ccmusic-database/GZ_IsoTech/resolve/master/vit_l_16_mel_2024-12-06_08-28-13/loss.jpg"></td>
|
40 |
</tr>
|
41 |
<tr>
|
42 |
<th>Training and validation accuracy</th>
|
43 |
+
<td><img src="https://www.modelscope.cn/models/ccmusic-database/GZ_IsoTech/resolve/master/vit_l_16_mel_2024-12-06_08-28-13/acc.jpg"></td>
|
44 |
</tr>
|
45 |
<tr>
|
46 |
<th>Confusion matrix</th>
|
47 |
+
<td><img src="https://www.modelscope.cn/models/ccmusic-database/GZ_IsoTech/resolve/master/vit_l_16_mel_2024-12-06_08-28-13/mat.jpg"></td>
|
48 |
</tr>
|
49 |
</table>
|
50 |
|
51 |
+
## Dataset
|
52 |
<https://huggingface.co/datasets/ccmusic-database/GZ_IsoTech>
|
53 |
|
54 |
+
## Mirror
|
55 |
<https://www.modelscope.cn/models/ccmusic-database/GZ_IsoTech>
|
56 |
|
57 |
+
## Evaluation
|
58 |
<https://github.com/monetjoe/ccmusic_eval>
|
59 |
|
60 |
+
## Cite
|
61 |
```bibtex
|
62 |
@dataset{zhaorui_liu_2021_5676893,
|
63 |
author = {Monan Zhou, Shenyang Xu, Zhaorui Liu, Zhaowen Wang, Feng Yu, Wei Li and Baoqiang Han},
|