Text-to-Speech
Transformers
10 languages
music
Inference Endpoints
File size: 5,420 Bytes
fa05d06
 
 
 
 
094ba32
 
 
 
 
 
 
 
 
 
 
bf71ef8
 
 
 
d16843b
 
 
82501b8
 
 
 
 
80abc5d
 
 
 
 
 
 
99fb832
 
 
 
 
 
 
 
 
 
f398a9c
 
 
 
 
 
 
b716fe6
 
 
 
 
 
 
99fb832
24337f4
 
99fb832
 
 
 
 
 
f398a9c
 
 
0b72e67
341ebba
 
f398a9c
2f6d053
e1c1f49
 
f398a9c
e1c1f49
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
---
license: openrail
datasets:
- Fhrozen/AudioSet2K22
- Chr0my/Epidemic_sounds
- ChristophSchuhmann/lyrics-index
- Cropinky/rap_lyrics_english
- tsterbak/eurovision-lyrics-1956-2023
- brunokreiner/genius-lyrics
- google/MusicCaps
- ccmusic-database/music_genre
- Hyeon2/riffusion-musiccaps-dataset
- SamAct/autotrain-data-musicprompt
- Chr0my/Epidemic_music
- juliensimon/autonlp-data-song-lyrics
- Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC
- Chr0my/freesound.org
- teticio/audio-diffusion-256
- KELONMYOSA/dusha_emotion_audio
- Ar4ikov/iemocap_audio_text_splitted
- flexthink/ljspeech
- mozilla-foundation/common_voice_13_0
- facebook/voxpopuli
- SocialGrep/one-million-reddit-jokes
- breadlicker45/human-midi-rlhf
- breadlicker45/midi-gpt-music-small
- projectlosangeles/Los-Angeles-MIDI-Dataset
- huggingartists/epic-rap-battles-of-history
- SocialGrep/one-million-reddit-confessions
- shahules786/prosocial-nsfw-reddit
- Thewillonline/reddit-sarcasm
- autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606
- lmsys/chatbot_arena_conversations
- mozilla-foundation/common_voice_11_0
- mozilla-foundation/common_voice_4_0
- dell-research-harvard/AmericanStories
- zZWipeoutZz/insane_style
- mu-llama/MusicQA
- RaphaelOlivier/whisper_adversarial_examples
- huggingartists/metallica
- vldsavelyev/guitar_tab
- NLPCoreTeam/humaneval_ru
- seungheondoh/audioset-music
- gary109/onset-singing3_corpora_parliament_processed_MIR-ST500
- LDD5522/Rock_Vocals
- huggingartists/rage-against-the-machine
- huggingartists/chester-bennington
- huggingartists/logic
- cmsolson75/artist_song_lyric_dataset
- BhavyaMuni/artist-lyrics
- vjain/emotional_intelligence
- mhenrichsen/context-aware-splits
metrics:
- accuracy
- bertscore
- bleu
- bleurt
- brier_score
- character
- chrf
language:
- en
- es
- it
- pt
- la
- fr
- ru
- zh
- ja
- el
library_name: transformers
tags:
- music
pipeline_tag: text-to-speech
---
# SoundSlayerAI

SoundSlayerAI is an innovative project that focuses on music-related tasks This project aims to provide various functionalities for audio analysis and processing, making it easier to work with music datasets.

## Datasets

SoundSlayerAI makes use of the following datasets:

- Fhrozen/AudioSet2K22
- Chr0my/Epidemic_sounds
- ChristophSchuhmann/lyrics-index
- Cropinky/rap_lyrics_english
- tsterbak/eurovision-lyrics-1956-2023
- brunokreiner/genius-lyrics
- google/MusicCaps
- ccmusic-database/music_genre
- Hyeon2/riffusion-musiccaps-dataset
- SamAct/autotrain-data-musicprompt
- Chr0my/Epidemic_music
- juliensimon/autonlp-data-song-lyrics
- Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC
- Chr0my/freesound.org
- teticio/audio-diffusion-256
- KELONMYOSA/dusha_emotion_audio
- Ar4ikov/iemocap_audio_text_splitted
- flexthink/ljspeech
- mozilla-foundation/common_voice_13_0
- facebook/voxpopuli
- SocialGrep/one-million-reddit-jokes
- breadlicker45/human-midi-rlhf
- breadlicker45/midi-gpt-music-small
- projectlosangeles/Los-Angeles-MIDI-Dataset
- huggingartists/epic-rap-battles-of-history
- SocialGrep/one-million-reddit-confessions
- shahules786/prosocial-nsfw-reddit
- Thewillonline/reddit-sarcasm
- autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606
- lmsys/chatbot_arena_conversations
- mozilla-foundation/common_voice_11_0
- mozilla-foundation/common_voice_4_0

## Library

The core library used in this project is "pyannote-audio." This library provides a wide range of functionalities for audio analysis and processing, making it an excellent choice for working with music datasets. The "pyannote-audio" library offers a comprehensive set of tools and algorithms for tasks such as audio segmentation, speaker diarization, music transcription, and more.

## Metrics

To evaluate the performance of SoundSlayerAI, several metrics are employed, including:

- Accuracy
- Bertscore
- BLEU
- BLEURT
- Brier Score
- Character

These metrics help assess the effectiveness and accuracy of the implemented algorithms and models.

## Language

The SoundSlayerAI project primarily focuses on the English language. The datasets and models used in this project are optimized for English audio and text analysis tasks.

## Usage

To use SoundSlayerAI, follow these steps:

1. Install the required dependencies by running `pip install pyannote-audio`.

2. Import the necessary modules from the "pyannote.audio" package to access the desired functionalities.

3. Load the audio data or use the provided datasets to perform tasks such as audio segmentation, speaker diarization, music transcription, and more.

4. Apply the appropriate algorithms and models from the "pyannote.audio" library to process and analyze the audio data.

5. Evaluate the results using the specified metrics, such as accuracy, bertscore, BLEU, BLEURT, brier_score, and character.

6. Iterate and refine your approach to achieve the desired outcomes for your music-related tasks.

## License

SoundSlayerAI is released under the Openrail license. Please refer to the LICENSE file for more details.

## Contributions

Contributions to SoundSlayerAI are welcome! If you have any ideas, bug fixes, or enhancements, feel free to submit a pull request or open an issue on the GitHub repository.

## Contact

For any inquiries or questions regarding SoundSlayerAI, please reach out to the project maintainer at [insert email address].

Thank you for your interest in SoundSlayerAI!