File size: 5,201 Bytes
fa05d06 094ba32 bf71ef8 d16843b 82501b8 80abc5d 99fb832 b716fe6 99fb832 24337f4 99fb832 cf99fe4 341ebba 2f6d053 e1c1f49 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 |
---
license: openrail
datasets:
- Fhrozen/AudioSet2K22
- Chr0my/Epidemic_sounds
- ChristophSchuhmann/lyrics-index
- Cropinky/rap_lyrics_english
- tsterbak/eurovision-lyrics-1956-2023
- brunokreiner/genius-lyrics
- google/MusicCaps
- ccmusic-database/music_genre
- Hyeon2/riffusion-musiccaps-dataset
- SamAct/autotrain-data-musicprompt
- Chr0my/Epidemic_music
- juliensimon/autonlp-data-song-lyrics
- Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC
- Chr0my/freesound.org
- teticio/audio-diffusion-256
- KELONMYOSA/dusha_emotion_audio
- Ar4ikov/iemocap_audio_text_splitted
- flexthink/ljspeech
- mozilla-foundation/common_voice_13_0
- facebook/voxpopuli
- SocialGrep/one-million-reddit-jokes
- breadlicker45/human-midi-rlhf
- breadlicker45/midi-gpt-music-small
- projectlosangeles/Los-Angeles-MIDI-Dataset
- huggingartists/epic-rap-battles-of-history
- SocialGrep/one-million-reddit-confessions
- shahules786/prosocial-nsfw-reddit
- Thewillonline/reddit-sarcasm
- autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606
- lmsys/chatbot_arena_conversations
- mozilla-foundation/common_voice_11_0
- mozilla-foundation/common_voice_4_0
- dell-research-harvard/AmericanStories
- zZWipeoutZz/insane_style
- mu-llama/MusicQA
- RaphaelOlivier/whisper_adversarial_examples
- huggingartists/metallica
- vldsavelyev/guitar_tab
- NLPCoreTeam/humaneval_ru
- seungheondoh/audioset-music
- gary109/onset-singing3_corpora_parliament_processed_MIR-ST500
- LDD5522/Rock_Vocals
metrics:
- accuracy
- bertscore
- bleu
- bleurt
- brier_score
- character
- chrf
language:
- en
- es
- it
- pt
- la
- fr
- ru
library_name: pyannote-audio
tags:
- music
---
# SoundSlayerAI
SoundSlayerAI is an innovative project that focuses on music-related tasks and utilizes the power of the "pyannote-audio" library. This project aims to provide various functionalities for audio analysis and processing, making it easier to work with music datasets.
## Datasets
SoundSlayerAI makes use of the following datasets:
- Fhrozen/AudioSet2K22
- Chr0my/Epidemic_sounds
- ChristophSchuhmann/lyrics-index
- Cropinky/rap_lyrics_english
- tsterbak/eurovision-lyrics-1956-2023
- brunokreiner/genius-lyrics
- google/MusicCaps
- ccmusic-database/music_genre
- Hyeon2/riffusion-musiccaps-dataset
- SamAct/autotrain-data-musicprompt
- Chr0my/Epidemic_music
- juliensimon/autonlp-data-song-lyrics
- Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC
- Chr0my/freesound.org
- teticio/audio-diffusion-256
- KELONMYOSA/dusha_emotion_audio
- Ar4ikov/iemocap_audio_text_splitted
- flexthink/ljspeech
- mozilla-foundation/common_voice_13_0
- facebook/voxpopuli
- SocialGrep/one-million-reddit-jokes
- breadlicker45/human-midi-rlhf
- breadlicker45/midi-gpt-music-small
- projectlosangeles/Los-Angeles-MIDI-Dataset
- huggingartists/epic-rap-battles-of-history
- SocialGrep/one-million-reddit-confessions
- shahules786/prosocial-nsfw-reddit
- Thewillonline/reddit-sarcasm
- autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606
- lmsys/chatbot_arena_conversations
- mozilla-foundation/common_voice_11_0
- mozilla-foundation/common_voice_4_0
## Library
The core library used in this project is "pyannote-audio." This library provides a wide range of functionalities for audio analysis and processing, making it an excellent choice for working with music datasets. The "pyannote-audio" library offers a comprehensive set of tools and algorithms for tasks such as audio segmentation, speaker diarization, music transcription, and more.
## Metrics
To evaluate the performance of SoundSlayerAI, several metrics are employed, including:
- Accuracy
- Bertscore
- BLEU
- BLEURT
- Brier Score
- Character
These metrics help assess the effectiveness and accuracy of the implemented algorithms and models.
## Language
The SoundSlayerAI project primarily focuses on the English language. The datasets and models used in this project are optimized for English audio and text analysis tasks.
## Usage
To use SoundSlayerAI, follow these steps:
1. Install the required dependencies by running `pip install pyannote-audio`.
2. Import the necessary modules from the "pyannote.audio" package to access the desired functionalities.
3. Load the audio data or use the provided datasets to perform tasks such as audio segmentation, speaker diarization, music transcription, and more.
4. Apply the appropriate algorithms and models from the "pyannote.audio" library to process and analyze the audio data.
5. Evaluate the results using the specified metrics, such as accuracy, bertscore, BLEU, BLEURT, brier_score, and character.
6. Iterate and refine your approach to achieve the desired outcomes for your music-related tasks.
## License
SoundSlayerAI is released under the Openrail license. Please refer to the LICENSE file for more details.
## Contributions
Contributions to SoundSlayerAI are welcome! If you have any ideas, bug fixes, or enhancements, feel free to submit a pull request or open an issue on the GitHub repository.
## Contact
For any inquiries or questions regarding SoundSlayerAI, please reach out to the project maintainer at [insert email address].
Thank you for your interest in SoundSlayerAI! |