Spaces:
Build error
Build error
# ElevenLabs Voice Cloning Guide | |
This README serves as a guide on how to use ElevenLabs for voice cloning. Follow the steps below to clone a voice, test it, and fine-tune it for the best results. | |
## Collecting Data | |
Before you start, you'll need voice data. Download high quality vocal only audio clips. Check the [training_data](./training_data) folder for reference. | |
If you're creating your own dataset, ensure the audio is high quality. It should have no background noise, clear pronunciation. | |
The audio format must be mp3 and should be about 1 minute long in total. | |
## Creating an ElevenLabs Account | |
Visit [ElevenLabs](https://beta.elevenlabs.io/) to create an account. You'll need this to access the speech synthesis and voice cloning features. | |
Get your `ELEVEN_LABS_API_KEY`: | |
1. Click profile icon and select 'profile'. | |
2. Copy API Key | |
## Speech Synthesis/Voice Cloning | |
Follow these steps to clone a voice: | |
1. Go to the [speech synthesis page](https://beta.elevenlabs.io/speech-synthesis). | |
2. Click "Add Voice". | |
3. Click "Add Generative or Cloned Voice". | |
4. Click "Instant Voice Cloning". | |
5. Fill in all the required information and upload your audio samples. | |
6. Click "Add Voice". | |
## Testing Your Voice | |
To test the voice you've just created: | |
1. Go back to the [speech synthesis page](https://beta.elevenlabs.io/speech-synthesis). | |
2. Choose the voice you just created in Settings. | |
4. Type some text and click "Generate". | |
## Fine-tuning Your Voice | |
You can make the voice read better by adjusting system and user prompts. | |
Here are some tips: | |
- If the voice is too monotone, lower the Stability to make it more emotional. However, setting the Stability to zero can sometimes lead to a strange accent. | |
- Longer sentences tend to be spoken better because they provide more context for the AI speaker to understand. | |
- For shorter sentences that are spoken too quickly, replace "." with "...". Add "-" or a newline for a pause. | |
- Add emotion-related words or phrases, or use punctuation marks like β!β, β?β to add emotions to the voice. | |
## Using Your Custom Voice in Our Project | |
You need the voice id of cloned voice. Here's how: | |
1. go to https://api.elevenlabs.io/docs | |
2. choose Get Voices api | |
3. follow the instruction and find the specific voice_id in the Responses. | |
4. Do not forget to update your .env file with `ELEVEN_LABS_API_KEY` and voice ids. |