Spaces:
Runtime error
Runtime error
title: Cloned Voice Tanslator | |
emoji: π | |
colorFrom: gray | |
colorTo: pink | |
sdk: gradio | |
sdk_version: 4.37.2 | |
app_file: app.py | |
pinned: false | |
license: mit | |
short_description: Voice-to-Voice Translation with your own voice model | |
# Multilingual Voice-to-Voice Translation App ποΈπ | |
## Overview | |
This repository hosts a open source Gradio-based application that translates your voice into multiple languages. The app leverages state-of-the-art models for transcription, language detection, translation, and text-to-speech synthesis to provide an end-to-end solution for real-time multilingual voice translation. | |
## Features | |
- **Transcription**: Converts spoken language into written text. | |
- **Automatic Language Detection**: Detects the language of the spoken input automatically. | |
- **Translation**: Translates the transcribed text into a selected target language. | |
- **Text-to-Speech**: Converts the translated text back into speech, mimicking the original speaker's voice as closely as possible. | |
- **Language Selection**: Supports 17 languages for translation and speech synthesis. | |
- **User Agreement**: Includes an option to agree to the COQUI terms and conditions before using the service. | |
## Installation/Usage | |
### Method 1 (HTTPS) | |
App currently hosted on HuggingFace Spaces. Use the link below to access: | |
[v2vtranslator - HugginFace Spaces](https://huggingface.co/spaces/DhakkiTikki/v2vtranslator) | |
### Method 2 (local) | |
1. Clone the repository: | |
```sh | |
git clone https://github.com/pawan971/v2vtranslator | |
cd v2vtranslator | |
``` | |
2. Install the required packages: | |
```sh | |
pip install -r requirements.txt | |
``` | |
3. Run the application: | |
```sh | |
python app.py | |
``` | |
## Models Used | |
- **Whisper**: Used for audio transcription and automatic language detection. | |
- **translate**: Used for text translation between languages. | |
- **XTTS-v2**: Used for text-to-speech synthesis to generate audio from translated text in your voice. | |
## Open Source | |
This project is open source and contributions are welcome! Feel free to open issues, submit pull requests, or fork the repository to add your own features. | |
## License | |
This project is licensed under the MIT License. See the `LICENSE` file for more details. | |
## Acknowledgments | |
Special thanks to the developers of Whisper, translate, and XTTS-v2 for providing the foundational models used in this application. | |
--- |