Spaces:
Runtime error
A newer version of the Gradio SDK is available:
5.23.3
title: Cloned Voice Tanslator
emoji: π
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 4.37.2
app_file: app.py
pinned: false
license: mit
short_description: Voice-to-Voice Translation with your own voice model
Multilingual Voice-to-Voice Translation App ποΈπ
Overview
This repository hosts a open source Gradio-based application that translates your voice into multiple languages. The app leverages state-of-the-art models for transcription, language detection, translation, and text-to-speech synthesis to provide an end-to-end solution for real-time multilingual voice translation.
Features
- Transcription: Converts spoken language into written text.
- Automatic Language Detection: Detects the language of the spoken input automatically.
- Translation: Translates the transcribed text into a selected target language.
- Text-to-Speech: Converts the translated text back into speech, mimicking the original speaker's voice as closely as possible.
- Language Selection: Supports 17 languages for translation and speech synthesis.
- User Agreement: Includes an option to agree to the COQUI terms and conditions before using the service.
Installation/Usage
Method 1 (HTTPS)
App currently hosted on HuggingFace Spaces. Use the link below to access:
v2vtranslator - HugginFace Spaces
Method 2 (local)
Clone the repository:
git clone https://github.com/pawan971/v2vtranslator cd v2vtranslator
Install the required packages:
pip install -r requirements.txt
Run the application:
python app.py
Models Used
- Whisper: Used for audio transcription and automatic language detection.
- translate: Used for text translation between languages.
- XTTS-v2: Used for text-to-speech synthesis to generate audio from translated text in your voice.
Open Source
This project is open source and contributions are welcome! Feel free to open issues, submit pull requests, or fork the repository to add your own features.
License
This project is licensed under the MIT License. See the LICENSE
file for more details.
Acknowledgments
Special thanks to the developers of Whisper, translate, and XTTS-v2 for providing the foundational models used in this application.