v2vtranslator / README.md
DhakkiTikki's picture
Update README.md
dc08bae verified
---
title: Cloned Voice Tanslator
emoji: πŸ“ˆ
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 4.37.2
app_file: app.py
pinned: false
license: mit
short_description: Voice-to-Voice Translation with your own voice model
---
# Multilingual Voice-to-Voice Translation App πŸŽ™οΈπŸ”Š
## Overview
This repository hosts a open source Gradio-based application that translates your voice into multiple languages. The app leverages state-of-the-art models for transcription, language detection, translation, and text-to-speech synthesis to provide an end-to-end solution for real-time multilingual voice translation.
## Features
- **Transcription**: Converts spoken language into written text.
- **Automatic Language Detection**: Detects the language of the spoken input automatically.
- **Translation**: Translates the transcribed text into a selected target language.
- **Text-to-Speech**: Converts the translated text back into speech, mimicking the original speaker's voice as closely as possible.
- **Language Selection**: Supports 17 languages for translation and speech synthesis.
- **User Agreement**: Includes an option to agree to the COQUI terms and conditions before using the service.
## Installation/Usage
### Method 1 (HTTPS)
App currently hosted on HuggingFace Spaces. Use the link below to access:
[v2vtranslator - HugginFace Spaces](https://huggingface.co/spaces/DhakkiTikki/v2vtranslator)
### Method 2 (local)
1. Clone the repository:
```sh
git clone https://github.com/pawan971/v2vtranslator
cd v2vtranslator
```
2. Install the required packages:
```sh
pip install -r requirements.txt
```
3. Run the application:
```sh
python app.py
```
## Models Used
- **Whisper**: Used for audio transcription and automatic language detection.
- **translate**: Used for text translation between languages.
- **XTTS-v2**: Used for text-to-speech synthesis to generate audio from translated text in your voice.
## Open Source
This project is open source and contributions are welcome! Feel free to open issues, submit pull requests, or fork the repository to add your own features.
## License
This project is licensed under the MIT License. See the `LICENSE` file for more details.
## Acknowledgments
Special thanks to the developers of Whisper, translate, and XTTS-v2 for providing the foundational models used in this application.
---