Spaces:
Runtime error
Runtime error
File size: 2,404 Bytes
f68990a dc08bae f68990a b9acd9b f68990a c41b7d8 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 |
---
title: Cloned Voice Tanslator
emoji: π
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 4.37.2
app_file: app.py
pinned: false
license: mit
short_description: Voice-to-Voice Translation with your own voice model
---
# Multilingual Voice-to-Voice Translation App ποΈπ
## Overview
This repository hosts a open source Gradio-based application that translates your voice into multiple languages. The app leverages state-of-the-art models for transcription, language detection, translation, and text-to-speech synthesis to provide an end-to-end solution for real-time multilingual voice translation.
## Features
- **Transcription**: Converts spoken language into written text.
- **Automatic Language Detection**: Detects the language of the spoken input automatically.
- **Translation**: Translates the transcribed text into a selected target language.
- **Text-to-Speech**: Converts the translated text back into speech, mimicking the original speaker's voice as closely as possible.
- **Language Selection**: Supports 17 languages for translation and speech synthesis.
- **User Agreement**: Includes an option to agree to the COQUI terms and conditions before using the service.
## Installation/Usage
### Method 1 (HTTPS)
App currently hosted on HuggingFace Spaces. Use the link below to access:
[v2vtranslator - HugginFace Spaces](https://huggingface.co/spaces/DhakkiTikki/v2vtranslator)
### Method 2 (local)
1. Clone the repository:
```sh
git clone https://github.com/pawan971/v2vtranslator
cd v2vtranslator
```
2. Install the required packages:
```sh
pip install -r requirements.txt
```
3. Run the application:
```sh
python app.py
```
## Models Used
- **Whisper**: Used for audio transcription and automatic language detection.
- **translate**: Used for text translation between languages.
- **XTTS-v2**: Used for text-to-speech synthesis to generate audio from translated text in your voice.
## Open Source
This project is open source and contributions are welcome! Feel free to open issues, submit pull requests, or fork the repository to add your own features.
## License
This project is licensed under the MIT License. See the `LICENSE` file for more details.
## Acknowledgments
Special thanks to the developers of Whisper, translate, and XTTS-v2 for providing the foundational models used in this application.
--- |