---
title: Cloned Voice Translator
emoji: 📈
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 4.37.2
app_file: app.py
pinned: false
license: mit
short_description: Voice-to-Voice Translation with your own voice model
---

# Multilingual Voice-to-Voice Translation App 🎙️🔊

## Overview

This repository hosts an open-source Gradio application that translates your speech into other languages while keeping your own voice. The app chains models for transcription, language detection, translation, and text-to-speech synthesis into an end-to-end pipeline for multilingual voice translation.

## Features

- **Transcription**: Converts spoken language into written text.
- **Automatic Language Detection**: Detects the language of the spoken input automatically.
- **Translation**: Translates the transcribed text into a selected target language.
- **Text-to-Speech**: Converts the translated text back into speech, mimicking the original speaker's voice as closely as possible.
- **Language Selection**: Supports 17 languages for translation and speech synthesis.
- **User Agreement**: Requires you to agree to the Coqui terms and conditions before using the service.
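
The language-selection and user-agreement features amount to two checks that run before any model is called. The snippet below is a minimal sketch of that idea, not the code in `app.py`: the `check_request` function is hypothetical, and the language codes are assumed from the 17 languages listed on the XTTS-v2 model card rather than read from this app.

```python
# Assumption: these are the 17 language codes listed on the XTTS-v2 model card.
# The app's own list may be named or ordered differently.
XTTS_LANGUAGES = (
    "en", "es", "fr", "de", "it", "pt", "pl", "tr", "ru",
    "nl", "cs", "ar", "zh-cn", "ja", "hu", "ko", "hi",
)


def check_request(target_language: str, agreed_to_terms: bool) -> str:
    """Hypothetical pre-flight check: return the normalized language code or raise."""
    # Refuse to do any work until the user has accepted the terms.
    if not agreed_to_terms:
        raise PermissionError("You must agree to the Coqui terms before translating.")

    # Normalize the code so " FR " and "fr" are treated the same.
    code = target_language.strip().lower()
    if code not in XTTS_LANGUAGES:
        raise ValueError(f"Unsupported target language: {target_language!r}")
    return code


if __name__ == "__main__":
    print(check_request("fr", agreed_to_terms=True))
```

Running both checks first means a rejected request never triggers model loading or inference.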

## Installation/Usage

### Method 1 (HTTPS)
 
The app is currently hosted on Hugging Face Spaces. Use the link below to access it:

[v2vtranslator - Hugging Face Spaces](https://huggingface.co/spaces/DhakkiTikki/v2vtranslator)

### Method 2 (local)

1. Clone the repository:

    ```sh
    git clone https://github.com/pawan971/v2vtranslator
    cd v2vtranslator
    ```

2. Install the required packages:

    ```sh
    pip install -r requirements.txt
    ```

3. Run the application:

    ```sh
    python app.py
    ```

## Models Used

- **Whisper**: Used for audio transcription and automatic language detection.
- **translate**: Used for text translation between languages.
- **XTTS-v2**: Used for text-to-speech synthesis to generate audio from translated text in your voice.
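
The three components above form a pipeline: transcribe and detect the language, translate the text, then synthesize speech using the input recording as the voice reference. The following is a minimal sketch of that flow, assuming each stage is passed in as a plain callable. The names `translate_voice` and `V2VResult` and the stub stages are illustrative and are not taken from `app.py`.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Stage signatures (assumed for this sketch):
Transcriber = Callable[[str], Tuple[str, str]]    # audio path -> (text, language code)
TextTranslator = Callable[[str, str, str], str]   # (text, source, target) -> translated text
Synthesizer = Callable[[str, str, str], str]      # (text, language, speaker wav) -> output path


@dataclass
class V2VResult:
    detected_language: str
    transcript: str
    translation: str
    audio_path: str


def translate_voice(
    audio_path: str,
    target_language: str,
    transcribe: Transcriber,
    translate_text: TextTranslator,
    synthesize: Synthesizer,
) -> V2VResult:
    """Run transcription, translation, and voice-cloned synthesis in order."""
    # 1. Speech to text, with the source language detected from the audio.
    transcript, detected = transcribe(audio_path)

    # 2. Text to text. Skip translation when source and target already match.
    if detected == target_language:
        translation = transcript
    else:
        translation = translate_text(transcript, detected, target_language)

    # 3. Text to speech, using the original recording as the voice reference.
    output_path = synthesize(translation, target_language, audio_path)
    return V2VResult(detected, transcript, translation, output_path)


if __name__ == "__main__":
    # Stub stages so the sketch runs without downloading any model.
    result = translate_voice(
        "input.wav",
        "es",
        transcribe=lambda path: ("hello", "en"),
        translate_text=lambda text, src, dst: f"[{src}->{dst}] {text}",
        synthesize=lambda text, lang, speaker_wav: "output.wav",
    )
    print(result)
```

With the real libraries, `transcribe` might wrap `whisper.load_model(...).transcribe(path)`, which returns a dict with `text` and `language` keys; `translate_text` might wrap `Translator(from_lang=..., to_lang=...).translate(text)` from the `translate` package; and `synthesize` might wrap `tts_to_file(text=..., speaker_wav=..., language=..., file_path=...)` from Coqui's `TTS.api.TTS`. How `app.py` actually wires these together may differ.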

## Open Source

This project is open source and contributions are welcome! Feel free to open issues, submit pull requests, or fork the repository to add your own features.

## License

This project is licensed under the MIT License. See the `LICENSE` file for more details.

## Acknowledgments

Special thanks to the developers of Whisper, translate, and XTTS-v2 for providing the foundational models used in this application.

---