Spaces:

DhakkiTikki
/

v2vtranslator

Runtime error

App Files Files Community

v2vtranslator / README.md

DhakkiTikki

Update README.md

dc08bae verified 9 months ago

preview code

raw

history blame contribute delete

2.4 kB

	---
	title: Cloned Voice Tanslator
	emoji: 📈
	colorFrom: gray
	colorTo: pink
	sdk: gradio
	sdk_version: 4.37.2
	app_file: app.py
	pinned: false
	license: mit
	short_description: Voice-to-Voice Translation with your own voice model
	---

	# Multilingual Voice-to-Voice Translation App 🎙️🔊

	## Overview

	This repository hosts a open source Gradio-based application that translates your voice into multiple languages. The app leverages state-of-the-art models for transcription, language detection, translation, and text-to-speech synthesis to provide an end-to-end solution for real-time multilingual voice translation.

	## Features

	- Transcription: Converts spoken language into written text.
	- Automatic Language Detection: Detects the language of the spoken input automatically.
	- Translation: Translates the transcribed text into a selected target language.
	- Text-to-Speech: Converts the translated text back into speech, mimicking the original speaker's voice as closely as possible.
	- Language Selection: Supports 17 languages for translation and speech synthesis.
	- User Agreement: Includes an option to agree to the COQUI terms and conditions before using the service.

	## Installation/Usage

	### Method 1 (HTTPS)

	App currently hosted on HuggingFace Spaces. Use the link below to access:

	[v2vtranslator - HugginFace Spaces](https://huggingface.co/spaces/DhakkiTikki/v2vtranslator)

	### Method 2 (local)

	1. Clone the repository:

	```sh
	git clone https://github.com/pawan971/v2vtranslator
	cd v2vtranslator
	```

	2. Install the required packages:

	```sh
	pip install -r requirements.txt
	```

	3. Run the application:

	```sh
	python app.py
	```

	## Models Used

	- Whisper: Used for audio transcription and automatic language detection.
	- translate: Used for text translation between languages.
	- XTTS-v2: Used for text-to-speech synthesis to generate audio from translated text in your voice.

	## Open Source

	This project is open source and contributions are welcome! Feel free to open issues, submit pull requests, or fork the repository to add your own features.

	## License

	This project is licensed under the MIT License. See the `LICENSE` file for more details.

	## Acknowledgments

	Special thanks to the developers of Whisper, translate, and XTTS-v2 for providing the foundational models used in this application.

	---