Spaces:

gauravgulati619
/

MediVox

Running

App Files Files Community

MediVox / README.md

gauravgulati619's picture

gauravgulati619

Update README.md

562ec5a verified 13 days ago

|

history blame contribute delete

1.38 kB

A newer version of the Gradio SDK is available: 5.20.1

Upgrade

metadata

title: MediVox - AI Doctor with Vision and Voice
emoji: 👨‍⚕️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.19.0
app_file: app.py
pinned: false

AI Doctor with Vision and Voice

This is an AI-powered medical assistant that can:

Accept voice input from patients
Analyze medical images
Provide medical insights using RAG (Retrieval Augmented Generation)
Respond with natural voice output

Features

Speech-to-Text using Whisper
Image Analysis using LLaVA
RAG using FAISS and medical knowledge base
Text-to-Speech using ElevenLabs
Context-aware responses using medical domain knowledge

Environment Variables Required

GROQ_API_KEY=your_groq_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key

Usage

Click the microphone button to record your question
Upload or take a picture of the medical condition
Wait for the AI doctor to analyze and respond
Listen to the voice response or read the text output

Model Details

Vision Model: LLaVA 3.2 11B
Speech-to-Text: Whisper Large V3
Text Generation: Groq
Voice Generation: ElevenLabs
Embeddings: sentence-transformers/all-mpnet-base-v2

Citation

If you use this space, please cite:

@misc{medivoicebot2024,
  author = {Gaurav Gulati},
  title = {AI Doctor with Vision and Voice},
  year = {2024},
  publisher = {Hugging Face Spaces},
}