Visual Question Answering (VQA) for Medical Imaging

Kalbe Digital Lab

Overview

This project addresses the challenge of accurate and efficient medical imaging analysis in healthcare, aiming to reduce human error and workload for radiologists. The proposed solution involves developing advanced AI models for Visual Question Answering (VQA) to assist healthcare professionals in analyzing medical images (radiology images) quickly and accurately. We fine-tune HuggingFace multimodal model Idefics2-8b using radiology VQA datasets.

Dataset

We fine-tune pre-trained model using these datasets :

Model Architecture

The model is trained using Idefics2-8b.

model-architecture

Demo

Please upload an image and question or select from the examples to see the answer prediction