This application allows users to generate EAD/XML records using voice commands. Users can record their instructions, highlight existing XML content to add context, and send the combined prompt to the Ollama model for processing.
To use the application:
This application utilizes the Xenova/whisper-small model for audio transcription, which is implemented using transformers.js and leverages WebGPU in the browser for efficient processing.
For EAD generation, the application uses the Q5_K_M quantized variant of the fine-tuned model Geraldine/FineLlama-3.2-3B-Instruct-ead. This model is designed to understand and generate EAD/XML records based on the user’s instructions and context.
See this blog post for explanations on fine-tuning
To use the WebGPU features of this application, ensure that you are using a compatible browser. The following requirements must be met: