---
language:
- de
license: apache-2.0
tags:
- voice
- classification
- age
- gender
- speech
- audio
datasets:
- mozilla-foundation/common_voice_12_0
widget:
- src: >-
    https://huggingface.co/padmalcom/wav2vec2-asr-ultimate-german/resolve/main/test.wav
  example_title: Sample 1
pipeline_tag: audio-classification
metrics:
- accuracy
---

This multi-task wav2vec2 based asr model has two additional classification heads to detect:
- age
- gender

... of the current speaker in one forward pass.

It was trained on  [mozilla common voice](https://commonvoice.mozilla.org/).

Code for training can be found [here](https://github.com/padmalcom/wav2vec2-asr-ultimate-german).

*inference_online.py* shows, how the model can be used.