
Swedish OCR Correction

This model is an updated version of https://huggingface.co/viklofg/swedish-ocr-correction

The model has been trained to correct OCR output from ABBYY, Tesseract, and a combination of the two on Swedish newspapers from 1818 to 2018 (see "A Two-OCR Engine Method for Digitized Swedish Newspapers").

Please check the original model for more information.
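
As a rough illustration of how the model might be called, here is a minimal sketch. It assumes the checkpoint loads through the generic `transformers` sequence-to-sequence classes, which this card does not confirm; the repository ID `KBLab/swedish-ocr-correction` is the only name taken from this page. The helper `build_corrector` and the `max_new_tokens` default are hypothetical choices made for the example.

```python
MODEL_ID = "KBLab/swedish-ocr-correction"  # repository ID of this model


def build_corrector(model_id: str = MODEL_ID):
    """Load the model once and return a function that corrects one string.

    Assumption: the checkpoint is compatible with AutoTokenizer and
    AutoModelForSeq2SeqLM. This card does not state that explicitly.
    """
    # Imported lazily so this module can be read or imported
    # even where transformers is not installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    def correct(text: str, max_new_tokens: int = 256) -> str:
        # Encode the noisy OCR text, generate a corrected sequence,
        # and decode it back to a plain string.
        inputs = tokenizer(text, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)

    return correct


# Usage (downloads the weights on first call):
#   correct = build_corrector()
#   print(correct("some noisy OCR text"))
```

Returning a closure keeps the tokenizer and model loaded between calls, which matters when correcting many short passages. Any input length limit should be checked against the original model's card.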

This new model was trained for considerably longer and outperforms the previous one on the same train-test split, with lower character and word error rates (CER and WER):

| Model | CER | WER |
|---|---|---|
| Original OCR | 3.01 | 13.23 |
| viklofg | 1.92 | 7.41 |
| KBLab | 1.57 | 6.23 |
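
For readers unfamiliar with the two metrics, the sketch below shows the usual definitions: edit distance divided by reference length, counted over characters for CER and over whitespace-separated words for WER. This is an assumption about how the figures were produced; the card does not say whether the authors aggregated over the whole test set or averaged per sample, nor which tool they used. The function names are hypothetical.

```python
def edit_distance(ref, hyp) -> int:
    """Levenshtein distance between two sequences (strings or lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete r from ref
                            curr[j - 1] + 1,      # insert h into ref
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character edits per reference character."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word edits per reference word."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def relative_reduction(before: float, after: float) -> float:
    """Fraction by which an error rate dropped, e.g. 0.25 for a 25% drop."""
    return (before - after) / before
```

Applied to the table, `relative_reduction(1.92, 1.57)` is about 0.18 and `relative_reduction(7.41, 6.23)` is about 0.16, so the KBLab model makes roughly 18% fewer character errors and 16% fewer word errors than the viklofg model.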

Model size: 300M parameters (Safetensors, tensor type F32)