diversen/doctr-torch-crnn_vgg16_bn-danish-v1 · How can I train the docTR model? How can I create an efficient dataset for training it?

Hi,

The model is fine-tuned from the French model. The training was done using only synthetic images. I think this approach will work quite well for all languages written in the Latin alphabet.

The training material is made using TextRecognitionDataGenerator.

Because of some difficulties installing it, I have made a clone of TextRecognitionDataGenerator at:

https://github.com/diversen/TextRecognitionDataGenerator

In the README you can see instructions on how to generate synthetic images which can then be used as training data.

E.g. if you want to train a model, you could use 250,000 words and make 8 images for each word.

Then you have 2,000,000 images (250,000 × 8) as training material.
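
As a rough illustration of the numbers above, and of how the generated images might be turned into labels for training, here is a minimal sketch. It assumes the generator wrote files named `<text>_<index>.jpg` and that the recognition training expects an image folder plus a `labels.json` mapping each file name to its text; check both assumptions against the README and the docTR repo. The function names `label_from_filename` and `build_labels` are made up for this example.

```python
import json
from pathlib import Path


def label_from_filename(filename: str) -> str:
    """Recover the word from a file name, assuming '<text>_<index>.<ext>'."""
    stem = Path(filename).stem
    text, sep, _index = stem.rpartition("_")
    # If there is no underscore, fall back to the whole stem.
    return text if sep else stem


def build_labels(image_dir: str, labels_path: str) -> dict:
    """Write a JSON file mapping each .jpg file name to its text label."""
    labels = {
        path.name: label_from_filename(path.name)
        for path in sorted(Path(image_dir).glob("*.jpg"))
    }
    Path(labels_path).write_text(
        json.dumps(labels, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return labels


# Dataset size from the example above: 250,000 words, 8 images per word.
words = 250_000
images_per_word = 8
total_images = words * images_per_word  # 2,000,000
```

You would call `build_labels("train/images", "train/labels.json")` once for the training folder and once for a smaller validation folder (the paths are only placeholders).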

There are also a couple of notes about how to fine-tune a docTR model using this data. This process is also described quite well in the docTR repo.