Add fimu-docproc-research/vitstr_small_DoctrOcrEngine model

Files changed (3) hide show

README.md ADDED Viewed

+---
+language: cz
+---
+**Optical Character Recognition made seamless & accessible to anyone, powered by PyTorch**
+## Task: recognition
+### Example usage:
+```python
+>>> from doctr.io import DocumentFile
+>>> from doctr.models import ocr_predictor, from_hub
+>>> img = DocumentFile.from_images(['<image_path>'])
+>>> # Load your model from the hub
+>>> model = from_hub('mindee/my-model')
+>>> # Pass it to the predictor
+>>> # If your model is a recognition model:
+>>> predictor = ocr_predictor(det_arch='db_resnet50',
+>>>                           reco_arch=model,
+>>>                           pretrained=True)
+>>> # Get your predictions
+>>> res = predictor(img)
+```
+Training configuration and logs: https://wandb.ai/xbankov/text-recognition
+### Run Configuration
+{
+  "hf_dataset_name": "fimu-docproc-research/born_digital_recognition",
+  "name": "vitstr_small_25_512_32_0.03481964801161507_0.05483515188085567_cosine_c368f4a3_92dfd51b",
+  "epochs": 25,
+  "lr": 0.03481964801161507,
+  "weight_decay": 0.05483515188085567,
+  "batch_size": 512,
+  "input_size": 32,
+  "sched": "cosine",
+  "sample": null,
+  "workers": 16,
+  "wb": true,
+  "push_to_hub": "fimu-docproc-research/vitstr_small",
+  "test_only": false,
+  "arch": "vitstr_small"
+}

config.json ADDED Viewed

+{
+  "mean": [
+    0.694,
+    0.695,
+    0.693
+  ],
+  "std": [
+    0.299,
+    0.296,
+    0.301
+  ],
+  "input_shape": [
+    3,
+    32,
+    128
+  ],
+  "vocab": "®©äàőôöüűĺľŕĽÖÜß§0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~°£€¥¢฿áčďéěíňóřšťúůýžÁČĎÉĚÍŇÓŘŠŤÚŮÝŽ",
+  "url": null,
+  "arch": "vitstr_small",
+  "task": "recognition"
+}

pytorch_model.bin ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5b947470496b60c222ce9ac8d1d14ba484aa942018d5b46b92da324fea53a46d
+size 85800521