Darwin Bautista committed on
Commit
8632fdd
1 Parent(s): 2788cde

Initial commit of weights

Files changed (3)
  1. README.md +38 -0
  2. pytorch_model.bin +3 -0
  3. torchscript_model.bin +3 -0
README.md ADDED
@@ -0,0 +1,38 @@
+ ---
+ language:
+ - en
+ license: apache-2.0
+ tags:
+ - image-to-text
+ ---
+
+ # ViTSTR small v1.0
+
+ ViTSTR model pre-trained on various real [STR datasets](https://github.com/baudm/parseq/blob/main/Datasets.md) at image size 224x224 with a patch size of 16x16.
+
+ Disclaimer: this model card was not written by the original author.
+
+ ## Model description
+
+ *TODO*
+
+ ## Intended uses & limitations
+
+ You can use the model for STR on images containing Latin characters (62 case-sensitive alphanumeric + 32 punctuation marks).
+
+ ### How to use
+
+ *TODO*
+
+ ### BibTeX entry and citation info
+
+ ```bibtex
+ @InProceedings{atienza2021vision,
+ title={Vision transformer for fast and efficient scene text recognition},
+ author={Atienza, Rowel},
+ booktitle={International Conference on Document Analysis and Recognition},
+ pages={319--334},
+ year={2021},
+ organization={Springer}
+ }
+ ```
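
The numbers quoted in the README hunk above can be checked with a little arithmetic. The following is a minimal sketch, not the model's own code. It assumes the 224x224 input is split into non-overlapping square patches (the usual ViT convention) and that the 32 punctuation marks are the ASCII set exposed by Python's `string.punctuation`; the README states neither explicitly. The names `patch_grid`, `ALPHANUMERIC`, and `CHARSET` are hypothetical.

```python
import string


def patch_grid(image_size: int, patch_size: int) -> tuple[int, int]:
    """Return (patches per side, total patches) for a square image
    cut into non-overlapping square patches."""
    if image_size % patch_size != 0:
        raise ValueError("image size must be a multiple of the patch size")
    per_side = image_size // patch_size
    return per_side, per_side * per_side


# Assumption: the character set is digits, case-sensitive Latin letters,
# and the ASCII punctuation marks from string.punctuation.
ALPHANUMERIC = string.digits + string.ascii_letters
CHARSET = ALPHANUMERIC + string.punctuation

if __name__ == "__main__":
    per_side, total = patch_grid(224, 16)
    print(f"{per_side} x {per_side} grid, {total} patches")
    print(f"{len(ALPHANUMERIC)} alphanumeric + {len(string.punctuation)} "
          f"punctuation = {len(CHARSET)} characters")
```

Under these assumptions the image yields a 14 x 14 grid of patches, and the character set has 94 entries, matching the "62 + 32" breakdown in the README.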
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:add948d03ba3e53c7ea68fe043fd76f67fc4f6cd7922fb13ed5cfc6ddb386657
+ size 86864401
torchscript_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:213ed22ee81d4f05ec67b66b266ffbeac89386c42804c10e659d06c9d3a79fbf
+ size 86875678
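
The two `.bin` entries in this commit are Git LFS pointer files rather than the weights themselves: each records the spec version, a SHA-256 object ID, and the object size in bytes. The following is a minimal sketch of reading those three fields from a pointer's text. `LfsPointer` and `parse_lfs_pointer` are hypothetical names for illustration and are not part of any Git LFS tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    version: str    # URL of the pointer spec
    hash_algo: str  # hash method named before the colon in the oid line
    oid: str        # hex digest of the real file
    size: int       # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algo, digest, int(fields["size"]))


# Pointer content copied from the pytorch_model.bin hunk of this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:add948d03ba3e53c7ea68fe043fd76f67fc4f6cd7922fb13ed5cfc6ddb386657
size 86864401
"""

if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER)
    print(pointer.hash_algo, pointer.oid[:8], pointer.size)
```

After downloading the real file, comparing its byte length and SHA-256 digest against `size` and `oid` is one way to confirm the download is intact.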