JustinLin610's picture
add easyocr
85d9fef
|
raw
history blame
2.38 kB

EasyOCR Lite

从EasyOCR提取文本定位有关代码,进一步适配中文,修正缺陷

安装

Python版本至少为3.8。

首先按照PyTorch官方说明安装PyTorch。

pip install -e .

使用

from easyocrlite import ReaderLite

reader = ReaderLite()
results = reader.process('my_awesome_handwriting.png')

返回的内容为边界框和对应的图像区域的列表。 其它说明见demo

致谢

基于EasyOCR修改实现。以下为EasyOCR致谢:

This project is based on research and code from several papers and open-source repositories.

All deep learning execution is based on Pytorch. :heart:

Detection execution uses the CRAFT algorithm from this official repository and their paper (Thanks @YoungminBaek from @clovaai). We also use their pretrained model. Training script is provided by @gmuffiness.

The recognition model is a CRNN (paper). It is composed of 3 main components: feature extraction (we are currently using Resnet) and VGG, sequence labeling (LSTM) and decoding (CTC). The training pipeline for recognition execution is a modified version of the deep-text-recognition-benchmark framework. (Thanks @ku21fan from @clovaai) This repository is a gem that deserves more recognition.

Beam search code is based on this repository and his blog. (Thanks @githubharald)

Data synthesis is based on TextRecognitionDataGenerator. (Thanks @Belval)

And a good read about CTC from distill.pub here.

许可证 (注意!)

Apache 2.0