Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
@@ -16,7 +16,7 @@ with demo:
|
|
16 |
This project leverages Donut model proposed in this <a href=\"https://arxiv.org/abs/2111.15664/\">paper </a> for the parsing of the required data from cheques." \
|
17 |
"Donut is based on a very simple transformer encoder and decoder architecture. It's main USP is that it is an OCR-free approach to information extraction from documents. \
|
18 |
OCR based techniques come with several limitations such as use of additional downstream models, lack of understanding about document structure, use of hand crafted rules,etc. \
|
19 |
-
Donut helps you get rid of all of these OCR specific limitations.")
|
20 |
|
21 |
|
22 |
with gr.Tabs():
|
|
|
16 |
This project leverages Donut model proposed in this <a href=\"https://arxiv.org/abs/2111.15664/\">paper </a> for the parsing of the required data from cheques." \
|
17 |
"Donut is based on a very simple transformer encoder and decoder architecture. It's main USP is that it is an OCR-free approach to information extraction from documents. \
|
18 |
OCR based techniques come with several limitations such as use of additional downstream models, lack of understanding about document structure, use of hand crafted rules,etc. \
|
19 |
+
Donut helps you get rid of all of these OCR specific limitations. The model for the project has been trained using this <a href=\"https://huggingface.co/datasets/shivi/cheques_sample_data/\"> dataset </a>. This HF dataset is actually a filtered version of this <a href=\"https://www.kaggle.com/datasets/medali1992/cheque-images/\"> kaggle dataset </a>.")
|
20 |
|
21 |
|
22 |
with gr.Tabs():
|