Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- donut
|
4 |
+
- image-to-text
|
5 |
+
- invoices
|
6 |
+
---
|
7 |
+
# Overview
|
8 |
+
This repository contains a fine-tuned version of the Donut model for document understanding, specifically tailored for invoice processing. The Donut model is based on the OCR-free Document Understanding Transformer, introduced in the paper by Geewok et al. OCR-free Document Understanding Transformer, and initially released in the repository https://github.com/clovaai/donut.
|
9 |
+
|
10 |
+
The purpose of this custom fine-tuning is to enhance the Donut model's performance specifically for invoice analysis and extraction. The model was trained on a custom dataset of annotated invoices, comprising several hundred examples. Although the dataset is not included in this repository, details on its availability will be provided later.
|
11 |
+
|
12 |
+
# Model Details
|
13 |
+
|
14 |
+
The Donut model is a transformer-based architecture that leverages self-attention mechanisms for document understanding. By fine-tuning the model with a custom dataset of invoices, we aim to improve its ability to accurately extract relevant information from invoices, such as vendor details, billing information, line items, and totals.
|
15 |
+
|
16 |
+
[Demo can be found here](https://colab.research.google.com/drive/1zDvSysp24bCk60LR6172Z94eY1mRhKWF#scrollTo=f7RoSOEXUa6i)
|