|
--- |
|
license: mit |
|
language: |
|
- en |
|
pipeline_tag: image-to-text |
|
datasets: |
|
- katanaml-org/invoices-donut-data-v1 |
|
--- |
|
|
|
## Sparrow - Data extraction from documents with ML |
|
|
|
This model is finetuned Donut ML base model on invoices data. Model aims to verify how well Donut performs on enterprise docs. |
|
|
|
Mean accuracy on test set: 0.96 |
|
|
|
Inference: |
|
|
|
 |
|
|
|
Training loss: |
|
|
|
 |
|
|
|
Sparrow on [GitHub](https://github.com/katanaml/sparrow) |
|
|
|
Sample invoice [docs](https://github.com/katanaml/sparrow/tree/main/sparrow-ui/docs/images) to use for inference (docs up to 500 were used for fine-tuning, use docs from 500 for inference) |
|
|
|
Our website [KatanaML](https://www.katanaml.io) |
|
|
|
On [Twitter](https://twitter.com/katana_ml) |