---
license: mit
base_model:
- google/efficientnet-b0
---
EfficientNet-B0 Document Image Classifier
This image classification model is based on Google's EfficientNet-B0 and fine-tuned to assign each input image to one of the following 16 categories:
- bar_chart
- bar_code
- chemistry_markush_structure
- chemistry_molecular_structure
- flow_chart
- icon
- line_chart
- logo
- map
- other
- pie_chart
- qr_code
- remote_sensing
- screenshot
- signature
- stamp
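As a minimal sketch of how a 16-way classifier output could be turned into one of the categories above, the snippet below applies a softmax to a vector of raw scores and ranks the labels. It uses only the Python standard library and does not load the model. The helper names `softmax` and `top_k`, the example scores, and the assumption that output index `i` matches the `i`-th entry of the list in the order shown are all illustrative; the index-to-label mapping shipped with the model (for example `id2label` in a Hugging Face config) should take precedence.

```python
import math

# Category names copied from the list above.
# ASSUMPTION: output index i corresponds to the i-th entry in this order;
# check the mapping shipped with the model before relying on it.
LABELS = [
    "bar_chart", "bar_code", "chemistry_markush_structure",
    "chemistry_molecular_structure", "flow_chart", "icon", "line_chart",
    "logo", "map", "other", "pie_chart", "qr_code", "remote_sensing",
    "screenshot", "signature", "stamp",
]


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, k=3):
    """Return the k most likely (label, probability) pairs, best first."""
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    probs = softmax(logits)
    ranked = sorted(zip(LABELS, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical scores for one image: index 10 (pie_chart) scores highest,
# index 0 (bar_chart) second.
example_logits = [0.0] * len(LABELS)
example_logits[10] = 4.0
example_logits[0] = 2.0

for label, prob in top_k(example_logits, k=2):
    print(f"{label}: {prob:.3f}")
```

Returning the top few labels rather than only the best one makes it easier to spot ambiguous inputs, such as a chart that scores similarly for `bar_chart` and `line_chart`.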
Citation
If you use this model in your work, please cite the following papers:
@article{Tan2019EfficientNetRM,
title={EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks},
author={Mingxing Tan and Quoc V. Le},
journal={ArXiv},
year={2019},
volume={abs/1905.11946}
}
@techreport{Docling,
author = {Deep Search Team},
month = {8},
title = {{Docling Technical Report}},
url = {https://arxiv.org/abs/2408.09869},
eprint = {2408.09869},
doi = {10.48550/arXiv.2408.09869},
version = {1.0.0},
year = {2024}
}