cmarkea
/

detr-layout-detection

Image Segmentation

Inference Endpoints

Model card Files Files and versions Community

Cyrile commited on Aug 14

Commit

f20b6ed

•

1 Parent(s): c0dd7a9

Update README.md

Files changed (1) hide show

README.md +4 -5

README.md CHANGED Viewed

@@ -6,19 +6,18 @@ datasets:
 pipeline_tag: image-segmentation
 ---
-# Model Card for Model ID
 We present the model cmarkea/detr-layout-detection, which allows extracting different layouts (Text, Picture, Caption, Footnote, etc.) from an image of a document.
 This is a fine-tuning of the model [detr-resnet-50](https://huggingface.co/facebook/detr-resnet-50) on the [DocLayNet](https://huggingface.co/datasets/ds4sd/DocLayNet)
 dataset. This model can jointly predict masks and bounding boxes for documentary objects. It is ideal for processing documentary corpora to be ingested into an
 ODQA system.
-## Model Details
-### Model Description
-### Direct Use
 ```python
 from transformers import AutoImageProcessor

 pipeline_tag: image-segmentation
 ---
+# DETR-layout-detection
 We present the model cmarkea/detr-layout-detection, which allows extracting different layouts (Text, Picture, Caption, Footnote, etc.) from an image of a document.
 This is a fine-tuning of the model [detr-resnet-50](https://huggingface.co/facebook/detr-resnet-50) on the [DocLayNet](https://huggingface.co/datasets/ds4sd/DocLayNet)
 dataset. This model can jointly predict masks and bounding boxes for documentary objects. It is ideal for processing documentary corpora to be ingested into an
 ODQA system.
+This model allows extracting 11 entities, which are: Caption, Footnote, Formula, List-item, Page-footer, Page-header, Picture, Section-header, Table, Text, and Title.
+## Performance
+## Direct Use
 ```python
 from transformers import AutoImageProcessor