Update README.md
README.md
CHANGED
@@ -6,19 +6,18 @@ datasets:
 pipeline_tag: image-segmentation
 ---
 
-#
+# DETR-layout-detection
 
 We present the model cmarkea/detr-layout-detection, which allows extracting different layouts (Text, Picture, Caption, Footnote, etc.) from an image of a document.
 This is a fine-tuning of the model [detr-resnet-50](https://huggingface.co/facebook/detr-resnet-50) on the [DocLayNet](https://huggingface.co/datasets/ds4sd/DocLayNet)
 dataset. This model can jointly predict masks and bounding boxes for documentary objects. It is ideal for processing documentary corpora to be ingested into an
 ODQA system.
 
-
+This model allows extracting 11 entities, which are: Caption, Footnote, Formula, List-item, Page-footer, Page-header, Picture, Section-header, Table, Text, and Title.
 
-
+## Performance
 
-
-### Direct Use
+## Direct Use
 
 ```python
 from transformers import AutoImageProcessor