fmajer commited on
Commit
bceb2fe
·
1 Parent(s): e306dfa

description updated

Browse files
Files changed (1) hide show
  1. app.py +3 -3
app.py CHANGED
@@ -60,7 +60,7 @@ description = """
60
  Gradio demo for an object detection architecture, introduced in my bachelor thesis (link will be added).
61
  \n\n
62
  You can use this architecture to detect objects using textual queries. To use it, simply upload an image and enter any query you want.
63
- It can be a single word or a sentence. The model is trained to recognize only 80 categories from the COCO Detection 2017 dataset.
64
  Refer to <a href="https://tech.amikelive.com/node-718/what-object-categories-labels-are-in-coco-dataset/">this</a> website
65
  or the original <a href="https://arxiv.org/pdf/1405.0312.pdf">COCO</a> paper to see the full list of categories.
66
  \n\n
@@ -90,11 +90,11 @@ Best results are obtained using one of these sentences, which were used during t
90
  </div>
91
  </div>
92
  \n\n
93
- When the binarize option is turned off, model will output propabilities of requested {class} for each patch. When binarize option is turned on
94
  the model will binarize each propability based on set eval_threshold.
95
  \n\n
96
  Each input image is transformed to size 224x224 so it can be processed by ViT. During this transformation, different
97
- crop_modes and crop_percentages can be selected. No image is lost if crop_pct = 1.0. The model was trained using crop_mode='center' and crop_pct = 0.9.
98
  For explanation of different crop_modes, please refer to
99
  <a href="https://github.com/huggingface/pytorch-image-models/blob/main/timm/data/transforms_factory.py">this</a> website, lines 155-172.
100
  """
 
60
  Gradio demo for an object detection architecture, introduced in my bachelor thesis (link will be added).
61
  \n\n
62
  You can use this architecture to detect objects using textual queries. To use it, simply upload an image and enter any query you want.
63
+ It can be a single word or a sentence. The model is trained to recognize only 80 categories (classes) from the COCO Detection 2017 dataset.
64
  Refer to <a href="https://tech.amikelive.com/node-718/what-object-categories-labels-are-in-coco-dataset/">this</a> website
65
  or the original <a href="https://arxiv.org/pdf/1405.0312.pdf">COCO</a> paper to see the full list of categories.
66
  \n\n
 
90
  </div>
91
  </div>
92
  \n\n
93
+ When the binarize option is turned off, model will output propabilities of requested {class} for each patch. When the binarize option is turned on
94
  the model will binarize each propability based on set eval_threshold.
95
  \n\n
96
  Each input image is transformed to size 224x224 so it can be processed by ViT. During this transformation, different
97
+ crop_modes and crop_percentages can be selected. No image is lost if crop_pct = 1.0 and crop_mode='squash' or 'border'. The model was trained using crop_mode='center' and crop_pct = 0.9.
98
  For explanation of different crop_modes, please refer to
99
  <a href="https://github.com/huggingface/pytorch-image-models/blob/main/timm/data/transforms_factory.py">this</a> website, lines 155-172.
100
  """