Output labels different from labels in input prompt.

#9
by jcorsetti - opened

Hello, after some experiments it seems that GroundingDino will output different labels from the ones provided in input. I tried with the following prompt: "s = "a chest of drawers. a door. a bed.", thus I expected GroundingDino to find only "chest of drawers", "door" or "bed". Instead one of the output labels is just "a chest". It seems that the first label I provided got truncated. Is this an expected behaviour?

Yes, when you use .post_processing_grounded_object_detection from GroundingDinoProcessor it uses the text_threshold to select the tokens. We could probably return both the original prompt and the thresholded prompt though, feel free to open an issue in the transformers repo

Sign up or log in to comment