Model produces arbitrary output

#7
by rene-hof - opened

I've collected a dataset with celebrities and ran it with different models.
It runs fine with "llava-hf/llava-1.5-7b-hf", instructBlip and CogVLM.
However, with this model, the output is just arbitrary. If you set temperature to 0.1 it always outputs Abraham Lincoln, no matter what image.
If you set it to 0.7 as suggested in the model card, it varies a lot, but there is no connection between the output and the pictures. Not even woman and man are correct, nor black and white people. Something went completely wrong here.

Sign up or log in to comment